ROBUST DIGIT RECOGNITION USING PHASE-DEPENDENT TIME-FREQUENCY MASKING
 

Summary: ROBUST DIGIT RECOGNITION USING PHASE-DEPENDENT TIME-FREQUENCY MASKING
Guangji Shi, Parham Aarabi
Department of Electrical and Computer Engineering, University of Toronto, Toronto, Ontario, Canada
guangji@comm.utoronto.ca, parham@ecf.utoronto.ca
ABSTRACT
A technique is proposed that uses the time-frequency phase information of
two microphones to estimate an ideal time-frequency mask from the
time-delay-of-arrival (TDOA) of the signal of interest. At a signal-to-noise
ratio (SNR) of 0 dB, the proposed technique achieves a digit recognition rate
of 71% with two microphones (averaged over 5 speakers, each speaking 20-30
digits). In contrast, delay-and-sum beamforming achieves only a 40%
recognition rate with two microphones and 60% with four microphones.
Superdirective beamforming achieves a 44% recognition rate with two
microphones and 65% with four microphones.
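
The idea in the abstract can be illustrated with a minimal sketch. The abstract does not give the mask formula, so everything specific here is an assumption made for illustration: the soft mask shape 1 / (1 + gamma * theta^2), the value of gamma, the sign convention for the delay, and the helper names phase_error, soft_mask, mask_frame, and delayed_copy. The sketch works on the spectra of a single frame and compares the observed inter-microphone phase difference in each frequency bin with the phase difference expected for the known TDOA of the signal of interest.

```python
import cmath
import math


def phase_error(x1_bin, x2_bin, freq_hz, tdoa_s):
    """Wrapped phase error (radians) for one time-frequency bin.

    Assumed convention: microphone 2 receives the target tdoa_s seconds
    after microphone 1, so X2 = X1 * exp(-j*2*pi*f*tdoa) for the target
    and X1 * conj(X2) has phase +2*pi*f*tdoa.
    """
    expected = cmath.exp(2j * math.pi * freq_hz * tdoa_s)
    return cmath.phase(x1_bin * x2_bin.conjugate() * expected.conjugate())


def soft_mask(theta, gamma=5.0):
    """Hypothetical soft mask: 1 when the phase error is 0, smaller otherwise."""
    return 1.0 / (1.0 + gamma * theta * theta)


def mask_frame(X1, X2, freqs_hz, tdoa_s, gamma=5.0):
    """Mask value for every frequency bin of one frame."""
    return [
        soft_mask(phase_error(a, b, f, tdoa_s), gamma)
        for a, b, f in zip(X1, X2, freqs_hz)
    ]


def delayed_copy(X, freqs_hz, delay_s):
    """Spectrum of the same frame as seen by a microphone delayed by delay_s."""
    return [x * cmath.exp(-2j * math.pi * f * delay_s) for x, f in zip(X, freqs_hz)]


if __name__ == "__main__":
    fs = 16000.0                                  # sampling rate in Hz (assumed)
    n_fft = 512                                   # frame length (assumed)
    freqs = [k * fs / n_fft for k in range(n_fft // 2 + 1)]
    X1 = [complex(1.0, 0.5)] * len(freqs)         # synthetic flat spectrum

    target_tdoa = 2 / fs                          # target arrives 2 samples later at mic 2
    interferer_tdoa = -3 / fs                     # interferer comes from another direction

    target_mask = mask_frame(X1, delayed_copy(X1, freqs, target_tdoa), freqs, target_tdoa)
    interf_mask = mask_frame(X1, delayed_copy(X1, freqs, interferer_tdoa), freqs, target_tdoa)

    print("mean mask, target bins:    ", sum(target_mask) / len(target_mask))
    print("mean mask, interferer bins:", sum(interf_mask) / len(interf_mask))
```

In a full system the mask would be computed for every frame of a short-time Fourier transform, multiplied into one channel's spectrum, and the result inverted back to a waveform before recognition. Bins dominated by the signal of interest keep a mask value near 1, while bins whose phase difference disagrees with the expected TDOA are attenuated.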
1. INTRODUCTION
In various applications such as speech recognition and

  

Source: Aarabi, Parham - Department of Electrical and Computer Engineering, University of Toronto

 

Collections: Computer Technologies and Information Sciences