| | |
Summary: ROBUST DIGIT RECOGNITION USING PHASEDEPENDENT TIME
FREQUENCY MASKING
Guangji Shi Parham Aarabi
Department of Elec. and Comp. Engineering, University of Toronto, Toronto, Ontario, Canada
guangji@comm.utoronto.ca parham@ecf.utoronto.ca
ABSTRACT
A technique using the timefrequency phase information
of two microphones is proposed to estimate an ideal time
frequency mask using timedelayofarrival (TDOA) of
the signal of interest. At a signaltonoise ratio (SNR) of
0dB, the proposed technique using two microphones
achieves a digit recognition rate (average over 5 speakers,
each speaking 2030 digits) of 71%. In contrast, delay
andsum beamforming only achieves a 40% recognition
rate with two microphones and 60% with four
microphones. Superdirective beamforming achieves a
44% recognition rate with two microphones and 65%
with four microphones.
1. INTRODUCTION
In various applications such as speech recognition and
|