| | |
Summary: THE RELATION BETWEEN SPEECH SEGMENT SELECTIVITY AND SOURCE
LOCALIZATION ACCURACY
Parham Aarabi, Alborz Mahdavi
The Edward S. Rogers Sr. Department of Electrical and Computer Engineering
University of Toronto
10 Kings College Road
Ontario, Canada, M5S 3G4
ABSTRACT
An experimental analysis of the relation between speech
signal segment power and the source directionofarrival es
timation accuracy is conducted. A total of 10 different speak
ers, including both male and female speakers, totaling to
approximately 2 hours of speech are used to analyze the
performance of the Phase Transform, the Maximum Likeli
hood, and the Unfiltered Cross Correlation timedelay esti
mation techniques. For female speakers, it is determined
that the Phase Transform technique has a lower percent
age of anomalies and a lower directionofarrival root mean
square error (DOA RMSE). Conversely, for male speakers,
it is determined that the Unfiltered Cross Correlation has a
|