| | |
Summary: THE RELATION BETWEEN SPEECH SEGMENT SELECTIVITY AND SOURCE
LOCALIZATION ACCURACY
Parham Aarabi, Alborz Mahdavi
The Edward S. Rogers Sr. Department of Electrical and Computer Engineering
University of Toronto
10 Kings College Road
Ontario, Canada, M5S 3G4
ABSTRACT
An experimental analysis of the relation between speech
signal segment power and the source direction-of-arrival es-
timation accuracy is conducted. A total of 10 different speak-
ers, including both male and female speakers, totaling to
approximately 2 hours of speech are used to analyze the
performance of the Phase Transform, the Maximum Likeli-
hood, and the Unfiltered Cross Correlation time-delay esti-
mation techniques. For female speakers, it is determined
that the Phase Transform technique has a lower percent-
age of anomalies and a lower direction-of-arrivalroot mean-
square error (DOA RMSE). Conversely, for male speakers,
it is determined that the Unfiltered Cross Correlation has a
|