Home

About

Advanced Search

Browse by Discipline

Scientific Societies

E-print Alerts

Add E-prints

E-print Network
FAQHELPSITE MAPCONTACT US


  Advanced Search  

 
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 5, NO. 5, SEPTEMBER 1997 451 A Model of Dynamic Auditory Perception and
 

Summary: IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 5, NO. 5, SEPTEMBER 1997 451
A Model of Dynamic Auditory Perception and
Its Application to Robust Word Recognition
Brian Strope and Abeer Alwan, Member, IEEE
Abstract-- This paper describes two mechanisms that aug-
ment the common automatic speech recognition (ASR) front end
and provide adaptation and isolation of local spectral peaks. A
dynamic model consisting of a linear filterbank with a novel
additive logarithmic adaptation stage after each filter output is
proposed. An extensive series of perceptual forward masking
experiments, together with previously reported forward masking
data, determine the model's dynamic parameters. Once parame-
terized, the simple exponential dynamic mechanism predicts the
nature of forward masking data from several studies across wide
ranging frequencies, input levels, and probe delay times. An
initial evaluation of the dynamic model together with a local peak
isolation mechanism as a front end for dynamic time warp (DTW)
and hidden Markov model (HMM) word recognition systems
shows an improvement in robustness to background noise when
compared to Mel-frequency cepstral coefficients (MFCC), lin-

  

Source: Alwan, Abeer - Electrical Engineering Department, University of California at Los Angeles

 

Collections: Computer Technologies and Information Sciences