Adding articulatory features to acoustic features for automatic speech recognition

Zlokarnik, I

doi:10.1121/1.411699

Adding articulatory features to acoustic features for automatic speech recognition

Journal Article · Mon May 01 00:00:00 EDT 1995 · Journal of the Acoustical Society of America

DOI:https://doi.org/10.1121/1.411699· OSTI ID:44505

Zlokarnik, I ^[1]

Los Alamos Natl. Lab., CIC-3, MS B256, Los Alamos, NM 87545 (United States)

A hidden-Markov-model (HMM) based speech recognition system was evaluated that makes use of simultaneously recorded acoustic and articulatory data. The articulatory measurements were gathered by means of electromagnetic articulography and describe the movement of small coils fixed to the speakers` tongue and jaw during the production of German V{sub 1}CV{sub 2} sequences [P. Hoole and S. Gfoerer, J. Acoust. Soc. Am. Suppl. 1 {bold 87}, S123 (1990)]. Using the coordinates of the coil positions as an articulatory representation, acoustic and articulatory features were combined to make up an acoustic--articulatory feature vector. The discriminant power of this combined representation was evaluated for two subjects on a speaker-dependent isolated word recognition task. When the articulatory measurements were used both for training and testing the HMMs, the articulatory representation was capable of reducing the error rate of comparable acoustic-based HMMs by a relative percentage of more than 60%. In a separate experiment, the articulatory movements during the testing phase were estimated using a multilayer perceptron that performed an acoustic-to-articulatory mapping. Under these more realistic conditions, when articulatory measurements are only available during the training, the error rate could be reduced by a relative percentage of 18% to 25%.

Sponsoring Organization:: USDOE

OSTI ID:: 44505

Journal Information:: Journal of the Acoustical Society of America, Journal Name: Journal of the Acoustical Society of America Journal Issue: 5 Vol. 97; ISSN 0001-4966; ISSN JASMAN

Country of Publication:: United States

Language:: English

Similar Records

An articulatorily constrained, maximum entropy approach to speech recognition and speech coding

Technical Report · Mon Dec 30 23:00:00 EST 1996 · OSTI ID:432946

Improving on hidden Markov models: An articulatorily constrained, maximum likelihood approach to speech recognition and speech coding

Technical Report · Mon Nov 04 23:00:00 EST 1996 · OSTI ID:431136

Accurate recovery of articulator positions from acoustics: New conclusions based on human data

Journal Article · Sun Sep 01 00:00:00 EDT 1996 · Journal of the Acoustical Society of America · OSTI ID:286916

Related Subjects

66 PHYSICS
ERRORS
MARKOV PROCESS
SPEECH

Adding articulatory features to acoustic features for automatic speech recognition

Citation Formats

Similar Records

Related Subjects