skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Adding articulatory features to acoustic features for automatic speech recognition

Abstract

A hidden-Markov-model (HMM) based speech recognition system was evaluated that makes use of simultaneously recorded acoustic and articulatory data. The articulatory measurements were gathered by means of electromagnetic articulography and describe the movement of small coils fixed to the speakers` tongue and jaw during the production of German V{sub 1}CV{sub 2} sequences [P. Hoole and S. Gfoerer, J. Acoust. Soc. Am. Suppl. 1 {bold 87}, S123 (1990)]. Using the coordinates of the coil positions as an articulatory representation, acoustic and articulatory features were combined to make up an acoustic--articulatory feature vector. The discriminant power of this combined representation was evaluated for two subjects on a speaker-dependent isolated word recognition task. When the articulatory measurements were used both for training and testing the HMMs, the articulatory representation was capable of reducing the error rate of comparable acoustic-based HMMs by a relative percentage of more than 60%. In a separate experiment, the articulatory movements during the testing phase were estimated using a multilayer perceptron that performed an acoustic-to-articulatory mapping. Under these more realistic conditions, when articulatory measurements are only available during the training, the error rate could be reduced by a relative percentage of 18% to 25%.

Authors:
 [1]
  1. Los Alamos Natl. Lab., CIC-3, MS B256, Los Alamos, NM 87545 (United States)
Publication Date:
Sponsoring Org.:
USDOE
OSTI Identifier:
44505
Resource Type:
Journal Article
Journal Name:
Journal of the Acoustical Society of America
Additional Journal Information:
Journal Volume: 97; Journal Issue: 5; Other Information: PBD: May 1995
Country of Publication:
United States
Language:
English
Subject:
66 PHYSICS; SPEECH; MARKOV PROCESS; ERRORS

Citation Formats

Zlokarnik, I. Adding articulatory features to acoustic features for automatic speech recognition. United States: N. p., 1995. Web. doi:10.1121/1.411699.
Zlokarnik, I. Adding articulatory features to acoustic features for automatic speech recognition. United States. https://doi.org/10.1121/1.411699
Zlokarnik, I. 1995. "Adding articulatory features to acoustic features for automatic speech recognition". United States. https://doi.org/10.1121/1.411699.
@article{osti_44505,
title = {Adding articulatory features to acoustic features for automatic speech recognition},
author = {Zlokarnik, I},
abstractNote = {A hidden-Markov-model (HMM) based speech recognition system was evaluated that makes use of simultaneously recorded acoustic and articulatory data. The articulatory measurements were gathered by means of electromagnetic articulography and describe the movement of small coils fixed to the speakers` tongue and jaw during the production of German V{sub 1}CV{sub 2} sequences [P. Hoole and S. Gfoerer, J. Acoust. Soc. Am. Suppl. 1 {bold 87}, S123 (1990)]. Using the coordinates of the coil positions as an articulatory representation, acoustic and articulatory features were combined to make up an acoustic--articulatory feature vector. The discriminant power of this combined representation was evaluated for two subjects on a speaker-dependent isolated word recognition task. When the articulatory measurements were used both for training and testing the HMMs, the articulatory representation was capable of reducing the error rate of comparable acoustic-based HMMs by a relative percentage of more than 60%. In a separate experiment, the articulatory movements during the testing phase were estimated using a multilayer perceptron that performed an acoustic-to-articulatory mapping. Under these more realistic conditions, when articulatory measurements are only available during the training, the error rate could be reduced by a relative percentage of 18% to 25%.},
doi = {10.1121/1.411699},
url = {https://www.osti.gov/biblio/44505}, journal = {Journal of the Acoustical Society of America},
number = 5,
volume = 97,
place = {United States},
year = {Mon May 01 00:00:00 EDT 1995},
month = {Mon May 01 00:00:00 EDT 1995}
}