Start/End Delays of Voiced and Unvoiced Speech Signals

Herrnstein, A

doi:10.2172/15006006

Title: Start/End Delays of Voiced and Unvoiced Speech Signals

Technical Report · Fri Sep 24 00:00:00 EDT 1999

DOI:https://doi.org/10.2172/15006006· OSTI ID:15006006

Herrnstein, A

Recent experiments using low power EM-radar like sensors (e.g, GEMs) have demonstrated a new method for measuring vocal fold activity and the onset times of voiced speech, as vocal fold contact begins to take place. Similarly the end time of a voiced speech segment can be measured. Secondly it appears that in most normal uses of American English speech, unvoiced-speech segments directly precede or directly follow voiced-speech segments. For many applications, it is useful to know typical duration times of these unvoiced speech segments. A corpus, assembled earlier of spoken ''Timit'' words, phrases, and sentences and recorded using simultaneously measured acoustic and EM-sensor glottal signals, from 16 male speakers, was used for this study. By inspecting the onset (or end) of unvoiced speech, using the acoustic signal, and the onset (or end) of voiced speech using the EM sensor signal, the average duration times for unvoiced segments preceding onset of vocalization were found to be 300ms, and for following segments, 500ms. An unvoiced speech period is then defined in time, first by using the onset of the EM-sensed glottal signal, as the onset-time marker for the voiced speech segment and end marker for the unvoiced segment. Then, by subtracting 300ms from the onset time mark of voicing, the unvoiced speech segment start time is found. Similarly, the times for a following unvoiced speech segment can be found. While data of this nature have proven to be useful for work in our laboratory, a great deal of additional work remains to validate such data for use with general populations of users. These procedures have been useful for applying optimal processing algorithms over time segments of unvoiced, voiced, and non-speech acoustic signals. For example, these data appear to be of use in speaker validation, in vocoding, and in denoising algorithms.

View Technical Report

Cite

Export

Save

Research Organization:: Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)

Sponsoring Organization:: US Department of Energy (US)

DOE Contract Number:: W-7405-ENG-48

OSTI ID:: 15006006

Report Number(s):: UCRL-TR-155600; TRN: US200402%%296

Resource Relation:: Other Information: PBD: 24 Sep 1999

Country of Publication:: United States

Language:: English

Similar Records

Denoising of human speech using combined acoustic and em sensor signal processing

Conference · Mon Nov 29 00:00:00 EST 1999 · OSTI ID:15006006

Ng, L C; Burnett, G C; Holzrichter, J F; +1 more

System and method for characterizing voiced excitations of speech and acoustic signals, removing acoustic noise from speech, and synthesizing speech

Patent · Tue Aug 08 00:00:00 EDT 2006 · OSTI ID:15006006

Burnett, Greg C; Holzrichter, John F; Ng, Lawrence C

System And Method For Characterizing Voiced Excitations Of Speech And Acoustic Signals, Removing Acoustic Noise From Speech, And Synthesizi

Patent · Tue Apr 25 00:00:00 EDT 2006 · OSTI ID:15006006

Burnett, Greg C; Holzrichter, John F; Ng, Lawrence C

Related Subjects

99 GENERAL AND MISCELLANEOUS//MATHEMATICS, COMPUTING, AND INFORMATION SCIENCE
ACOUSTICS
ALGORITHMS
MALES
PROCESSING
SPEECH
VALIDATION

Title: Start/End Delays of Voiced and Unvoiced Speech Signals

Citation Formats

Similar Records

Related Subjects