| | |
Summary: D
R
AFT
Log Likelihood Ratio Based Annotation Verification
of a Norwegian Speech Synthesis Database
Ingunn Amdal, Magne Hallstein Johnsen, and Torbjørn Svendsen
Department of Electronics and Telecommunications
Norwegian University of Science and Technology, N-7491 Trondheim, Norway
E-mail: {ingunn.amdal,mhj,torbjorn}@iet.ntnu.no
URL: http://www.iet.ntnu.no/projects/fonema/
ABSTRACT
Accurate labeling and segmentation of the unit inventory
database is of vital importance to the quality of unit selec-
tion text-to-speech synthesis. Misalignments and mismatch
between the predicted and pronounced unit sequences re-
quire manual correction to achieve natural sounding syn-
thesis. In this paper we have used a log likelihood ra-
tio based utterance verification to automatically detect
annotation errors in a Norwegian two-speaker synthesis
database. Each sentence is assigned a confidence score
|