Home

About

Advanced Search

Browse by Discipline

Scientific Societies

E-print Alerts

Add E-prints

E-print Network
FAQHELPSITE MAPCONTACT US


  Advanced Search  

 
RUNDKAST: An Annotated Norwegian Broadcast News Speech Corpus Ingunn Amdal (1)
 

Summary: RUNDKAST: An Annotated Norwegian Broadcast News Speech Corpus
Ingunn Amdal (1)
, Ole Morten Strand (1, 2)
, Jørn Almberg (1)
, and Torbjørn Svendsen (1)
(1)
Department of Electronics and Telecommunications
Norwegian University of Science and Technology
NO-7049 Trondheim, Norway
(2)
Norwegian Defence Research Establishment
NO-2027 Kjeller, Norway
E-mail: {ingunn.amdal,torbjorn}@iet.ntnu.no, jorn.almberg@hf.ntnu.no, ole-morten.strand@ffi.no
Abstract
This paper describes the Norwegian broadcast news speech corpus RUNDKAST. The corpus contains recordings of approximately 77
hours of broadcast news shows from the Norwegian broadcasting company NRK. The corpus covers both read and spontaneous speech
as well as spontaneous dialogues and multipart discussions, including frequent occurrences of non-speech material (e.g. music, jingles).
The recordings have large variations in speaking styles, dialect use and recording/transmission quality. RUNDKAST has been
annotated for research in speech technology. The entire corpus has been manually segmented and transcribed using hierarchical levels.
A subset of one hour of read and spontaneous speech from 10 different speakers has been manually annotated using broad phonetic

  

Source: Amdal, Ingunn - Department of Electronics and Telecommunications, Norwegian University of Science and Technology
Svendsen, Torbjørn - Department of Electronics and Telecommunications, Norwegian University of Science and Technology

 

Collections: Computer Technologies and Information Sciences; Engineering