Publication | Closed Access
MATBN: A Mandarin Chinese Broadcast News Corpus
119
Citations
7
References
2005
Year
Unknown Venue
EngineeringSpeech CorpusCorpus LinguisticsJournalismText MiningSpeech RecognitionNatural Language ProcessingData ScienceComputational LinguisticsMatbn Mandarin ChineseNews AnalyticsVoice RecognitionLanguage StudiesMandarin LanguageNews SemanticsMachine TranslationSpeech CommunicationSpeech AnalysisNews CorpusSpeech ProcessingSpeech InputSpeech PerceptionLinguistics
The MATBN Mandarin Chinese broadcast news corpus contains a total of 198 hours of broadcast news from the Public Television Service Foundation (Taiwan) with corresponding transcripts. The primary purpose of this collection is to provide training and testing data for continuous speech recognition evaluation in the broadcast news domain. In this paper, we briefly introduce. the speech corpus and report on some preliminary statistical analysis and speech recognition evaluation results.
| Year | Citations | |
|---|---|---|
Page 1
Page 1