Publication | Closed Access
Building a highly accurate Mandarin speech recognizer
21
Citations
19
References
2007
Year
Unknown Venue
Darpa Gale 2006Machine LearningEngineeringSpoken Language ProcessingCorpus LinguisticsSpeech RecognitionNatural Language ProcessingCross AdaptationData ScienceComputational LinguisticsRobust Speech RecognitionVoice RecognitionLanguage StudiesMachine TranslationCharacter Error RateSpeech CommunicationMulti-speaker Speech RecognitionSpeech ProcessingSpeech InputSpeech PerceptionLinguistics
We describe a highly accurate large-vocabulary continuous Mandarin speech recognizer, a collaborative effort among four research organizations. Particularly, we build two acoustic models (AMs) with significant differences but similar accuracy for the purposes of cross adaptation and system combination. This paper elaborates on the main differences between the two systems, where one recognizer incorporates a discriminatively trained feature while the other utilizes a discriminative feature transformation. Additionally we present an improved acoustic segmentation algorithm and topic-based language model (LM) adaptation. Coupled with increased acoustic training data, we reduced the character error rate (CER) of the DARPA GALE 2006 evaluation set to 15.3% from 18.4%.
| Year | Citations | |
|---|---|---|
Page 1
Page 1