Publication | Closed Access
Duration modeling in large vocabulary speech recognition
31
Citations
5
References
2002
Year
Unknown Venue
Natural Language ProcessingPhoneme DurationPhoneme Duration ModelingEngineeringHealth SciencesData ScienceSpeech CorpusCorpus LinguisticsSpeech AnalysisBaseline Recognition SystemRobust Speech RecognitionSpeech ProcessingSpoken Language ProcessingSpeech InputSpeech PerceptionLinguisticsSpeech CommunicationSpeech Recognition
This paper presents a study of different methods for phoneme duration modeling in large vocabulary speech recognition. We investigate the employment of phoneme duration and the effect of context, speaking rate and lexical stress in the duration of phoneme segments in a large vocabulary speech recognition system. The duration models are used in a postprocessing phase of BYBLOS, our baseline HMM-based recognition system, to rescore the N-Best hypotheses. We describe experiments with the 5 K word ARPA Wall Street Journal (WSJ) corpus. The results show that integration of duration models that take into account context and speaking rate can improve the word accuracy of the baseline recognition system.
| Year | Citations | |
|---|---|---|
Page 1
Page 1