Publication | Closed Access
Context-independent phonetic hidden Markov models for speaker-independent continuous speech recognition
154
Citations
28
References
1990
Year
Speech SciencesEngineeringGeneralized TriphonesSpoken Language ProcessingVoice EvaluationSpeech RecognitionNatural Language ProcessingPhoneticsComputational LinguisticsRobust Speech RecognitionVoice RecognitionAcoustic AnalysisSpoken Language UnderstandingHealth SciencesContext-dependent Phone ModelsSignal ProcessingSpeech CommunicationSpeech TechnologyVoiceMulti-speaker Speech RecognitionSpeech AcousticsContext-dependent ModelsSpeech ProcessingSpeech InputSpeech PerceptionLinguisticsSpeaker Recognition
Context-dependent phone models are applied to speaker-independent continuous speech recognition and shown to be effective in this domain. Several previously proposed context-dependent models are evaluated, and two new context-dependent phonetic units are introduced: function-word-dependent phone models, which focus on the most difficult subvocabulary; and generalized triphones, which combine similar triphones on the basis of an information-theoretic measure. The subword clustering procedure used for generalized triphones can find the optimal number of models, given a fixed amount of training data. It is shown that context-dependent modeling reduces the error rate by as much as 60%.< <ETX xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">></ETX>
| Year | Citations | |
|---|---|---|
Page 1
Page 1