Publication | Closed Access
Large-vocabulary speaker-independent continuous speech recognition using HMM
95
Citations
10
References
2003
Year
Unknown Venue
Speech SciencesEngineeringSpoken Language ProcessingWord AccuracyLanguage ProcessingSpeech RecognitionNatural Language ProcessingComputational LinguisticsRobust Speech RecognitionAutomatic RecognitionVoice RecognitionSpoken Language UnderstandingHealth SciencesLinguisticsComputer ScienceDistant Speech RecognitionSpeech CommunicationBigram GrammarMulti-speaker Speech RecognitionSpeech AcousticsSpeech ProcessingSpeech InputSpeech PerceptionHidden Markov ModelsSpeech Interface
SPHINX, the first large-vocabulary speaker-independent continuous-speech recognizer is described. SPHINX is a hidden-Markov-model (HMM)-based recognizer using multiple codebooks of various LPC-derived features. Two types of HMMs are used in SPHINX: context-independent phone models and function-word-dependent phone models. On a 997-word task using a bigram grammar, SPHINX achieved a word accuracy of 93%. This demonstrates the feasibility of speaker-independent continuous-speech recognition, and the appropriateness of hidden Markov models for such a task.< <ETX xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">></ETX>
| Year | Citations | |
|---|---|---|
Page 1
Page 1