Publication | Closed Access
Continuous optical automatic speech recognition by lipreading
Citations: 70
References: 11
Year: 2002
Unknown Venue
Optical Recognition, Speech Sciences, Machine Learning, Percent Recognition, Engineering, Spoken Language Processing, Speech Recognition, Natural Language Processing, Pattern Recognition, Robust Speech Recognition, Automatic Recognition, Health Sciences, Computer Science, Signal Processing, Speech Communication, Speech Technology, Voice, Speech Acoustics, Speech Processing, Speech Input, Speech Perception, Hidden Markov Models, Linguistics
We describe a continuous optical automatic speech recognizer (OASR) that uses optical information from the oral-cavity shadow of a speaker. The system achieves 25.3 percent recognition on sentences having a perplexity of 150 without using any syntactic, semantic, acoustic, or contextual guides. We introduce 13 mostly dynamic oral-cavity features used for optical recognition, present phones that appear optically similar (visemes) for our speaker, and present the recognition results for our hidden Markov models (HMMs) using visemes, trisemes, and generalized trisemes. We conclude that future research is warranted for optical recognition, especially when combined with other input modalities.
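To make the viseme and triseme terminology concrete, the following is a minimal sketch of how a phone sequence might be collapsed into visemes and then expanded into context-dependent trisemes (a viseme together with its left and right neighbours, by analogy with triphones). The phone-to-viseme table, the class names, and the `sil` boundary symbol are hypothetical illustrations; the abstract does not list the viseme classes found for the speaker, and this is not the authors' implementation.

```python
# Hypothetical phone-to-viseme table (an assumption for illustration only;
# the paper derives its own speaker-specific classes, not reproduced here).
PHONE_TO_VISEME = {
    "p": "V_bilabial", "b": "V_bilabial", "m": "V_bilabial",
    "f": "V_labiodental", "v": "V_labiodental",
    "aa": "V_open", "ae": "V_open",
    "iy": "V_spread", "ih": "V_spread",
}


def to_visemes(phones):
    """Map each phone to its (assumed) viseme class."""
    return [PHONE_TO_VISEME[p] for p in phones]


def to_trisemes(visemes, boundary="sil"):
    """Return (left, center, right) viseme triples, padding both ends
    with a boundary symbol so every viseme has two neighbours."""
    padded = [boundary] + list(visemes) + [boundary]
    return [
        (padded[i - 1], padded[i], padded[i + 1])
        for i in range(1, len(padded) - 1)
    ]


if __name__ == "__main__":
    # "b ae m" and "p ae b" differ acoustically but share one viseme
    # sequence under this table, which is the ambiguity visemes capture.
    print(to_visemes(["b", "ae", "m"]))
    print(to_visemes(["p", "ae", "b"]))
    print(to_trisemes(to_visemes(["b", "ae", "m"])))
```

Under this assumed table, phones that look alike on the lips map to the same unit, so distinct words can share a viseme sequence; modeling each viseme in its neighbouring context is one way an HMM-based recognizer can recover some of the lost distinctions.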