Publication | Closed Access
A fast approximate acoustic match for large vocabulary speech recognition
64
Citations
11
References
1993
Year
EngineeringSpoken Language ProcessingAcoustic ModelingLanguage ProcessingSpeech RecognitionNatural Language ProcessingAcoustic Signal SegmentComputational LinguisticsRobust Speech RecognitionSpeech Signal AnalysisSpoken Language UnderstandingHealth SciencesComputer ScienceDistant Speech RecognitionSpeech CommunicationSpeech TechnologyApproximate Acoustic MatchSpeech AcousticsSpeech ProcessingSpeech InputSpeech PerceptionHidden Markov ModelsLinguistics
In a large vocabulary speech recognition system using hidden Markov models, calculating the likelihood of an acoustic signal segment for all the words in the vocabulary involves a large amount of computation. In order to run in real time on a modest amount of hardware, it is important that these detailed acoustic likelihood computations be performed only on words which have a reasonable probability of being the word that was spoken. The authors describe a scheme for rapidly obtaining an approximate acoustic match for all the words in the vocabulary in such a way as to ensure that the correct word is, with high probability, one of a small number of words examined in detail. Using fast search methods, they obtain a matching algorithm that is about a hundred times faster than doing a detailed acoustic likelihood computation on all the words in the IBM Office Correspondence isolated word dictation task, which has a vocabulary of 20000 words. Experimental results showing the effectiveness of such a fast match for a number of talkers are given.< <ETX xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">></ETX>
| Year | Citations | |
|---|---|---|
Page 1
Page 1