Publication | Closed Access
Speech recognition using segmental neural nets
35
Citations
7
References
1992
Year
Unknown Venue
Speech SciencesEngineeringSpoken Language ProcessingPhonetic SegmentSpeech RecognitionNatural Language ProcessingHidden Markov ModelPhoneticsLikely Phonetic SegmentationsAutomatic RecognitionSpeech Signal AnalysisSpoken Language UnderstandingHealth SciencesComputer ScienceSpeech AcquisitionSpeech CommunicationSpeech TechnologyVoiceMulti-speaker Speech RecognitionSpeech AcousticsSpeech ProcessingSpeech InputSpeech PerceptionLinguistics
The authors present the concept of a segmental neural net (SNN) for phonetic modeling in continuous speech recognition (CSR) and demonstrate how this can be used with a multiple hypothesis (or N-Best) paradigm to combine different CSR systems. In particular, the authors developed a system that combines the SNN with a hidden Markov model (HMM) system. In a speaker-independent, 1000-word CSR test using a word-pair grammar, the error rate for the hybrid system dropped 25% from that of a state-of-the-art HMM system alone. By taking into account all the frames of a phonetic segment simultaneously, the SNN overcomes the well-known conditional-independence limitation of HMMs. The hybrid SNN/HMM system generates likely phonetic segmentations from the HMM N-best list, which are scored by the SNN. The HMM and SNN scores are then combined to optimize performance.< <ETX xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">></ETX>
| Year | Citations | |
|---|---|---|
Page 1
Page 1