Publication | Closed Access
Modeling state durations in hidden Markov models for automatic speech recognition
77
Citations
4
References
1992
Year
Unknown Venue
EngineeringMachine LearningComputational ComplexitySpoken Language ProcessingSpeech RecognitionData ScienceRobust Speech RecognitionAutomatic RecognitionVoice RecognitionSpeech Signal AnalysisHealth SciencesComputer ScienceSignal ProcessingSpeech CommunicationSpeech TechnologyAutomatic Speech RecognitionVoiceSpeech AcousticsSpeech ProcessingHidden Markov ModelingSpeech InputState DurationsSpeech PerceptionHidden Markov ModelsLinguisticsTraditional Hmm
Hidden Markov modeling (HMM) techniques have been used successfully for connected speech recognition in the last several years. In the traditional HMM algorithms, the probability of duration of a state decreases exponentially with time which is not appropriate for representing the temporal structure of speech. Non-parametric modeling of duration using semi-Markov chains does accomplish the task with a large increase in the computational complexity. Applying a postprocessing state duration penalty after Viterbi decoding adds very little computation but does not affect the forward recognition path. The authors present a way of modeling state durations in HMM using time-dependent state transitions. This inhomogeneous HMM (IHMM) does increase the computation by a small amount but reduces recognition error rates by 14-25%. Also, a suboptimal implementation of this scheme that requires no more computation than the traditional HMM is presented which also has reduced errors by 14-22% on a variety of databases.< <ETX xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">></ETX>
| Year | Citations | |
|---|---|---|
Page 1
Page 1