Publication | Closed Access
Tandem connectionist feature extraction for conventional HMM systems
693
Citations
10
References
2002
Year
Unknown Venue
Subword UnitsEngineeringMachine LearningGaussian Mixture ModelsSpoken Language ProcessingSpeech RecognitionNatural Language ProcessingPattern RecognitionRobust Speech RecognitionLanguage StudiesGaussian-mixture Distribution ModelingComputer ScienceDeep LearningDistant Speech RecognitionSignal ProcessingSpeech CommunicationConventional Hmm SystemsMulti-speaker Speech RecognitionSpeech ProcessingSpeech InputLinguistics
Hidden Markov model speech recognition systems typically use Gaussian mixture models to estimate the distributions of decorrelated acoustic feature vectors that correspond to individual subword units. By contrast, hybrid connectionist-HMM systems use discriminatively-trained neural networks to estimate the probability distribution among subword units given the acoustic observations. In this work we show a large improvement in word recognition performance by combining neural-net discriminative feature processing with Gaussian-mixture distribution modeling. By training the network to generate the subword probability posteriors, then using transformations of these estimates as the base features for a conventionally-trained Gaussian-mixture based system, we achieve relative error rate reductions of 35% or more on the multicondition Aurora noisy continuous digits task.
| Year | Citations | |
|---|---|---|
Page 1
Page 1