Publication | Closed Access
Speaker adaptation using combined transformation and Bayesian methods
100
Citations
17
References
1996
Year
EngineeringMachine LearningSpoken Language ProcessingPhonologyCorpus LinguisticsSpeech RecognitionNatural Language ProcessingBayesian TechniquesSpeaker DiarizationRobust Speech RecognitionBayesian MethodsVoice RecognitionLanguage StudiesGaussian Mixture DensitiesComponent DensitiesSpeech CommunicationMulti-speaker Speech RecognitionSpeech ProcessingStatistical InferenceSpeech PerceptionLinguisticsSpeaker Recognition
Adapting the parameters of a statistical speaker independent continuous-speech recognizer to the speaker and the channel can significantly improve the recognition performance and robustness of the system. In continuous mixture-density hidden Markov models the number of component densities is typically very large, and it may not be feasible to acquire a sufficient amount of adaptation data for robust maximum-likelihood estimates. To solve this problem, we have recently proposed a constrained estimation technique for Gaussian mixture densities. To improve the behavior of our adaptation scheme for large amounts of adaptation data, we combine it here with Bayesian techniques. We evaluate our algorithms on the large-vocabulary Wall Street Journal corpus for nonnative speakers of American English. The recognition error rate is approximately halved with only a small amount of adaptation data, and it approaches the speaker-independent accuracy achieved for native speakers.
| Year | Citations | |
|---|---|---|
Page 1
Page 1