Publication | Closed Access
Rapid speaker adaptation using a probabilistic spectral mapping
68
Citations
6
References
2005
Year
Unknown Venue
Natural Language ProcessingEngineeringMachine LearningMulti-speaker Speech RecognitionComputational LinguisticsSpeaker DiarizationRobust Speech RecognitionSpeech ProcessingRapid Speaker AdaptationComputer ScienceSpeech InputLanguage StudiesSpeech PerceptionHidden Markov ModelsLinguisticsSpeech CommunicationSpeaker RecognitionSpeech Recognition
This paper deals with rapid speaker adaptation for speech recognition. We introduce a new algorithm that transforms hidden Markov models of speech derived from one "prototype" speaker so that they model the speech of a new speaker. The Speaker normalization is accomplished by a probabilistic spectral mapping from one speaker to another. For a 350 word task with a grammar and using only 15 seconds of speech for normalization, the recognition accuracy is 97% averaged over 6 speakers. This accuracy would normally require over 5 minutes of speaker dependent training. We derive the probabilistic spectral transformation of HMMs, describe an algorithm to estimate the transformation, and present recognition results.
| Year | Citations | |
|---|---|---|
Page 1
Page 1