Publication | Closed Access
Discriminative speaker adaptation with conditional maximum likelihood linear regression
63
Citations
10
References
2001
Year
Unknown Venue
EngineeringMachine LearningSpoken Language ProcessingSpeech RecognitionNatural Language ProcessingData ScienceSpeaker DiarizationRobust Speech RecognitionVoice RecognitionStatisticsMachine TranslationHealth SciencesContinuous Emission DensitySpeech TechnologySpeech CommunicationMulti-speaker Speech RecognitionSimplified DerivationSpeech ProcessingStatistical InferenceSpeaker RecognitionMaximum Mutual InformationSpeech PerceptionLinguisticsDiscriminative Speaker Adaptation
We present a simplified derivation of the extended Baum-Welch procedure, which shows that it can be used for Maximum Mutual Information (MMI) of a large class of continuous emission density hidden Markov models (HMMs). We use the extended Baum-Welch procedure for discriminative estimation of MLLR-type speaker adaptation transformations. The resulting adaptation procedure, termed Conditional Maximum Likelihood Linear Regression (CMLLR), is used successfully for supervised and unsupervised adaptation tasks on the Switchboard corpus, yielding an improvement over MLLR. The interaction of unsupervised CMLLR with segmental minimum Bayes risk lattice voting procedures is also explored, showing that the two procedures are complimentary.
| Year | Citations | |
|---|---|---|
Page 1
Page 1