Publication | Closed Access
Robust feature space adaptation for telephony speech recognition
28
Citations
6
References
2006
Year
Unknown Venue
EngineeringMachine LearningOnline Fmllr AdaptationSpeech RecognitionData SciencePattern RecognitionTelephony Speech RecognitionRobust Speech RecognitionVoice RecognitionHealth SciencesComputer ScienceDistant Speech RecognitionSignal ProcessingSpeech CommunicationFeature Transform EstimationFeature Space MaximumSpeech ProcessingSpeech InputSpeech PerceptionSpeaker Recognition
Speaker adaptation is critical for modern speech recognition systems. Due to the computational and multi-channel model sharing considerations, the use of model adaptation techniques is limited in telephony speech recognition systems. On the other hand, feature space adaptation methods such as feature space maximum likelihood linear regression (fMLLR) are efficient approaches suitable for telephony systems. In this work, we first describe techniques for efficient implementation of online fMLLR adaptation. Then feature space maximum a posteriori linear regression (fMAPLR) is proposed to incorporate prior knowledge for the feature transform estimation and improve the robustness of the conventional fMLLR approach. Experiments on telephony data indicate that fMAPLR is significantly more robust than fMLLR, and outperforms fMLLR especially when the adaptation data is very limited.
| Year | Citations | |
|---|---|---|
Page 1
Page 1