Publication | Closed Access
Speaker normalization using efficient frequency warping procedures
262
Citations
0
References
2002
Year
Unknown Venue
EngineeringBiometricsSpeaker NormalizationPhonologySpeech RecognitionPattern RecognitionPhoneticsSpeaker DiarizationRobust Speech RecognitionLinear FrequencyVoice RecognitionLanguage StudiesMaximum LikelihoodFrequency WarpingDistant Speech RecognitionSpeech CommunicationSpeech TechnologySpeech ProcessingSpeech PerceptionSpeaker Recognition
In an effort to reduce the degradation in speech recognition performance caused by variation in vocal tract shape among speakers, a frequency warping approach to speaker normalization is investigated. A set of low complexity, maximum likelihood based frequency warping procedures have been applied to speaker normalization for a telephone based connected digit recognition task. This paper presents an efficient means for estimating a linear frequency warping factor and a simple mechanism for implementing frequency warping by modifying the filter-bank in mel-frequency cepstrum feature analysis. An experimental study comparing these techniques to other well-known techniques for reducing variability is described. The results showed that frequency warping was consistently able to reduce word error rate by 20% even for very short utterances.