Publication | Closed Access
Multi-band speech recognition in noisy environments
Citations: 126
References: 9
Year: 2002
Venue: Unknown
Engineering, Conventional ASR, Speech Recognition, Pattern Recognition, Noise, Robust Speech Recognition, Multi-band ASR, Voice Recognition, Health Sciences, Noisy Speech, Multi-channel Processing, Distant Speech Recognition, Signal Processing, Speech Communication, Multi-speaker Speech Recognition, Multi-band Speech Recognition, Speech Processing, Speech Perception, Speaker Recognition
This paper presents a new approach to multi-band automatic speech recognition (ASR). Previous work by Bourlard et al. (see Proc. Int. Conf. on Spoken Language Processing, Philadelphia, pp. 426-429, 1996) and Hermansky et al. (see Proc. Int. Conf. on Spoken Language Processing, Philadelphia, pp. 1579-1582, 1996) suggests that multi-band ASR, which combines the likelihoods computed from different frequency bands, gives more accurate recognition, especially in noisy acoustic environments. Here we evaluate this likelihood recombination (LC) approach to multi-band ASR and propose an alternative method, feature recombination (FC). In the FC system, a separate acoustic analyzer is applied to each sub-band, and the resulting sub-band features are combined into a single vector. The speech classifier then calculates the likelihood from this single vector. Thus, band-limited noise affects only a few of the feature components, as in the multi-band LC system, while all feature components are jointly modeled, as in conventional ASR. The experimental results show that, for noisy speech, the FC system can yield better performance than both conventional ASR and the LC strategy.
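The difference between the two recombination strategies can be illustrated with a minimal sketch. It is not the paper's implementation: the single full-covariance Gaussian per class, the equal band weights in the LC sum, and the names `gaussian_loglik`, `lc_score`, and `fc_score` are all assumptions made for illustration, and a real recognizer would presumably score frames with a more elaborate acoustic model.

```python
import numpy as np


def gaussian_loglik(x, mean, cov):
    """Log-density of a multivariate Gaussian with a full covariance matrix."""
    d = x.shape[0]
    diff = x - mean
    _, logdet = np.linalg.slogdet(cov)
    maha = diff @ np.linalg.solve(cov, diff)
    return -0.5 * (d * np.log(2.0 * np.pi) + logdet + maha)


def lc_score(band_feats, band_means, band_covs):
    """Likelihood recombination (LC): score each sub-band with its own model,
    then add the per-band log-likelihoods (equal weights are an assumption)."""
    return sum(
        gaussian_loglik(x, m, c)
        for x, m, c in zip(band_feats, band_means, band_covs)
    )


def fc_score(band_feats, joint_mean, joint_cov):
    """Feature recombination (FC): concatenate the sub-band features into one
    vector and score it once with a single joint model."""
    x = np.concatenate(band_feats)
    return gaussian_loglik(x, joint_mean, joint_cov)


# Hypothetical two-band example: 3 features in band 0, 2 features in band 1.
rng = np.random.default_rng(0)
feats = [rng.normal(size=3), rng.normal(size=2)]
means = [np.zeros(3), np.zeros(2)]
covs = [2.0 * np.eye(3), 0.5 * np.eye(2)]

# A block-diagonal joint covariance models the bands as independent.
independent_cov = np.zeros((5, 5))
independent_cov[:3, :3] = covs[0]
independent_cov[3:, 3:] = covs[1]

# Adding a cross-band covariance term makes the joint model couple the bands,
# which the per-band LC models cannot represent.
coupled_cov = independent_cov.copy()
coupled_cov[0, 3] = coupled_cov[3, 0] = 0.3

lc = lc_score(feats, means, covs)
fc_independent = fc_score(feats, np.concatenate(means), independent_cov)
fc_coupled = fc_score(feats, np.concatenate(means), coupled_cov)
```

Under these assumptions, `fc_independent` matches `lc` up to floating-point error, because a block-diagonal joint model treats the bands as independent. `fc_coupled` differs because the joint model also captures correlation across bands, which is the sense in which FC models all feature components jointly.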