Publication | Closed Access
Band-Independent Mask Estimation for Missing-Feature Reconstruction in the Presence of Unknown Background Noise
27
Citations
6
References
2006
Year
Unknown Venue
EngineeringMachine LearningSpeech EnhancementSpeech RecognitionFrequency BandDeblurringImage AnalysisData SciencePattern RecognitionMissing-feature ReconstructionPhoneticsSignal ReconstructionRobust Speech RecognitionVoice RecognitionMask EstimationHealth SciencesMachine VisionInverse ProblemsComputer ScienceBand-independent Mask EstimationDistant Speech RecognitionSignal ProcessingUnknown Background NoiseSpeech CommunicationComputer VisionVoiceCompressive SensingSpeech ProcessingImage RestorationSpeech PerceptionSignal SeparationWhite NoiseSpeaker Recognition
An effective mask estimation scheme for missing-feature reconstruction is described that achieves robust speech recognition in the presence of unknown noise. In previous work on Bayesian classification for mask estimation, white noise and colored noise were used for training mask estimators. This paper, which is concerned with both the simulation of a more diverse set of background environments and with mitigating the "sparse training" problem, describes a new Bayesian mask-estimation procedure in which each frequency band is trained independently. The new method employs colored noise for training, which is obtained by partitioning each frequency subband. We also propose a reevaluation method of voiced/unvoiced decisions to alleviate performance degradation that is caused by errors in pitch detection. Experimental results indicate that the proposed procedure in conjunction with cluster-based missing-feature imputation improves speech recognition accuracy on the Aurora 2.0 database in the presence for all types of background noise considered.
| Year | Citations | |
|---|---|---|
Page 1
Page 1