Publication | Closed Access
An SVM based classification approach to speech separation
37
Citations
14
References
2011
Year
Unknown Venue
Source SeparationEngineeringMachine LearningClassification ApproachSpeech EnhancementSpeech RecognitionPattern RecognitionMonaural Separation ProblemRobust Speech RecognitionHealth SciencesSignal ProcessingSpeech CommunicationAuditory Segmentation StageMulti-speaker Speech RecognitionSpeech SeparationSpeech ProcessingMonaural Speech SeparationSpeech PerceptionLinguistics
Monaural speech separation is a very challenging task. CASA-based systems utilize acoustic features to produce a time-frequency (T-F) mask. In this study, we propose a classification approach to monaural separation problem. Our feature set consists of pitch-based features and amplitude modulation spectrum features, which can discriminate both voiced and unvoiced speech from nonspeech interference. We employ support vector machines (SVMs) followed by a re-thresholding method to classify each T-F unit as either target-dominated or interference-dominated. An auditory segmentation stage is then utilized to improve SVM-generated results. Systematic evaluations show that our approach produces high quality binary masks and outperforms a previous system in terms of classification accuracy.
| Year | Citations | |
|---|---|---|
Page 1
Page 1