Publication | Closed Access
Unvoiced Speech Segregation From Nonspeech Interference via CASA and Spectral Subtraction
Citations: 57
References: 25
Year: 2010
Source Separation, Engineering, Monaural Mixtures, Speech Enhancement, Speech Recognition, Noise, Robust Speech Recognition, Health Sciences, Spectral Subtraction, Unvoiced Speech Segregation, Unvoiced Speech, Distant Speech Recognition, Signal Processing, Speech Communication, Multi-speaker Speech Recognition, Speech Processing, Speech Separation, Speech Perception, Signal Separation
While much effort in computational auditory scene analysis (CASA) has gone into segregating voiced speech from monaural mixtures, unvoiced speech segregation has received little attention. Unvoiced speech is highly susceptible to interference because of its relatively weak energy and lack of harmonic structure, which makes its segregation extremely difficult. This paper proposes a new approach to segregating unvoiced speech from nonspeech interference. The proposed system first removes estimated voiced speech and the periodic part of the interference based on cross-channel correlation. The resulting interference is more stationary, and we estimate the noise energy in unvoiced intervals using segregated speech in neighboring voiced intervals. Unvoiced speech segregation then proceeds in two stages: segmentation and grouping. In segmentation, we apply spectral subtraction to generate time-frequency segments in unvoiced intervals. In grouping, unvoiced speech segments are grouped based on frequency characteristics of unvoiced speech, using simple thresholding as well as Bayesian classification. The proposed algorithm is computationally efficient, and systematic evaluation and comparison show that our approach considerably improves the performance of unvoiced speech segregation.
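The noise estimation and spectral subtraction steps described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names `estimate_noise_energy` and `spectral_subtraction_mask`, the per-channel averaging rule, the array layout (channels by frames), and the 0 dB threshold are all assumptions made for illustration. The sketch also omits the later steps that form segments from the retained units and group them.

```python
import numpy as np


def estimate_noise_energy(mix_energy, speech_mask, voiced_frames):
    """Hypothetical noise estimate: per-channel mean of mixture energy over
    time-frequency units that lie in voiced frames but were NOT assigned
    to the segregated voiced speech.

    mix_energy    : (channels, frames) mixture energy per T-F unit
    speech_mask   : (channels, frames) bool, True where voiced speech dominates
    voiced_frames : (frames,) bool, True for frames in neighboring voiced intervals
    """
    mix_energy = np.asarray(mix_energy, dtype=float)
    speech_mask = np.asarray(speech_mask, dtype=bool)
    voiced_frames = np.asarray(voiced_frames, dtype=bool)

    # Units in voiced frames that are treated as noise-dominant.
    noise_units = ~speech_mask & voiced_frames[None, :]
    counts = noise_units.sum(axis=1)
    totals = (mix_energy * noise_units).sum(axis=1)
    # Channels with no usable units fall back to zero (an assumption).
    return np.where(counts > 0, totals / np.maximum(counts, 1), 0.0)


def spectral_subtraction_mask(mix_energy, noise_energy, snr_threshold_db=0.0):
    """Subtract the estimated noise energy from the mixture energy and keep
    T-F units whose estimated local SNR exceeds the (assumed) threshold."""
    mix_energy = np.asarray(mix_energy, dtype=float)
    noise = np.asarray(noise_energy, dtype=float)[:, None]

    # Spectral subtraction in the energy domain, floored at zero.
    speech_est = np.maximum(mix_energy - noise, 0.0)

    # Tiny constant avoids log(0) and division by zero.
    eps = np.finfo(float).tiny
    snr_db = 10.0 * np.log10((speech_est + eps) / (noise + eps))
    return snr_db > snr_threshold_db


# Toy usage with made-up numbers: 2 channels, 4 frames.
mix = np.array([[1.0, 3.0, 10.0, 1.0],
                [2.0, 2.0, 2.0, 2.0]])
voiced = np.array([True, False, False, True])
voiced_speech = np.zeros_like(mix, dtype=bool)
voiced_speech[1, 0] = True

noise = estimate_noise_energy(mix, voiced_speech, voiced)
candidate_units = spectral_subtraction_mask(mix, noise)
```

In this toy example, only units whose energy exceeds the estimated noise floor by more than the threshold survive as candidates. In the described system, such candidates would then be formed into segments and grouped by thresholding or Bayesian classification.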