Publication | Open Access
A Robust Method to Count and Locate Audio Sources in a Multichannel Underdetermined Mixture
113
Citations
22
References
2009
Year
MusicSource SeparationEngineeringMixing DirectionsSpeech RecognitionRobust MethodData ScienceMixture AnalysisAudio AnalysisAcoustic Signal ProcessingConfidence LevelStatisticsHealth SciencesMulti-channel ProcessingSignal ProcessingLocate Audio SourcesAnechoic MixturesMixture DistributionSpeech ProcessingStatistical InferenceSignal Separation
We propose a method to count and estimate the mixing directions in an underdetermined multichannel mixture. The approach is based on the hypothesis that in the neighborhood of <i xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">some</i> time-frequency points, only one source essentially contributes to the mixture: such time-frequency points can provide robust local estimates of the corresponding source direction. At the core of our contribution is a statistical model to exploit a local confidence measure, which detects the time-frequency regions where such robust information is available. A clustering algorithm called DEMIX is proposed to merge the information from all time-frequency regions according to their confidence level. So as to estimate the delays of anechoic mixtures and overcome the intrinsic ambiguities of phase unwrapping as met with DUET, we propose a technique similar to GCC-PHAT that is able to estimate delays that can largely exceed one sample. We propose an extensive experimental study that shows the resulting method is more robust in conditions where all DUET-like comparable methods fail, that is, in particular, a) when time-delays largely exceed one sample and b) when the source directions are very close.
| Year | Citations | |
|---|---|---|
Page 1
Page 1