Publication | Open Access
Adaptation of Multiple Sound Source Localization Neural Networks with Weak Supervision and Domain-adversarial Training
41
Citations
16
References
2019
Year
Unknown Venue
MusicEngineeringMachine LearningLocalizationAcoustic ModelingSpeech RecognitionDomain Adaptation MethodsSound Source LocalizationSpeaker LocalizationAudio Signal ProcessingNoiseAudio AnalysisAcoustic Signal ProcessingHealth SciencesWeak SupervisionDeep LearningGenerative Adversarial NetworkDomain AdaptationSpeech ProcessingDomain-adversarial Training
Despite the recent success of deep neural network-based approaches in sound source localization, these approaches suffer the limitations that the required annotation process is costly, and the mismatch between the training and test conditions undermines the performance. This paper addresses the question of how models trained with simulation can be exploited for multiple sound source localization in real scenarios by domain adaptation. In particular, two domain adaptation methods are investigated: weak supervision and domain-adversarial training. Our experiments show that the weak supervision with the knowledge of the number of sources can significantly improve the performance of an unadapted model. However, the domain-adversarial training does not yield significant improvement for this particular problem.
| Year | Citations | |
|---|---|---|
Page 1
Page 1