Publication | Closed Access
Polyphonic sound event detection using multi label deep neural networks
Citations: 259
References: 17
Year: 2015
Unknown Venue
Music, Audio Mining, Engineering, Machine Learning, Health Sciences, Music Classification, Pattern Recognition, Audio Analysis, Multi Label Classification, Speech Processing, Audio Retrieval, Distant Speech Recognition, Deep Learning, Deep Neural Network, Acoustic Modeling, Sound Events, Speech Recognition
This paper proposes multi-label neural networks for detecting temporally overlapping sound events in realistic environments. Real-life sound recordings typically contain many overlapping sound events, which makes it hard to recognize each event with standard sound event detection methods. Frame-wise spectral-domain features are used as inputs to train a deep neural network for multi-label classification. The model is evaluated on recordings from realistic everyday environments, and the overall accuracy obtained is 63.8%. The method is compared against a state-of-the-art method that uses non-negative matrix factorization as a pre-processing stage and hidden Markov models as a classifier. The proposed method improves the overall accuracy by 19 percentage points.
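The multi-label idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the class names, the 0.5 threshold, the sigmoid output layer, and the binary cross-entropy loss are all assumptions chosen for illustration, since the abstract does not specify them. The point it shows is that each event class gets an independent activation per frame, so several events can be detected as active at once.

```python
import math

# Hypothetical event labels; the abstract does not list the classes used.
EVENT_CLASSES = ["speech", "footsteps", "car"]


def sigmoid(x):
    """Map a raw network output to an independent per-class probability."""
    return 1.0 / (1.0 + math.exp(-x))


def detect_events(logits, threshold=0.5):
    """Return every class whose probability reaches the threshold.

    Unlike a softmax classifier, nothing forces exactly one winner,
    so overlapping events in the same frame can all be reported.
    The threshold value is an assumption, not taken from the paper.
    """
    return [name for name, z in zip(EVENT_CLASSES, logits)
            if sigmoid(z) >= threshold]


def binary_cross_entropy(logits, targets):
    """Mean per-class binary cross-entropy for one frame (assumed loss)."""
    eps = 1e-12  # guards against log(0)
    total = 0.0
    for z, t in zip(logits, targets):
        p = sigmoid(z)
        total -= t * math.log(p + eps) + (1 - t) * math.log(1 - p + eps)
    return total / len(logits)


# One frame of made-up network outputs: two events overlap, one is absent.
frame_logits = [2.0, 1.2, -3.0]
active = detect_events(frame_logits)
print(active)
```

In a full system, a function like `detect_events` would be applied to the network output for every frame of spectral features, giving an activity track per event class over time.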