Publication | Closed Access
A comparison of Deep Learning methods for environmental sound detection
140
Citations
22
References
2017
Year
Unknown Venue
Stereo Audio RecordingsMachine LearningEngineeringNoise PredictionAcoustic ModelingSpeech RecognitionOcean AcousticsIeee ChallengeData SciencePattern RecognitionAudio AnalysisNoiseAcoustic Signal ProcessingHealth SciencesMachine VisionComputer ScienceDeep LearningDistant Speech RecognitionComputer VisionEnvironmental Sound DetectionAudio MiningSpeech ProcessingEnvironmental Signal Processing
Environmental sound detection is a challenging application of machine learning because of the noisy nature of the signal, and the small amount of (labeled) data that is typically available. This work thus presents a comparison of several state-of-the-art Deep Learning models on the IEEE challenge on Detection and Classification of Acoustic Scenes and Events (DCASE) 2016 challenge task and data, classifying sounds into one of fifteen common indoor and outdoor acoustic scenes, such as bus, cafe, car, city center, forest path, library, train, etc. In total, 13 hours of stereo audio recordings are available, making this one of the largest datasets available.
| Year | Citations | |
|---|---|---|
Page 1
Page 1