Publication | Closed Access
Inside the spectrogram: Convolutional Neural Networks in audio processing
56
Citations
7
References
2017
Year
Unknown Venue
Convolutional Neural NetworkEngineeringMachine LearningNeural NetworkAcoustic ModelingSpeech RecognitionImage AnalysisPattern RecognitionAudio Signal ProcessingAudio AnalysisVoice RecognitionHealth SciencesAudio RetrievalComputer ScienceDeep LearningRaw Audio SignalAudio MiningConvolutional Neural NetworksSpeech Processing
Convolutional Neural Networks have established a new standard in many machine learning applications not only in image but also in audio processing. In this contribution we investigate the interplay between the primary representation mapping a raw audio signal to some kind of image (feature) and the convolutional layers of an ensuing neural network. We introduce a new notion of equivalence of feature-network pairs and show the relation of feature and networks for the example of mel-spectrogram input on the one hand and varying analysis windows on the other hand.
| Year | Citations | |
|---|---|---|
Page 1
Page 1