Publication | Closed Access
Construction and evaluation of a robust multifeature speech/music discriminator
Citations: 870
References: 8
Year: 2002
Unknown Venue
Topics: Music, Engineering, Speech Recognition, Data Science, Phonetics, Audio Analysis, Robust Speech Recognition, Health Sciences, Audio Retrieval, Computer Science, Speech Communication, Digital Audio Input, Audio Mining, Music Classification, Real-time Computer System, Music Signals, Speech Processing, Speech Perception, Audio Interface, Speaker Recognition
We report on the construction of a real-time computer system capable of distinguishing speech signals from music signals over a wide range of digital audio input. We have examined 13 features intended to measure conceptually distinct properties of speech and/or music signals, and combined them in several multidimensional classification frameworks. We provide extensive data on system performance and the cross-validated training/test setup used to evaluate the system. For the datasets currently in use, the best classifier achieves 5.8% error on a frame-by-frame basis, and 1.4% error when integrating over long (2.4 second) segments of sound.
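The gap between the frame-level and segment-level error rates can be illustrated with a small sketch. The abstract does not say how frame decisions are integrated over a segment, so the majority vote below is an assumption, and the function names `integrate_segments` and `error_rate` are hypothetical. The segment length of 5 frames is a toy value, not the frame count of a 2.4 second segment in the paper.

```python
from collections import Counter
from typing import List, Sequence


def integrate_segments(frame_labels: Sequence[str], frames_per_segment: int) -> List[str]:
    """Collapse frame-level labels into one label per segment.

    Assumption: integration is a simple majority vote. The source does not
    specify the integration rule actually used.
    """
    if frames_per_segment <= 0:
        raise ValueError("frames_per_segment must be positive")
    segment_labels = []
    for start in range(0, len(frame_labels), frames_per_segment):
        window = frame_labels[start:start + frames_per_segment]
        # most_common(1) yields the most frequent label in the window;
        # on a tie, the label encountered first wins.
        segment_labels.append(Counter(window).most_common(1)[0][0])
    return segment_labels


def error_rate(predicted: Sequence[str], truth: Sequence[str]) -> float:
    """Fraction of positions where the predicted label differs from the truth."""
    if len(predicted) != len(truth) or not truth:
        raise ValueError("inputs must be non-empty and of equal length")
    wrong = sum(p != t for p, t in zip(predicted, truth))
    return wrong / len(truth)


if __name__ == "__main__":
    # Toy data: two 5-frame segments, each with one misclassified frame.
    frames = ["speech", "speech", "music", "speech", "speech",
              "music", "music", "speech", "music", "music"]
    truth = ["speech"] * 5 + ["music"] * 5
    print(error_rate(frames, truth))
    print(error_rate(integrate_segments(frames, 5), ["speech", "music"]))
```

In this toy case isolated frame errors are outvoted within each segment, which is one plausible reason a segment-level error can fall well below the frame-level error.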