Publication | Closed Access
Speech/music discrimination for multimedia applications
180
Citations
11
References
2002
Year
Unknown Venue
MusicAudio MiningEngineeringHealth SciencesAutomatic DiscriminationMusic ClassificationPattern RecognitionAudio AnalysisRobust Speech RecognitionDifferential ParametersSpeech ProcessingAudio RetrievalComputer ScienceSpectral ParametersSpeech PerceptionSignal ProcessingSpeech/music DiscriminationSpeech Recognition
Automatic discrimination of speech and music is an important tool in many multimedia applications. Previous work has focused on using long-term features such as differential parameters, variances and time-averages of spectral parameters. These classifiers use features estimated over windows of 0.5-5 seconds, and are relatively complex. We present our results of combining the line spectral frequencies (LSFs) and zero crossing-based features for frame-level narrowband speech/music discrimination. Our classification results for different types of music and speech show the good discriminating power of these features. Our classification algorithms operate using only a frame delay of 20 ms, making them suitable for real-time multimedia applications.
| Year | Citations | |
|---|---|---|
Page 1
Page 1