Publication | Closed Access
Audio segmentation for speech recognition using segment features
66
Citations
9
References
2009
Year
Unknown Venue
Audio SegmentationEngineeringMachine LearningSegmentation MethodsSpeech RecognitionImage AnalysisData SciencePattern RecognitionAudio AnalysisRobust Speech RecognitionVoice RecognitionHealth SciencesPromising Segmentation QualityComputer ScienceDistant Speech RecognitionSignal ProcessingComputer VisionSpeech CommunicationAudio MiningSpeech ProcessingSpeech Perception
Audio segmentation is an essential preprocessing step in several audio processing applications with a significant impact e.g. on speech recognition performance. We introduce a novel framework which combines the advantages of different well known segmentation methods. An automatically estimated log-linear segment model is used to determine the segmentation of an audio stream in a holistic way by a maximum a posteriori decoding strategy, instead of classifying change points locally. A comparison to other segmentation techniques in terms of speech recognition performance is presented, showing a promising segmentation quality of our approach.
| Year | Citations | |
|---|---|---|
Page 1
Page 1