Publication | Closed Access
Contextual Information Improves OOV Detection in Speech
61
Citations
25
References
2010
Year
Unknown Venue
Oov RegionsEngineeringMachine LearningSpoken Language ProcessingOov DetectionCorpus LinguisticsSpeech RecognitionNatural Language ProcessingData SciencePattern RecognitionRobust Speech RecognitionReal-time LanguageHealth SciencesComputer ScienceDeep LearningSpeech CommunicationSpeech AnalysisSpeech ProcessingSpeech InputMissed Oov RateSpeech PerceptionLinguistics
Out-of-vocabulary (OOV) words represent an important source of error in large vocabulary continuous speech recognition (LVCSR) systems. These words cause recognition failures, which propagate through pipeline systems impacting the performance of downstream applications. The detection of OOV regions in the output of a LVCSR system is typically addressed as a binary classification task, where each region is independently classified using local information. In this paper, we show that jointly predicting OOV regions, and including contextual information from each region, leads to substantial improvement in OOV detection. Compared to the state-of-the-art, we reduce the missed OOV rate from 42.6% to 28.4% at 10% false alarm rate.
| Year | Citations | |
|---|---|---|
Page 1
Page 1