Publication | Closed Access
Quality Estimation for Automatic Speech Recognition
27
Citations
33
References
2014
Year
EngineeringMachine LearningRegression TaskWord Error RateSpoken Language ProcessingLanguage ProcessingSpeech RecognitionNatural Language ProcessingData ScienceComputational LinguisticsRobust Speech RecognitionAutomatic RecognitionVoice RecognitionHealth SciencesClinical LanguageComputer ScienceSignal ProcessingSpeech CommunicationSpeech TechnologySpeech AnalysisAutomatic Speech RecognitionSpeech AcousticsSpeech ProcessingSpeech InputQuality EstimationSpeech PerceptionLinguistics
We address the problem of estimating the quality of Automatic Speech Recognition (ASR) output at utterance level, without recourse to manual reference transcriptions and when information about system’s confidence is not accessible. Given a source signal and its automatic transcription, we approach this problem as a regression task where the word error rate of the transcribed utterance has to be predicted. To this aim, we explore the contribution of different feature sets and the potential of different algorithms in testing conditions of increasing complexity. Results show that our automatic quality estimates closely approximate the word error rate scores calculated over reference transcripts, outperforming a strong baseline in all the testing conditions.
| Year | Citations | |
|---|---|---|
Page 1
Page 1