Publication | Open Access
Assessing and improving the performance of speech recognition for incremental systems
50
Citations
6
References
2009
Year
Unknown Venue
EngineeringSpoken Language ProcessingSpoken Dialog SystemIntelligent SystemsCorpus LinguisticsIncremental AsrSpeech RecognitionNatural Language ProcessingComputational LinguisticsIncremental Asr ComponentsRobust Speech RecognitionSystems EngineeringSpeech InterfaceIncremental SystemsConversation AnalysisVoice RecognitionLanguage StudiesReal-time LanguageMachine TranslationDialogue ManagementComputer EngineeringComputer ScienceSimple Incremental Post-processingSignal ProcessingSpeech CommunicationSpeech TechnologySpeech ProcessingSpeech InputSpeech PerceptionLinguistics
In incremental spoken dialogue systems, partial hypotheses about what was said are required even while the utterance is still ongoing. We define measures for evaluating the quality of incremental ASR components with respect to the relative correctness of the partial hypotheses compared to hypotheses that can optimize over the complete input, the timing of hypothesis formation relative to the portion of the input they are about, and hypothesis stability, defined as the number of times they are revised. We show that simple incremental post-processing can improve stability dramatically, at the cost of timeliness (from 90 % of edits of hypotheses being spurious down to 10 % at a lag of 320 ms). The measures are not independent, and we show how system designers can find a desired operating point for their ASR. To our knowledge, we are the first to suggest and examine a variety of measures for assessing incremental ASR and improve performance on this basis.
| Year | Citations | |
|---|---|---|
Page 1
Page 1