Publication | Closed Access
Multimodal new vocabulary recognition through speech and handwriting in a whiteboard scheduling application
28
Citations
37
References
2005
Year
Unknown Venue
EngineeringSpeech CorpusSpoken Language ProcessingLanguage LearningSpeech RecognitionNatural Language ProcessingMultimodal InterfaceData ScienceComputational LinguisticsLanguage AcquisitionMultimodal InteractionSpeech InterfaceLanguage StudiesMultimodal Signal ProcessingComputer ScienceOov WordsSpeech CommunicationSpeech TechnologyNew VocabularySpeech ProcessingSpeech InputSpeech PerceptionLinguistics
Our goal is to automatically recognize and enroll new vocabulary in a multimodal interface. To accomplish this our technique aims to leverage the mutually disambiguating aspects of co-referenced, co-temporal handwriting and speech. The co-referenced semantics are spatially and temporally determined by our multimodal interface for schedule chart creation. This paper motivates and describes our technique for recognizing out-of-vocabulary (OOV) terms and enrolling them dynamically in the system. We report results for the detection and segmentation of OOV words within a small multimodal test set. On the same test set we also report utterance, word and pronunciation level error rates both over individual input modes and multimodally. We show that combining information from handwriting and speech yields significantly better results than achievable by either mode alone.
| Year | Citations | |
|---|---|---|
Page 1
Page 1