Publication | Open Access
Automatic acquisition of names using speak and spell mode in spoken dialogue systems
33
Citations
16
References
2003
Year
Unknown Venue
EngineeringSpeech CorpusAutomatic AcquisitionSpoken Language ProcessingSpoken Dialog SystemCorpus LinguisticsSpeech RecognitionNatural Language ProcessingDialogue SystemLanguage DocumentationPhoneticsComputational LinguisticsSpoken Dialogue SystemsSpeech InterfaceLanguage StudiesMachine TranslationOpen SetComputer ScienceNew NameSpeech CommunicationSpeech TechnologyLanguage RecognitionSpeech ProcessingSpeech InputSpeech PerceptionLinguistics
This paper describes a novel multi-stage recognition procedure for deducing the spelling and pronunciation of an open set of names. The overall goal is the automatic acquisition of unknown words in a human computer conversational system. The names are spoken and spelled in a single utterance, achieving a concise and natural dialogue flow. The first recognition pass extracts letter hypotheses from the spelled part of the waveform and maps them to phonemic hypotheses via a hierarchical sublexical model capable of generating graphemephoneme mappings. A second recognition pass determines the name by combining information from the spoken and spelled part of the waveform, augmented with language model constraints. The procedure is integrated into a spoken dialogue system where users are asked to enroll their names for the first time. The acquisition process is implemented in multiple parallel threads for real-time operation. Subsequent to inducing the spelling and pronunciation of a new name, a series of operations automatically updates the recognition and natural language systems to immediately accommodate the new word. Experiments show promising results for letter and phoneme accuracies on a preliminary dataset.
| Year | Citations | |
|---|---|---|
Page 1
Page 1