Publication | Closed Access
Full expansion of context-dependent networks in large vocabulary speech recognition
67
Citations
13
References
2002
Year
Unknown Venue
Full ExpansionEngineeringSpoken Language ProcessingMultilingual PretrainingLarge Language ModelCorpus LinguisticsText MiningSpeech RecognitionNatural Language ProcessingWord EmbeddingsData ScienceComputational LinguisticsRobust Speech RecognitionLanguage StudiesLanguage ModelsMachine TranslationN-gram Language ModelNlp TaskDeep LearningDistant Speech RecognitionContext-dependent Network RepresentationSpeech CommunicationWeighted NetworksMulti-speaker Speech RecognitionSpeech ProcessingSpeech InputSpeech PerceptionLinguistics
We combine our earlier approach to context-dependent network representation with our algorithm for determining weighted networks to build optimized networks for large-vocabulary speech recognition combining an n-gram language model, a pronunciation dictionary and context-dependency modeling. While fully-expanded networks have been used before in restrictive settings (medium vocabulary or no cross-word contexts), we demonstrate that our network determination method makes it practical to use fully-expanded networks also in large-vocabulary recognition with full cross-word context modeling. For the DARPA North American Business News task (NAB), we give network sizes and recognition speeds and accuracies using bigram and trigram grammars with vocabulary sizes ranging from 10000 to 160000 words. With our construction, the fully-expanded NAB context-dependent networks contain only about twice as many arcs as the corresponding language models. Interestingly, we also find that, with these networks, real-time word accuracy is improved by increasing the vocabulary size and n-gram order.
| Year | Citations | |
|---|---|---|
Page 1
Page 1