Publication | Closed Access
A class-based language model for large-vocabulary speech recognition extracted from part-of-speech statistics
46
Citations
12
References
1999
Year
Unknown Venue
Large-vocabulary Speech RecognitionEngineeringSpoken Language ProcessingAmbiguity ClassClass-based Language ModelCorpus LinguisticsSpeech RecognitionNatural Language ProcessingComputational LinguisticsPhoneticsRobust Speech RecognitionLanguage StudiesAmbiguity ClassesMachine TranslationPart-of-speech StatisticsClass-based LanguageSpeech CommunicationSpeech TechnologyLanguage RecognitionSpeech ProcessingSpeech InputSpeech PerceptionLinguistics
A novel approach is presented to class-based language modeling based on part-of-speech statistics. It uses a deterministic word-to-class mapping, which handles words with alternative part-of-speech assignments through the use of ambiguity classes. The predictive power of word-based language models and the generalization capability of class-based language models are combined using both linear interpolation and word-to-class backoff, and both methods are evaluated. Since each word belongs to one precisely ambiguity class, an exact word-to-class backoff model can easily be constructed. Empirical evaluations on large-vocabulary speech-recognition tasks show perplexity improvements and significant reductions in word error-rate.
| Year | Citations | |
|---|---|---|
Page 1
Page 1