Publication | Open Access
Adaptive language modeling using minimum discriminant estimation
79
Citations
5
References
1992
Year
Unknown Venue
EngineeringClosest N-gram DistributionCorpus LinguisticsText MiningSpeech RecognitionNatural Language ProcessingApplied LinguisticsWord EmbeddingsStatic N-gram DistributionInformation RetrievalData ScienceLanguage AdaptationComputational LinguisticsLanguage EngineeringLanguage StudiesMachine TranslationN-gram Language ModelNlp TaskDistributional SemanticsLanguage RecognitionAdaptive LanguageSpeech ProcessingText ProcessingLinguistics
We present an algorithm to adapt a n-gram language model to a document as it is dictated. The observed partial document is used to estimate a unigram distribution for the words that already occurred. Then, we find the closest n-gram distribution to the static n-gram distribution (using the discrimination information distance measure) and that satisfies the marginal constraints derived from the document. The resulting minimum discrimination information model results in a perplexity of 208 instead of 290 for the static trigram model on a document of 321 words.
| Year | Citations | |
|---|---|---|
Page 1
Page 1