Publication | Open Access
Two decades of statistical language modeling: where do we go from here?
729
Citations
78
References
2000
Year
EngineeringSpoken Language ProcessingMultilingual PretrainingLarge Language ModelCorpus LinguisticsSpeech RecognitionApplied LinguisticsNatural Language ProcessingSyntaxLinguistic TheoriesComputational LinguisticsLanguage EngineeringGrammarLanguage StudiesStatisticsMachine TranslationStatistical Language ModelsLanguage TechnologyDistributional SemanticsStatistical Language ModelingLanguage RecognitionLanguage CorpusStatistical InferenceLinguistics
Statistical language models estimate the distribution of various natural language phenomena for the purpose of speech recognition and other language technologies. Since the first significant model was proposed in 1980, many attempts have been made to improve the state of the art. We review them, point to a few promising directions, and argue for a Bayesian approach to integration of linguistic theories with data.
| Year | Citations | |
|---|---|---|
Page 1
Page 1