Publication | Closed Access
Efficient estimation of maximum entropy language models with n-gram features: an SRILM extension
45
Citations
13
References
2010
Year
Unknown Venue
EngineeringMachine LearningSrilm ExtensionSpoken Language ProcessingN -Gram FeaturesMultilingual PretrainingLarge Language ModelCorpus LinguisticsText MiningSpeech RecognitionNatural Language ProcessingData ScienceComputational LinguisticsGrammarLanguage StudiesN -Gram ModelsMachine TranslationN-gram FeaturesLanguage RecognitionSpeech ProcessingSrilm ToolkitEfficient EstimationSpeech InputLinguisticsPo Tagging
We present an extension to the SRILM toolkit for training maximum entropy language models with N -gram features. The extension uses a hierarchical parameter estimation procedure [1] for making the training time and memory consumption feasible for moderately large training data (hundreds of millions of words). Experiments on two speech recognition tasks indicate that the models trained with our implementation perform equally to or better than N -gram models built with interpolated Kneser-Ney discounting.
| Year | Citations | |
|---|---|---|
Page 1
Page 1