Publication | Closed Access
Comparing approaches to convert recurrent neural networks into backoff language models for efficient decoding
24
Citations
13
References
2014
Year
Unknown Venue
EngineeringMachine LearningCross-lingual RepresentationMultilingual PretrainingLarge Language ModelRecurrent Neural NetworkCorpus LinguisticsSpeech RecognitionNatural Language ProcessingData ScienceComputational LinguisticsRecurrent Neural NetworksLanguage StudiesLanguage ModelsMachine TranslationSequence ModellingIterative ConversionLinguisticsComputer ScienceDeep LearningNeural Machine TranslationBackoff Language ModelsEfficient DecodingLanguage ModelingSpeech Translation
In this paper, we investigate and compare three different possibilities to convert recurrent neural network language models (RNNLMs) into backoff language models (BNLM). While RNNLMs often outperform traditional n-gram approaches in the task of language modeling, their computational demands make them unsuitable for an efficient usage during decoding in an LVCSR system. It is, therefore, of interest to convert them into BNLMs in order to integrate their information into the decoding process. This paper compares three different approaches: a text based conversion, a probability based conversion and an iterative conversion. The resulting language models are evaluated in terms of perplexity and mixed error rate in the context of the Code-Switching data corpus SEAME. Although the best results are obtained by combining the results of all three approaches, the text based conversion approach alone leads to significant improvements on the SEAME corpus as well while offering the highest computational efficiency. In total, the perplexity can be reduced by 11.4% relative on the evaluation set and the mixed error rate by 3.0% relative on the same data set.
| Year | Citations | |
|---|---|---|
Page 1
Page 1