Publication | Closed Access
Robust topic inference for latent semantic language model adaptation
13
Citations
17
References
2007
Year
Unknown Venue
Llm Fine-tuningEngineeringMachine LearningMultilingual PretrainingLarge Language ModelCorpus LinguisticsText MiningWord EmbeddingsNatural Language ProcessingSpeech RecognitionLanguage Model AdaptationData ScienceLanguage AdaptationComputational LinguisticsLanguage StudiesAsr ConfidenceMachine TranslationRobust Topic InferenceTopic ModelSpeech ProcessingTopic MixtureLinguistics
We perform topic-based, unsupervised language model adaptation under an N-best rescoring framework by using previous-pass system hypotheses to infer a topic mixture which is used to select topic-dependent LMs for interpolation with a topici-ndependent LM. Our primary focus is on techniques for improving the robustness of topic inference for a given utterance with respect to recognition errors, including the use of ASR confidence and contextual information from surrounding utterances. We describe a novel application of metadata-based pseudo-story segmentation to language model adaptation, and present good improvements to character error rate on multigenre GALE Project data in Mandarin Chinese.
| Year | Citations | |
|---|---|---|
Page 1
Page 1