Concepedia

Publication | Closed Access

Domain Adaptation for Machine Translation by Mining Unseen Words

Citations: 138

References: 13

Year: 2011

Abstract

We show that unseen words account for a large part of the translation error when moving to new domains. Using an extension of a recent approach to mining translations from comparable corpora (Haghighi et al., 2008), we are able to find translations for otherwise out-of-vocabulary (OOV) terms. We show several approaches to integrating such translations into a phrase-based translation system, yielding consistent improvements in translation quality (between 0.5 and 1.5 BLEU points) on four domains and two language pairs.
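To make the notion of unseen (OOV) words concrete, here is a minimal sketch that measures how many tokens of a new-domain text never occur in the training data. It is an illustration only, not the paper's method: the function name `oov_rate`, the whitespace tokenization, and the toy sentences are all assumptions introduced here.

```python
def oov_rate(train_sentences, test_sentences):
    """Return (token-level OOV rate, set of unseen word types).

    Hypothetical helper: tokenizes on whitespace, which is a simplification.
    """
    # Vocabulary observed in the (old-domain) training data.
    vocab = {tok for sent in train_sentences for tok in sent.split()}
    # All tokens of the new-domain text.
    test_tokens = [tok for sent in test_sentences for tok in sent.split()]
    # Tokens that never appeared in training are out-of-vocabulary.
    oov = [tok for tok in test_tokens if tok not in vocab]
    rate = len(oov) / len(test_tokens) if test_tokens else 0.0
    return rate, set(oov)


# Toy data (invented for illustration): parliamentary training text
# versus a single medical-domain sentence.
train = ["the council adopted the resolution", "the vote was unanimous"]
new_domain = ["the patient received the vaccine"]

rate, unseen = oov_rate(train, new_domain)
print(f"OOV rate: {rate:.2f}, unseen words: {sorted(unseen)}")
```

The unseen word types returned here are the kind of terms for which translations would then need to be mined from other resources, such as comparable corpora.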
