Publication | Open Access

Detecting Cross-Lingual Semantic Divergence for Neural Machine Translation

Citations: 56

References: 37

Year: 2017

Abstract

Parallel corpora are often not as parallel as one might assume: non-literal translations and noisy translations abound, even in curated corpora routinely used for training and evaluation. We use a cross-lingual textual entailment system to distinguish sentence pairs that are parallel in meaning from those that are not, and show that filtering out divergent examples from training improves translation quality.
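
The filtering step described in the abstract can be illustrated with a minimal sketch. This is not the authors' system: the function names `filter_divergent` and `length_ratio_score`, the threshold of 0.5, and the example sentences are all hypothetical, and the length-ratio scorer is only a toy stand-in for the cross-lingual textual entailment classifier the abstract mentions. The point is the shape of the pipeline: score each sentence pair for divergence, then keep only pairs below a threshold before training.

```python
from typing import Callable, Iterable, List, Tuple

SentencePair = Tuple[str, str]


def filter_divergent(
    pairs: Iterable[SentencePair],
    divergence_score: Callable[[str, str], float],
    threshold: float = 0.5,
) -> List[SentencePair]:
    """Keep only pairs whose divergence score is below the threshold.

    `divergence_score` is supplied by the caller; in the paper's setting it
    would come from a trained divergence detector (assumption: higher
    scores mean "more divergent").
    """
    return [(src, tgt) for src, tgt in pairs if divergence_score(src, tgt) < threshold]


def length_ratio_score(src: str, tgt: str) -> float:
    """Toy stand-in scorer, NOT the paper's method.

    Returns 0.0 when both sides have the same token count and approaches
    1.0 as the token counts diverge.
    """
    n_src, n_tgt = len(src.split()), len(tgt.split())
    if max(n_src, n_tgt) == 0:
        return 0.0
    return 1.0 - min(n_src, n_tgt) / max(n_src, n_tgt)


# Hypothetical two-pair corpus: the second target is far longer than its source.
corpus = [
    ("the cat sleeps", "le chat dort"),
    ("thank you", "merci beaucoup pour votre aide précieuse hier soir"),
]
kept = filter_divergent(corpus, length_ratio_score, threshold=0.5)
```

Passing the scorer as a parameter keeps the filter independent of how divergence is detected, so a learned classifier could replace the toy heuristic without changing the filtering code.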