Publication | Open Access
Automatic evaluation of machine translation quality using longest common subsequence and skip-bigram statistics
719
Citations
16
References
2004
Year
Unknown Venue
Natural Language ProcessingComputer-assisted TranslationLongest Common SubsequenceEngineeringMultimodal TranslationAutomatic EvaluationCorpus LinguisticsComputational LinguisticsMachine Translation QualityLanguage EngineeringCandidate TranslationLanguage StudiesSkip-bigram StatisticsLinguisticsText MiningMachine TranslationNeural Machine Translation
In this paper we describe two new objective automatic evaluation methods for machine translation. The first method is based on longest common subsequence between a candidate translation and a set of reference translations. Longest common subsequence takes into account sentence level structure similarity naturally and identifies longest co-occurring in-sequence n-grams automatically. The second method relaxes strict n-gram matching to skip-bigram matching. Skip-bigram is any pair of words in their sentence order. Skip-bigram cooccurrence statistics measure the overlap of skip-bigrams between a candidate translation and a set of reference translations. The empirical results show that both methods correlate with human judgments very well in both adequacy and fluency.
| Year | Citations | |
|---|---|---|
Page 1
Page 1