Publication | Open Access
The Alignment Template Approach to Statistical Machine Translation
930
Citations
40
References
2004
Year
EngineeringCorpus LinguisticsLanguage ProcessingText MiningNatural Language ProcessingAlignment Template ApproachSyntaxComputational LinguisticsTranslation ApproachCorpus AnalysisLanguage StudiesAlignment Template SystemMachine TranslationComputer-assisted TranslationLinguisticsLanguage Modeling (Natural Language Processing)Neural Machine TranslationMachine Translation EvaluationLanguage Modeling (Theoretical Linguistics)Speech Translation
The paper proposes the alignment template approach, a phrase‑based statistical machine translation method, and evaluates its components on the German‑English Verbmobil task. The method models many‑to‑many word relations using a log‑linear framework that incorporates context and word‑order changes, learns phrasal translations with feature functions, and searches for best alignments, evaluated on three tasks. The alignment template system is easier to extend and achieves significantly better results than single‑word models on the French‑English Hansards task and outperforms all competing systems on the Chinese‑English NIST 2002 evaluation.
A phrase-based statistical machine translation approach — the alignment template approach — is described. This translation approach allows for general many-to-many relations between words. Thereby, the context of words is taken into account in the translation model, and local changes in word order from source to target language can be learned explicitly. The model is described using a log-linear modeling approach, which is a generalization of the often used source-channel approach. Thereby, the model is easier to extend than classical statistical machine translation systems. We describe in detail the process for learning phrasal translations, the feature functions used, and the search algorithm. The evaluation of this approach is performed on three different tasks. For the German-English speech Verbmobil task, we analyze the effect of various system components. On the French-English Canadian Hansards task, the alignment template system obtains significantly better results than a single-word-based translation model. In the Chinese-English 2002 National Institute of Standards and Technology (NIST) machine translation evaluation it yields statistically significantly better NIST scores than all competing research and commercial translation systems.
| Year | Citations | |
|---|---|---|
Page 1
Page 1