Publication | Closed Access

Improving machine translation of null subjects in Italian and Spanish

Citations: 10

References: 17

Year: 2012

Abstract

Null subjects are subject pronouns that are not overtly expressed, found in pro-drop languages such as Italian and Spanish. In this study we quantify and compare the occurrence of this phenomenon in these two languages. Next, we evaluate the translation of null subjects into French, a “non-pro-drop” language. We use the Europarl corpus to evaluate two MT systems on their performance regarding null subject translation: Its-2, a rule-based system developed at LATL, and a statistical system built using the Moses toolkit. Then we add a rule-based preprocessor and a statistical post-editor to the Its-2 translation pipeline. A second evaluation of the improved Its-2 system shows an average increase of 15.46% in correct pro-drop translations for Italian-French and 12.80% for Spanish-French.
