Publication | Open Access
Marian: Cost-effective High-Quality Neural Machine Translation in C++
54
Citations
9
References
2018
Year
Unknown Venue
Natural Language ProcessingArtificial IntelligenceComputer-assisted TranslationPareto FrontierEngineeringMachine LearningLarge Ai ModelWnmt 2018Computer EngineeringMulti-task LearningTransformer ModelComputer ScienceVideo TransformerSpeech TranslationModel CompressionMachine TranslationNeural Machine Translation
This paper describes the submissions of the "Marian" team to the WNMT 2018 shared task. We investigate combinations of teacher-student training, low-precision matrix products, auto-tuning and other methods to optimize the Transformer model on GPU and CPU. By further integrating these methods with the new averaging attention networks, a recently introduced faster Transformer variant, we create a number of high-quality, high-performance models on the GPU and CPU, dominating the Pareto frontier for this shared task.
| Year | Citations | |
|---|---|---|
Page 1
Page 1