Publication | Closed Access
Parallel implementations of word alignment tool
369
Citations
8
References
2008
Year
Unknown Venue
EngineeringMultilingual PretrainingLarge Language ModelCorpus LinguisticsText MiningParallel ToolSpeech RecognitionNatural Language ProcessingParallel SoftwareData ScienceComputational LinguisticsLanguage StudiesParallel ComputingMachine TranslationComputer-assisted TranslationWord Alignment ToolLarge CorporaLinguisticsComputer ScienceNeural Machine TranslationParallel ProcessingWord Alignment ProcessParallel ProgrammingWord Alignment ModelsSpeech Translation
Training word alignment models on large corpora is a very time-consuming processes. This paper describes two parallel implementations of GIZA++ that accelerate this word alignment process. One of the implementations runs on computer clusters, the other runs on multi-processor system using multi-threading technology. Results show a near-linear speed-up according to the number of CPUs used, and alignment quality is preserved.
| Year | Citations | |
|---|---|---|
Page 1
Page 1