Publication | Open Access
Automatic Transliteration of Romanized Dialectal Arabic
Citations: 71
References: 20
Year: 2014
Venue: Unknown
Keywords: Arabic Dialect Linguistics, Engineering, Arabic Morphological Analysis, Arabic Orthography, Corpus Linguistics, Speech Recognition, Natural Language Processing, Latin Script, Arabic, Computational Linguistics, Historical Linguistics, Arabic Dialect Orthography, Language Studies, Character Recognition, Machine Translation, Computer-assisted Translation, Morphology, Automatic Transliteration, Arabic Dialect Morphological Analysis, CODA Convention, Neural Machine Translation, Speech Translation, Language Recognition, Linguistics
In this paper, we address the problem of converting Dialectal Arabic (DA) text written in the Latin script (known as Arabizi) into Arabic script, following the CODA convention for DA orthography. The presented system uses a finite-state transducer trained at the character level to generate all possible transliterations of each input Arabizi word. We then filter the generated candidates using a DA morphological analyzer. Finally, we select the best candidate for each input word using a language model. The system achieves an accuracy of 69.4% on an unseen test set, compared to 63.1% for a system that represents a previously proposed approach.
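The three stages named in the abstract (candidate generation, morphological filtering, language-model selection) can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the hand-written `CHAR_MAP` stands in for the trained character-level transducer, the `LEXICON` set stands in for the DA morphological analyzer, the smoothed unigram counts stand in for the language model, and the back-off to unfiltered candidates is a choice of this sketch.

```python
import math
from itertools import product

# Hypothetical character mapping (Arabizi character -> possible Arabic outputs).
# The paper learns such mappings from data; these entries are illustrative only.
CHAR_MAP = {
    "k": ["ك"],
    "t": ["ت", "ط"],
    "b": ["ب"],
    "a": ["ا", ""],  # short vowels are often left unwritten in Arabic script
    "i": ["ي", ""],
    "7": ["ح"],
}

# Toy word list standing in for a morphological analyzer (assumption).
LEXICON = {"كتاب", "كتب", "حب"}

# Toy unigram counts standing in for a language model (assumption).
UNIGRAM_COUNTS = {"كتاب": 8, "كتب": 2, "حب": 5}
TOTAL = sum(UNIGRAM_COUNTS.values())


def generate_candidates(word):
    """Stage 1: expand every character into all of its possible outputs."""
    options = [CHAR_MAP.get(ch, [ch]) for ch in word]
    return {"".join(combo) for combo in product(*options)}


def filter_candidates(candidates):
    """Stage 2: keep candidates the (toy) analyzer recognizes.

    Backing off to the unfiltered set when nothing is recognized is a
    choice of this sketch, not something the abstract specifies.
    """
    kept = {c for c in candidates if c in LEXICON}
    return kept or candidates


def score(word):
    """Add-one smoothed unigram log-probability."""
    vocab_size = len(UNIGRAM_COUNTS) + 1
    return math.log((UNIGRAM_COUNTS.get(word, 0) + 1) / (TOTAL + vocab_size))


def transliterate(word):
    """Stage 3: pick the highest-scoring surviving candidate."""
    candidates = filter_candidates(generate_candidates(word.lower()))
    # Sorting first makes tie-breaking deterministic.
    return max(sorted(candidates), key=score)


if __name__ == "__main__":
    for arabizi in ["kitab", "7ab"]:
        print(arabizi, transliterate(arabizi))
```

A unigram model scores each word in isolation; a language model that conditions on neighboring words would choose among candidates in context, which this sketch does not attempt.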