Publication | Open Access
Cross language text categorization by acquiring multilingual domain models from comparable corpora
48
Citations
9
References
2005
Year
Unknown Venue
EngineeringMultilingualismLabeled ExamplesCross-language PerspectiveCorpus LinguisticsText MiningMultilingual Domain ModelsApplied LinguisticsNatural Language ProcessingLanguage DocumentationInformation RetrievalComputational LinguisticsDocument ClassificationLanguage StudiesMachine TranslationAutomatic ClassificationMultilingual ScenarioCross-language RetrievalSource LanguageComparable CorporaLinguistics
In a multilingual scenario, the classical monolingual text categorization problem can be reformulated as a cross language TC task, in which we have to cope with two or more languages (e.g. English and Italian). In this setting, the system is trained using labeled examples in a source language (e.g. English), and it classifies documents in a different target language (e.g. Italian).
| Year | Citations | |
|---|---|---|
Page 1
Page 1