Publication | Open Access
Inverted indexing for cross-lingual NLP
83
Citations
21
References
2015
Year
Unknown Venue
EngineeringCross-lingual RepresentationInter-lingual Word RepresentationsCorpus LinguisticsInverted IndexingText MiningWord EmbeddingsNatural Language ProcessingApplied LinguisticsCross-lingual NlpInformation RetrievalComputational LinguisticsLanguage StudiesCount-based ApproachMachine TranslationCross-language RetrievalText IndexingLinguisticsPo Tagging
We present a novel, count-based approach to obtaining inter-lingual word representations based on inverted indexing of Wikipedia. We present experiments applying these representations to 17 datasets in document classification, POS tagging, dependency parsing, and word alignment. Our approach has the advantage that it is simple, computationally efficient and almost parameter-free, and, more importantly, it enables multi-source crosslingual learning. In 14/17 cases, we improve over using state-of-the-art bilingual embeddings.
| Year | Citations | |
|---|---|---|
Page 1
Page 1