Publication | Open Access
Unsupervised sense disambiguation using bilingual probabilistic models
Citations: 29
References: 21
Year: 2004
Venue: Unknown
Engineering, Cross-lingual Representation, Parallel Language, Semantics, Corpus Linguistics, Text Mining, Word Embeddings, Applied Linguistics, Natural Language Processing, Information Retrieval, Data Science, Parallel Corpora, Computational Linguistics, Language Studies, Machine Translation, Bilingual Probabilistic Models, Entity Disambiguation, Knowledge Discovery, Distributional Semantics, Sense Inventory, Linguistics, Word-sense Disambiguation
We describe two probabilistic models for unsupervised word-sense disambiguation using parallel corpora. The first model, which we call the Sense model, builds on the work of Diab and Resnik (2002), which uses both parallel text and a sense inventory for the target language, and recasts their approach in a probabilistic framework. The second model, which we call the Concept model, is a hierarchical model that uses a concept latent variable to relate different language-specific sense labels. We show that both models improve performance on the word-sense disambiguation task over previous unsupervised approaches, with the Concept model showing the largest improvement. Furthermore, learning the Concept model yields, as a by-product, a sense inventory for the parallel language.
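To make the idea of a concept latent variable concrete, the following is a minimal sketch, not the authors' implementation. It assumes one plausible factorization in which a concept generates a sense label in each language and each sense generates the observed word; the abstract does not confirm this exact structure. All names, sense labels, and probabilities below are hypothetical toy values, and the parameters are fixed by hand rather than learned from a parallel corpus.

```python
# Toy illustration of a latent "concept" linking language-specific senses.
# ASSUMED factorization (not confirmed by the source):
#   P(c, s_e, s_f, w_e, w_f) = P(c) P(s_e|c) P(s_f|c) P(w_e|s_e) P(w_f|s_f)
# Every table below is hypothetical; none of the numbers come from the paper.

P_CONCEPT = {"FINANCE": 0.5, "RIVER": 0.5}

# P(English sense | concept)
P_SENSE_EN = {
    "FINANCE": {"bank_institution": 1.0},
    "RIVER": {"bank_shore": 1.0},
}

# P(French sense | concept)
P_SENSE_FR = {
    "FINANCE": {"sf_banque": 1.0},
    "RIVER": {"sf_rive": 1.0},
}

# P(English word | English sense)
P_WORD_EN = {
    "bank_institution": {"bank": 1.0},
    "bank_shore": {"bank": 0.5, "shore": 0.5},
}

# P(French word | French sense)
P_WORD_FR = {
    "sf_banque": {"banque": 1.0},
    "sf_rive": {"rive": 0.9, "berge": 0.1},
}


def english_sense_posterior(word_en, word_fr):
    """Return P(English sense | aligned word pair) by summing out the
    concept and the French sense. Returns {} if the pair has zero
    probability under the toy tables."""
    scores = {}
    for concept, p_c in P_CONCEPT.items():
        for s_en, p_se in P_SENSE_EN[concept].items():
            for s_fr, p_sf in P_SENSE_FR[concept].items():
                joint = (
                    p_c
                    * p_se
                    * p_sf
                    * P_WORD_EN[s_en].get(word_en, 0.0)
                    * P_WORD_FR[s_fr].get(word_fr, 0.0)
                )
                scores[s_en] = scores.get(s_en, 0.0) + joint
    total = sum(scores.values())
    if total == 0.0:
        return {}
    return {sense: value / total for sense, value in scores.items()}


if __name__ == "__main__":
    print(english_sense_posterior("bank", "rive"))
    print(english_sense_posterior("bank", "banque"))
```

In this toy setup the aligned French word is what resolves the ambiguity of the English word "bank". An unsupervised system would estimate the tables from aligned word pairs (for example with an EM-style procedure) instead of fixing them by hand.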