Publication | Closed Access
Cross-lingual knowledge linking across wiki knowledge bases
88
Citations
30
References
2012
Year
Unknown Venue
EngineeringSemantic WebLink PredictionCorpus LinguisticsSemantic WikiText MiningCross-lingual KnowledgeKnowledge Graph EmbeddingsInformation RetrievalData ScienceComputational LinguisticsLanguage StudiesMachine TranslationEntity DisambiguationEnglish WikipediaKnowledge DiscoveryCross-language RetrievalWiki Knowledge BasesChinese ArticlesKnowledge BaseSemantic GraphLinguistics
Wikipedia becomes one of the largest knowledge bases on the Web. It has attracted 513 million page views per day in January 2012. However, one critical issue for Wikipedia is that articles in different language are very unbalanced. For example, the number of articles on Wikipedia in English has reached 3.8 million, while the number of Chinese articles is still less than half million and there are only 217 thousand cross-lingual links between articles of the two languages. On the other hand, there are more than 3.9 million Chinese Wiki articles on Baidu Baike and Hudong.com, two popular encyclopedias in Chinese. One important question is how to link the knowledge entries distributed in different knowledge bases. This will immensely enrich the information in the online knowledge bases and benefit many applications. In this paper, we study the problem of cross-lingual knowledge linking and present a linkage factor graph model. Features are defined according to some interesting observations. Experiments on the Wikipedia data set show that our approach can achieve a high precision of 85.8% with a recall of 88.1%. The approach found 202,141 new cross-lingual links between English Wikipedia and Baidu Baike.
| Year | Citations | |
|---|---|---|
Page 1
Page 1