Publication | Closed Access
Learning to link with wikipedia
1.3K
Citations
16
References
2008
Year
Unknown Venue
Natural Language ProcessingEngineeringInformation RetrievalData ScienceLargest Knowledge BaseEntity DisambiguationComputational LinguisticsKnowledge DiscoveryDocument ClassificationWikipedia ArticlesTerminology ExtractionCross-language RetrievalSemantic WebNamed-entity RecognitionLink PredictionCross-reference DocumentsCorpus LinguisticsText Mining
This paper describes how to automatically cross-reference documents with Wikipedia: the largest knowledge base ever known. It explains how machine learning can be used to identify significant terms within unstructured text, and enrich it with links to the appropriate Wikipedia articles. The resulting link detector and disambiguator performs very well, with recall and precision of almost 75%. This performance is constant whether the system is evaluated on Wikipedia articles or "real world" documents.
| Year | Citations | |
|---|---|---|
Page 1
Page 1