Publication | Closed Access
Corroborate and learn facts from the web
16
Citations
11
References
2007
Year
Unknown Venue
Robust Bootstrapping ApproachEngineeringKnowledge ExtractionIntelligent Information RetrievalSemantic WebCorpus LinguisticsText MiningNatural Language ProcessingInformation RetrievalData ScienceData MiningContent AnalysisLearn FactsKnowledge DiscoveryWebometricsBootstrapping ProcessWeb ScienceInformation ExtractionWeb MiningWeb IntelligenceCountry Facts
The web contains lots of interesting factual information about entities, such as celebrities, movies or products. This paper describes a robust bootstrapping approach to corroborate facts and learn more facts simultaneously. This approach starts with retrieving relevant pages from a crawl repository for each entity in the seed set. In each learning cycle, known facts of an entity are corroborated first in a relevant page to find fact mentions. When fact mentions are found, they are taken as examples for learning new facts from the page via HTML pattern discovery. Extracted new facts are added to the known fact set for the next learning cycle. The bootstrapping process continues until no new facts can be learned. This approach is language-independent. It demonstrated good performance in experiment on country facts. Results of a large scale experiment will also be shown with initial facts imported from wikipedia.
| Year | Citations | |
|---|---|---|
Page 1
Page 1