Publication | Closed Access
Truth Discovery with Multiple Conflicting Information Providers on the Web
662
Citations
12
References
2008
Year
EngineeringKnowledge ExtractionVeracity ProblemSemantic WebInformation QualityMisinformationText MiningInformation RetrievalData ScienceData MiningData IntegrationDisinformation DetectionContent AnalysisPost-truthTrue InformationKnowledge DiscoveryComputer ScienceFact CheckingAutomated ReasoningWeb IntelligenceBelief MergingTruth Discovery
The Web is the primary information source for most people, yet its content is not guaranteed to be correct and often contains conflicting information across sites. This study introduces the Veracity problem, aiming to determine true facts from large volumes of contradictory data provided by many websites. The authors propose a general framework and the TRUTHFlNDER algorithm, which iteratively assesses website trustworthiness and information correctness by exploiting mutual reinforcement between sites and facts. Experiments demonstrate that TRUTHFlNDER accurately extracts true facts and identifies trustworthy sites more effectively than popular search engines.
The World Wide Web has become the most important information source for most of us. Unfortunately, there is no guarantee for the correctness of information on the Web. Moreover, different websites often provide conflicting information on a subject, such as different specifications for the same product. In this paper, we propose a new problem, called Veracity, i.e., conformity to truth, which studies how to find true facts from a large amount of conflicting information on many subjects that is provided by various websites. We design a general framework for the Veracity problem and invent an algorithm, called TRUTHFlNDER, which utilizes the relationships between websites and their information, i.e., a website is trustworthy if it provides many pieces of true information, and a piece of information is likely to be true if it is provided by many trustworthy websites. An iterative method is used to infer the trustworthiness of websites and the correctness of information from each other. Our experiments show that TRUTHFlNDER successfully finds true facts among conflicting information and identifies trustworthy websites better than the popular search engines.
| Year | Citations | |
|---|---|---|
Page 1
Page 1