Publication | Closed Access
Entity Resolution with crowd errors
73
Citations
19
References
2015
Year
Unknown Venue
EngineeringSemantic WebLanguage ProcessingText MiningQuery SuggestionNatural Language ProcessingInformation RetrievalData ScienceComputational LinguisticsManagementData IntegrationQuery ExpansionNamed-entity RecognitionData ManagementEntity ResolutionQuestion AnsweringEntity DisambiguationKnowledge DiscoveryComputer ScienceQuery OptimizationRelational QueriesSame EntityCrowd ComputingAutomated ReasoningApproximate Query AnsweringMaximum Likelihood Formulation
Given a set of records, an Entity Resolution (ER) algorithm finds records that refer to the same real-world entity. Humans can often determine if two records refer to the same entity, and hence we study the problem of selecting questions to ask error-prone humans. We give a Maximum Likelihood formulation for the problem of finding the “most beneficial” questions to ask next. Our theoretical results lead to a lightweight and practical algorithm, bDENSE, for selecting questions to ask humans. Our experimental results show that bDENSE can more quickly reach an accurate outcome, compared to two approaches proposed recently. Moreover, through our experimental evaluation, we identify the strengths and weaknesses of all three approaches.
| Year | Citations | |
|---|---|---|
Page 1
Page 1