Publication | Open Access
Waldo
45
Citations
32
References
2017
Year
Unknown Venue
Natural Language ProcessingArtificial IntelligenceEngineeringInformation RetrievalData ScienceCrowd CostEntity DisambiguationKnowledge DiscoveryCrowd Entity ResolutionData IntegrationComputer ScienceCrowdsourcingSemantic WebNamed-entity RecognitionData ManagementHuman ComputationEntity Resolution
In Entity Resolution, the objective is to find which records of a dataset refer to the same real-world entity. Crowd Entity Resolution uses humans, in addition to machine algorithms, to improve the quality of the outcome. We study a hybrid approach that combines two common interfaces for human tasks in Crowd Entity Resolution, taking into account key observations about the advantages and disadvantages of the two interfaces. We give a formal definition to the problem of human task selection and we derive algorithms with strong optimality guarantees. Our experiments with four real-world datasets show that our hybrid approach gives an improvement of 50% to 300% in the crowd cost to resolve a dataset, compared to using a single interface.
| Year | Citations | |
|---|---|---|
Page 1
Page 1