Publication | Open Access
Eliminating the redundancy in blocking-based entity resolution methods
47
Citations
27
References
2011
Year
Unknown Venue
EngineeringSemantic WebText MiningInformation RetrievalData ScienceData MiningManagementData IntegrationSemi-structured DataNamed-entity RecognitionData ManagementEntity ResolutionVery Large DatabaseEntity DisambiguationKnowledge DiscoveryComputer ScienceDatabase TheoryCitation MatchingRecord LinkageAutomated ReasoningMultiple BlocksFormal MethodsSimilarity Search
Entity resolution is the task of identifying entities that refer to the same real-world object. It has important applications in the context of digital libraries, such as citation matching and author disambiguation. Blocking is an established methodology for efficiently addressing this problem; it clusters similar entities together, and compares solely entities inside each cluster. In order to effectively deal with the current large, noisy and heterogeneous data collections, novel blocking methods that rely on redundancy have been introduced: they associate each entity with multiple blocks in order to increase recall, thus increasing the computational cost, as well.
| Year | Citations | |
|---|---|---|
Page 1
Page 1