Publication | Open Access
JHU1
16
Citations
5
References
2007
Year
Unknown Venue
EngineeringCorpus LinguisticsText MiningNatural Language ProcessingInformation RetrievalData ScienceData MiningComputational LinguisticsPerson Name DisambiguationPolysemous NamesLanguage StudiesNamed-entity RecognitionRich-feature-enhanced Document VectorsEntity DisambiguationKnowledge DiscoveryTerminology ExtractionVector Space ModelLinguisticsWord-sense Disambiguation
This paper presents an approach to person name disambiguation using K-means clustering on rich-feature-enhanced document vectors, augmented with additional web-extracted snippets surrounding the polysemous names to facilitate term bridging. This yields a significant F-measure improvement on the shared task training data set. The paper also illustrates the significant divergence between the properties of the training and test data in this shared task, substantially skewing results. Our system optimized on F0.2 rather than F0.5 would have achieved top performance in the shared task.
| Year | Citations | |
|---|---|---|
Page 1
Page 1