Publication | Closed Access
Pairwise similarity of TopSig document signatures
11
Citations
11
References
2012
Year
Unknown Venue
EngineeringSimilarity MeasureBiometricsInformation ForensicsText MiningInformation RetrievalData ScienceData MiningPattern RecognitionRelevance FeedbackTopsig Retrieval ModelData RetrievalVector Space ModelsStatisticsKnowledge DiscoveryComputer ScienceVector Space ModelPairwise SimilaritySimilarity SearchDocument ProcessingPairwise Distances
This paper analyses the pairwise distances of signatures produced by the TopSig retrieval model on two document collections. The distribution of the distances are compared to purely random signatures. It explains why TopSig is only competitive with state of the art retrieval models at early precision. Only the local neighbourhood of the signatures is interpretable. We suggest this is a common property of vector space models.
| Year | Citations | |
|---|---|---|
Page 1
Page 1