Concepedia

Publication | Closed Access

Pairwise similarity of TopSig document signatures

Citations: 11

References: 11

Year: 2012

Abstract

This paper analyses the pairwise distances between signatures produced by the TopSig retrieval model on two document collections. The distribution of these distances is compared with that of purely random signatures. The comparison explains why TopSig is competitive with state-of-the-art retrieval models only at early precision. Only the local neighbourhood of the signatures is interpretable. We suggest that this is a common property of vector space models.
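
To make the random baseline in the abstract concrete, the following is a minimal sketch, assuming signatures are fixed-width binary vectors compared by Hamming distance. The signature width, sample size, and all function names are hypothetical choices for illustration and are not taken from the paper; the sketch covers only the purely random case, not TopSig itself.

```python
import random
from itertools import combinations
from statistics import mean, pstdev


def random_signature(n_bits, rng):
    """Return a purely random signature as an n_bits-wide integer."""
    return rng.getrandbits(n_bits)


def hamming(a, b):
    """Hamming distance between two signatures stored as integers."""
    return bin(a ^ b).count("1")


def pairwise_distances(signatures):
    """Hamming distances for every unordered pair of signatures."""
    return [hamming(a, b) for a, b in combinations(signatures, 2)]


rng = random.Random(0)  # fixed seed, for repeatability only
N_BITS = 1024           # assumed signature width (hypothetical)
N_DOCS = 200            # assumed number of signatures (hypothetical)

baseline = [random_signature(N_BITS, rng) for _ in range(N_DOCS)]
distances = pairwise_distances(baseline)

# For independent fair bits, each pairwise distance follows
# Binomial(N_BITS, 0.5): mean N_BITS / 2, std. dev. sqrt(N_BITS) / 2.
print(mean(distances), pstdev(distances))
```

Under these assumptions the random distances cluster tightly around half the signature width. Signatures from a real collection could be passed to the same `pairwise_distances` helper to see how far their distribution departs from this baseline.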
