Publication | Closed Access
iDistance
590
Citations
33
References
2005
Year
EngineeringInformation RetrievalData ScienceData MiningPattern RecognitionKnn SearchIdistance PartitionsIndexing TechniqueKnowledge DiscoveryBig Data IndexingComputer ScienceBig Data SearchIdistance Knn SearchData ManagementSimilarity SearchBig DataData Indexing
In this article, we present an efficient B + -tree based indexing method, called iDistance, for K-nearest neighbor (KNN) search in a high-dimensional metric space. iDistance partitions the data based on a space- or data-partitioning strategy, and selects a reference point for each partition. The data points in each partition are transformed into a single dimensional value based on their similarity with respect to the reference point. This allows the points to be indexed using a B + -tree structure and KNN search to be performed using one-dimensional range search. The choice of partition and reference points adapts the index structure to the data distribution.We conducted extensive experiments to evaluate the iDistance technique, and report results demonstrating its effectiveness. We also present a cost model for iDistance KNN search, which can be exploited in query optimization.
| Year | Citations | |
|---|---|---|
Page 1
Page 1