Publication | Closed Access
Efficiently matching sets of features with random histograms
50
Citations
27
References
2008
Year
Unknown Venue
EngineeringMachine LearningRandom HistogramsImage RetrievalImage SearchImage AnalysisInformation RetrievalData ScienceData MiningPattern RecognitionSingle Feature VectorSearch EngineMachine VisionKnowledge DiscoveryComputer ScienceImage SimilarityDeep LearningComputer VisionRandomized AlgorithmSimilarity SearchContent-based Image Retrieval
As the commonly used representation of a feature-rich data object has evolved from a single feature vector to a set of feature vectors, a key challenge in building a content-based search engine for feature-rich data is to match feature-sets efficiently. Although substantial progress has been made during the past few years, existing approaches are still inefficient and inflexible for building a search engine for massive datasets. This paper presents a randomized algorithm to embed a set of features into a single high-dimensional vector to simplify the feature-set matching problem. The main idea is to project feature vectors into an auxiliary space using locality sensitive hashing and to represent a set of features as a histogram in the auxiliary space. A histogram is simply a high dimensional vector, and efficient similarity measures like L1 and L2 distances can be employed to approximate feature-set distance measures.
| Year | Citations | |
|---|---|---|
Page 1
Page 1