Publication | Closed Access
A candidate filtering mechanism for fast top-k query processing on modern cpus
24
Citations
28
References
2013
Year
Unknown Venue
Cluster ComputingEngineeringComputer ArchitectureCandidate Filtering MechanismBlock-max IndexesText MiningSuch Top-k QueriesInformation RetrievalData ScienceData MiningParallel ComputingData ManagementModern CpusHigh-performance Data AnalyticsKnowledge DiscoveryComputer ScienceBig Data SearchBlock MaximaDistributed Query ProcessingQuery OptimizationData IndexingParallel ProgrammingSearch Engine IndexingApproximate Query AnsweringBig Data
A large amount of research has focused on faster methods for finding top-k results in large document collections, one of the main scalability challenges for web search engines. In this paper, we propose a method for accelerating such top-k queries that builds on and generalizes methods recently proposed by several groups of researchers based on Block-Max Indexes. In particular, we describe a system that uses a new filtering mechanism, based on a combination of block maxima and bitmaps, that radically reduces the number of documents that have to be further evaluated. Our filtering mechanism exploits the SIMD processing capabilities of current microprocessors, and it is optimized through caching policies that select and store suitable filter structures based on properties of the query load. Our experimental evaluation shows that the mechanism results in very significant speed-ups for disjunctive top-k queries under several state-of-the-art algorithms, including a speed-up of more than a factor of 2 over the fastest previously known methods.
| Year | Citations | |
|---|---|---|
Page 1
Page 1