Publication | Open Access
A Scalable Index for Top-k Subtree Similarity Queries
11
Citations
24
References
2019
Year
Unknown Venue
Query Tree QClassical SolutionEngineeringBig Data IndexingText MiningInformation RetrievalData ScienceData MiningScalable IndexKnowledge DiscoveryText IndexingComputer ScienceTree Edit DistanceData IndexingGraph TheoryBusinessSearch Engine IndexingIndexing TechniqueSimilarity Search
Given a query tree Q, the top-k subtree similarity query retrieves the k subtrees in a large document tree T that are closest to Q in terms of tree edit distance. The classical solution scans the entire document, which is slow. The state-of-the-art approach precomputes an index to reduce the query time. However, the index is large (quadratic in the document size), building the index is expensive, updates are not supported, and data-specific tuning is required.
| Year | Citations | |
|---|---|---|
Page 1
Page 1