A Scalable Index for Top-k Subtree Similarity Queries

Concepedia

Publication | Open Access

DOI Full Paper Access

Citations

References

2019

Year

Daniel Kocher, Nikolaus Augsten

Unknown Venue

Query Tree QClassical SolutionEngineeringBig Data IndexingText MiningInformation RetrievalData ScienceData MiningScalable IndexKnowledge DiscoveryText IndexingComputer ScienceTree Edit DistanceData IndexingGraph TheoryBusinessSearch Engine IndexingIndexing TechniqueSimilarity Search

Abstract

Given a query tree Q, the top-k subtree similarity query retrieves the k subtrees in a large document tree T that are closest to Q in terms of tree edit distance. The classical solution scans the entire document, which is slow. The state-of-the-art approach precomputes an index to reduce the query time. However, the index is large (quadratic in the document size), building the index is expensive, updates are not supported, and data-specific tuning is required.

References

Page 1

	Year	Citations

Page 1