Publication | Closed Access
Extended Characteristic Sets: Graph Indexing for SPARQL Query Optimization
Citations: 37
References: 15
Year: 2017
Unknown Venue
Engineering, Semantic Web, Information Retrieval, Data Science, Data Mining, Graph Query Language, Database System, Management, Data Integration, Semi-structured Data, Big Data, Data Management, RDF Storage, Very Large Database, Knowledge Discovery, Schema Abstraction, Computer Science, Graph Indexing, SPARQL Query Execution, Query Optimization, Data Indexing, Graph Theory, Data Modeling
SPARQL query execution in state-of-the-art RDF engines depends on, and is often limited by, the underlying storage and indexing schemes. Typically, these systems exhaustively store permutations of the standard three-column triples table. However, even though RDF can give rise to datasets with loosely defined schemas, it is common for an emerging structure to appear in the data. In this paper, we introduce a novel indexing scheme for RDF data that takes advantage of the inherent structure of triples. To this end, we define the Extended Characteristic Set (ECS), a schema abstraction that classifies triples based on the properties of their subjects and objects, and we discuss methods and algorithms for the identification and extraction of ECSs. We show how these can be used to assist query processing, and we implement axonDB, an RDF storage and querying engine based on ECS indexing. We perform an experimental evaluation on real-world and synthetic datasets and observe that axonDB outperforms the competition by a few orders of magnitude.
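To make the ECS idea more concrete, the following is a minimal Python sketch of one way triples could be classified by the properties of their subjects and objects. It is an illustration under stated assumptions, not the axonDB implementation: it assumes that a node's characteristic set is the set of predicates it emits as a subject, and that only triples whose object also appears as a subject are assigned to an ECS. The function names and the sample data are hypothetical.

```python
from collections import defaultdict


def characteristic_sets(triples):
    """Map each subject to the frozenset of predicates it emits.

    Assumption: a node's characteristic set (CS) is the set of
    properties that appear on triples where the node is the subject.
    """
    props = defaultdict(set)
    for s, p, _ in triples:
        props[s].add(p)
    return {s: frozenset(ps) for s, ps in props.items()}


def extended_characteristic_sets(triples):
    """Group triples by the pair (CS of subject, CS of object).

    Assumption: triples whose object never appears as a subject
    (for example, literals) have no object CS and are skipped.
    """
    cs = characteristic_sets(triples)
    ecs = defaultdict(list)
    for s, p, o in triples:
        if o in cs:
            ecs[(cs[s], cs[o])].append((s, p, o))
    return dict(ecs)


# Hypothetical sample data: two people who work for the same company.
triples = [
    ("alice", "name", '"Alice"'),
    ("alice", "worksFor", "acme"),
    ("bob", "name", '"Bob"'),
    ("bob", "worksFor", "acme"),
    ("acme", "label", '"ACME"'),
]

ecs_index = extended_characteristic_sets(triples)
for (subject_cs, object_cs), members in ecs_index.items():
    print(sorted(subject_cs), "->", sorted(object_cs), ":", members)
```

In this sketch, both `worksFor` triples fall into a single ECS because `alice` and `bob` share one characteristic set and both point to `acme`. A query joining people to their employers could then scan that one group instead of the whole triples table.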