Publication | Closed Access
Skew-Aware Join Optimization for Array Databases
31
Citations
39
References
2015
Year
Unknown Venue
Cluster ComputingEngineeringQuery ProcessingData ScienceData MiningManagementData IntegrationBig DataSpatial QueriesData ManagementParallel DatabaseVery Large DatabaseSkew-aware Join OptimizationKnowledge DiscoveryComputer EngineeringComputer ScienceDistributed Query ProcessingScience ApplicationsMultidimensional DatabaseQuery OptimizationParallel ProgrammingMassive Data ProcessingData Modeling
Science applications are accumulating an ever-increasing amount of multidimensional data. Although some of it can be processed in a relational database, much of it is better suited to array-based engines. As such, it is important to optimize the query processing of these systems. This paper focuses on efficient query processing of join operations within an array database. These engines invariably ``chunk'' their data into multidimensional tiles that they use to efficiently process spatial queries. As such, traditional relational algorithms need to be substantially modified to take advantage of array tiles. Moreover, most n-dimensional science data is unevenly distributed in array space because its underlying observations rarely follow a uniform pattern. It is crucial that the optimization of array joins be skew-aware. In addition, owing to the scale of science applications, their query processing usually spans multiple nodes. This further complicates the planning of array joins.
| Year | Citations | |
|---|---|---|
Page 1
Page 1