Publication | Closed Access
Locality-aware Partitioning in Parallel Database Systems
74
Citations
15
References
2015
Year
Unknown Venue
Cluster ComputingEngineeringComputer ArchitecturePartitioning SchemesMap-reduceDatabase SchemaData ScienceData IntegrationParallel ComputingData ManagementParallel Database SystemsParallel DatabaseComputer ScienceDistributed Query ProcessingLocality-aware PartitioningPartition (Database)Cloud ComputingParallel ProgrammingData-level ParallelismBig Data
Parallel database systems horizontally partition large amounts of structured data in order to provide parallel data processing capabilities for analytical workloads in shared-nothing clusters. One major challenge when horizontally partitioning large amounts of data is to reduce the network costs for a given workload and a database schema. A common technique to reduce the network costs in parallel database systems is to co-partition tables on their join key in order to avoid expensive remote join operations. However, existing partitioning schemes are limited in that respect since only subsets of tables in complex schemata sharing the same join key can be co-partitioned unless tables are fully replicated.
| Year | Citations | |
|---|---|---|
Page 1
Page 1