Publication | Closed Access
Balancing reducer skew in MapReduce workloads using progressive sampling
81
Citations
30
References
2012
Year
Unknown Venue
Cluster ComputingLoad Balancing (Computing)EngineeringComputer ArchitectureStatic LoadMap-reduceDistributed Data AnalyticsProgressive SamplingElapsed TimeData ScienceParallel ComputingParallel JobJob SchedulerComputer EngineeringComputer ScienceEdge ComputingParallel ProcessingCloud ComputingParallel ProgrammingData-level ParallelismBig Data
The elapsed time of a parallel job depends on the completion time of its longest running constituent. We present a static load balancing algorithm that distributes work evenly across the reducers in a MapReduce job resulting in significant elapsed time reductions.
| Year | Citations | |
|---|---|---|
Page 1
Page 1