Publication | Open Access
Efficient coflow scheduling with Varys
Citations: 430
References: 36
Year: 2014
Unknown Venue
Parallel Flows, Cluster Computing, Engineering, Computer Architecture, Coflow Abstraction, Operations Research, Data-intensive Platform, Systems Engineering, Parallel Computing, Combinatorial Optimization, Job Scheduler, Network Scheduling, Cloud Scheduling, Computer Engineering, Scheduling (Computing), Computer Science, Efficient Coflow, Scheduling Problem, Edge Computing, Cloud Computing, Parallel Programming, Data-level Parallelism
Communication in data-parallel applications often involves a collection of parallel flows. Traditional techniques to optimize flow-level metrics do not perform well in optimizing such collections, because the network is largely agnostic to application-level requirements. The recently proposed coflow abstraction bridges this gap and creates new opportunities for network scheduling. In this paper, we address inter-coflow scheduling for two different objectives: decreasing communication time of data-intensive jobs and guaranteeing predictable communication time. We introduce the concurrent open shop scheduling with coupled resources problem, analyze its complexity, and propose effective heuristics to optimize either objective. We present Varys, a system that enables data-intensive frameworks to use coflows and the proposed algorithms while maintaining high network utilization and guaranteeing starvation freedom. EC2 deployments and trace-driven simulations show that communication stages complete up to 3.16X faster on average and up to 2X more coflows meet their deadlines using Varys in comparison to per-flow mechanisms. Moreover, Varys outperforms non-preemptive coflow schedulers by more than 5X.
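To make the contrast with per-flow metrics concrete, here is a minimal sketch, not the paper's implementation. It treats a coflow as a list of flows and assumes the coflow finishes only when its most heavily loaded port drains. It then orders coflows by that bottleneck time, in the spirit of a smallest-bottleneck-first heuristic. The names `bottleneck_time` and `order_coflows`, the uniform `port_bandwidth`, and the sample flow sizes are all illustrative assumptions.

```python
from collections import defaultdict


def bottleneck_time(flows, port_bandwidth):
    """Time the most loaded port needs to drain a coflow.

    flows: list of (sender, receiver, size) tuples.
    port_bandwidth: assumed uniform capacity of every port, in size units per second.
    """
    send_load = defaultdict(float)  # total size leaving each sender
    recv_load = defaultdict(float)  # total size arriving at each receiver
    for sender, receiver, size in flows:
        send_load[sender] += size
        recv_load[receiver] += size
    heaviest = max(list(send_load.values()) + list(recv_load.values()))
    return heaviest / port_bandwidth


def order_coflows(coflows, port_bandwidth):
    """Return coflow names sorted by ascending bottleneck time."""
    return sorted(
        coflows,
        key=lambda name: bottleneck_time(coflows[name], port_bandwidth),
    )


# Hypothetical workload: two shuffles sharing the same senders.
coflows = {
    "shuffle_a": [("m1", "r1", 400.0), ("m2", "r1", 400.0)],
    "shuffle_b": [("m1", "r2", 100.0), ("m2", "r3", 100.0)],
}
order = order_coflows(coflows, port_bandwidth=100.0)
print(order)
```

In this toy input, `shuffle_a` is limited by receiver `r1`, which must absorb both of its flows, so it is ordered after the lighter `shuffle_b`. A per-flow scheduler would see only four independent flows and would have no notion of this shared bottleneck.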