Publication | Closed Access
Panacea
32
Citations
25
References
2012
Year
Unknown Venue
Cluster ComputingEngineeringComputer ArchitectureMap-reduceData ScienceParallel ComputingLegacy Mapreduce ApplicationsData ManagementHigh-performance Data AnalyticsParallelizing CompilerComputer EngineeringComputer ScienceScalable ComputingHolistic Compiler OptimizationsXeon ServersCloud ComputingParallel ProgrammingMassive Data ProcessingBig Data
MapReduce has emerged as one of the most popular programming models for data parallel enterprise applications. Despite advances in runtime, the opportunities for optimizing MapReduce applications remain largely unexplored. In this paper, we present a framework for performing holistic compiler optimizations on legacy MapReduce applications. We have identified and implemented two optimizations and evaluated them with a set of Hadoop applications on a cluster of Xeon servers. Our experiments show that performance gains of more than 3X can be achieved without user involvement.
| Year | Citations | |
|---|---|---|
Page 1
Page 1