Publication | Closed Access
Tiling stencil computations to maximize parallelism
96
Citations
24
References
2012
Year
EngineeringCompiler TechnologyComputer ArchitectureHardware SystemsSuch HyperplanesConcurrent StartComputing SystemsStencil ComputationsParallel ComputingCompilersComputational GeometryInstruction-level ParallelismProgramming LanguagesMassively-parallel ComputingParallelizing CompilerCompiler SupportComputer EngineeringComputer ScienceProgram AnalysisParallel ProcessingParallel ProgrammingMost Stencil ComputationsData-level Parallelism
Most stencil computations allow tile-wise concurrent start, i.e., there always exists a face of the iteration space and a set of tiling hyperplanes such that all tiles along that face can be started concurrently. This provides load balance and maximizes parallelism. However, existing automatic tiling frameworks often choose hyperplanes that lead to pipelined start-up and load imbalance. We address this issue with a new tiling technique that ensures concurrent start-up as well as perfect load-balance whenever possible. We first provide necessary and sufficient conditions on tiling hyperplanes to enable concurrent start for programs with affine data accesses. We then provide an approach to find such hyperplanes. Experimental evaluation on a 12-core Intel Westmere shows that our code is able to outperform a tuned domain-specific stencil code generator by 4% to 27%, and previous compiler techniques by a factor of 2× to 10.14×.
| Year | Citations | |
|---|---|---|
Page 1
Page 1