Publication | Open Access
Groute
117
Citations
25
References
2017
Year
Unknown Venue
Cluster ComputingIrregular AlgorithmsHeterogeneous ComputingIrregular ApplicationsEngineeringMassively-parallel ComputingGpu ArchitectureEdge ComputingCloud ComputingComputer ArchitectureComputer EngineeringMultiple GpusParallel ProgrammingComputer ScienceParallel ComputingGpu ClusterGpu Computing
Nodes with multiple GPUs are becoming the platform of choice for high-performance computing. However, most applications are written using bulk-synchronous programming models, which may not be optimal for irregular algorithms that benefit from low-latency, asynchronous communication. This paper proposes constructs for asynchronous multi-GPU programming, and describes their implementation in a thin runtime environment called Groute. Groute also implements common collective operations and distributed work-lists, enabling the development of irregular applications without substantial programming effort. We demonstrate that this approach achieves state-of-the-art performance and exhibits strong scaling for a suite of irregular applications on 8-GPU and heterogeneous systems, yielding over 7x speedup for some algorithms.
| Year | Citations | |
|---|---|---|
Page 1
Page 1