Publication | Open Access
Heuristic adaptability to input dynamics for SpMM on GPUs
24
Citations
7
References
2022
Year
Massively-parallel ComputingComputational ScienceGpu ArchitectureEngineeringHardware AccelerationArray ComputingSpmm AccelerationComputer EngineeringComputer ArchitectureParallel ProgrammingComputer ScienceModeling And SimulationOrthogonal Design PrinciplesParallel ComputingGpu ClusterHeuristic AdaptabilityGpu ComputingVectorization
Sparse Matrix-Matrix Multiplication (SpMM) has served as fundamental components in various domains. Many previous studies exploit GPUs for SpMM acceleration because GPUs provide high bandwidth and parallelism. We point out that a static design does not always improve the performance of SpMM on different input data (e.g., >85% performance loss with a single algorithm). In this paper, we consider the challenge of input dynamics from a novel auto-tuning perspective, while following issues remain to be solved: (1) Orthogonal design principles considering sparsity. Orthogonal design principles for such a sparse problem should be extracted to form different algorithms, and further used for performance tuning. (2) Nontrivial implementations in the algorithm space. Combining orthogonal design principles to create new algorithms needs to tackle with new challenges like thread race handling. (3) Heuristic adaptability to input dynamics. The heuristic adaptability is required to dynamically optimize code for input dynamics.
| Year | Citations | |
|---|---|---|
Page 1
Page 1