Publication | Open Access
Dynamic Resource Management for Efficient Utilization of Multitasking GPUs
68
Citations
25
References
2017
Year
Unknown Venue
Cluster ComputingMultiple KernelsGpu ArchitectureEngineeringCompute KernelData ScienceGpu BenchmarkingEdge ComputingCloud ComputingDynamic Resource ManagementComputer EngineeringComputer ArchitectureParallel ProgrammingComputer ScienceGpu MaestroParallel ComputingGpu ClusterGpu Computing
As graphics processing units (GPUs) are broadly adopted, running multiple applications on a GPU at the same time is beginning to attract wide attention. Recent proposals on multitasking GPUs have focused on either spatial multitasking, which partitions GPU resource at a streaming multiprocessor (SM) granularity, or simultaneous multikernel (SMK), which runs multiple kernels on the same SM. However, multitasking performance varies heavily depending on the resource partitions within each scheme, and the application mixes. In this paper, we propose GPU Maestro that performs dynamic resource management for efficient utilization of multitasking GPUs. GPU Maestro can discover the best performing GPU resource partition exploiting both spatial multitasking and SMK. Furthermore, dynamism within a kernel and interference between the kernels are automatically considered because GPU Maestro finds the best performing partition through direct measurements. Evaluations show that GPU Maestro can improve average system throughput by 20.2% and 13.9% over the baseline spatial multitasking and SMK, respectively.
| Year | Citations | |
|---|---|---|
Page 1
Page 1