Publication | Closed Access
Simultaneous Multikernel GPU: Multi-tasking throughput processors via fine-grained sharing
156
Citations
26
References
2016
Year
Unknown Venue
Cluster ComputingHeterogeneous ComputingEngineeringComputer ArchitectureGpu ComputingHardware SecurityCompute KernelParallel ComputingGpu ThroughputComputer EngineeringComputer ScienceGpu ClusterSimultaneous Multikernel GpuGpu ArchitectureEdge ComputingCloud ComputingParallel ProgrammingGpu HardwareGpu Resource UtilizationSystem Software
Studies show that non-graphics programs can be less optimized for the GPU hardware, leading to significant resource under-utilization. Sharing the GPU among multiple programs can effectively improve utilization, which is particularly attractive to systems where many applications require access to the GPU (e.g., cloud computing). However, current GPUs lack proper architecture features to support sharing. Initial attempts are preliminary: They either provide only static sharing, which requires recompilation or code transformation, or they do not effectively improve GPU resource utilization. We propose Simultaneous Multikernel (SMK), a fine-grain dynamic sharing mechanism, that fully utilizes resources within a streaming multiprocessor by exploiting heterogeneity of different kernels. We propose several resource allocation strategies to improve system throughput while maintaining fairness. Our evaluation shows that for shared workloads with complementary resource occupancy, SMK improves GPU throughput by 52% over non-shared execution and 17% over a state-of-the-art design.
| Year | Citations | |
|---|---|---|
Page 1
Page 1