Publication | Closed Access
Chimera
Citations: 147
References: 24
Year: 2015
Unknown Venue
GPU Architecture, Heterogeneous Computing, Engineering, Long Preemption Latency, Compute Kernel, Edge Computing, Context Switching, Computer Architecture, Computer Engineering, Systems Engineering, Parallel Programming, Computer Science, Parallel Computing, Manycore Processor, System Software, GPU Computing, Strict Latency Requirements
The demand for multitasking on graphics processing units (GPUs) is constantly increasing as they have become one of the default components on modern computer systems along with traditional processors (CPUs). Preemptive multitasking on CPUs has been primarily supported through context switching. However, the same preemption strategy incurs substantial overhead due to the large context in GPUs. The overhead comes in two dimensions: a preempting kernel suffers from a long preemption latency, and the system throughput is wasted during the switch. Without precise control over the large preemption overhead, multitasking on GPUs has little use for applications with strict latency requirements.
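The two overhead dimensions named in the abstract can be illustrated with a back-of-the-envelope estimate. The sketch below is not from the paper: the function name `context_switch_overhead`, the cost model (context bytes divided by memory bandwidth, with every streaming multiprocessor idle until the save finishes), and all numeric values are assumptions chosen only to show how a large on-chip context turns into both latency and lost throughput.

```python
def context_switch_overhead(context_bytes_per_sm, num_sms, bandwidth_bytes_per_s):
    """Return (latency_s, idle_sm_seconds) for saving every SM's context.

    Simplifying assumption: all SMs share one memory interface and do no
    useful work until the whole context has been written out. This is a
    rough estimate, not a model of any real GPU.
    """
    if bandwidth_bytes_per_s <= 0:
        raise ValueError("bandwidth must be positive")
    total_bytes = context_bytes_per_sm * num_sms
    # Dimension 1: how long the preempting kernel waits for the save.
    latency_s = total_bytes / bandwidth_bytes_per_s
    # Dimension 2: SM-time during which no kernel makes progress.
    idle_sm_seconds = latency_s * num_sms
    return latency_s, idle_sm_seconds


if __name__ == "__main__":
    # Illustrative values only; none are taken from the paper.
    context_per_sm = (256 + 48) * 1024  # hypothetical registers + shared memory, in bytes
    sms = 15                            # hypothetical number of SMs
    bandwidth = 200e9                   # hypothetical memory bandwidth, bytes per second

    latency, idle = context_switch_overhead(context_per_sm, sms, bandwidth)
    print(f"save latency: {latency * 1e6:.1f} us")
    print(f"idle SM-time: {idle * 1e6:.1f} SM-us")
```

Under this model both costs grow with the context size, which is why a strategy that works for the small register state of a CPU thread does not carry over directly to a GPU.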