Publication | Closed Access
OWL
247
Citations
49
References
2013
Year
Unknown Venue
Hardware SecurityCluster ComputingComputational ScienceGpu ArchitectureEngineeringCompute KernelGpu BenchmarkingLong Memory LatenciesComputer EngineeringComputer ArchitectureCurrent WarpParallel ProgrammingComputer ScienceGpgpu ArchitecturesParallel ComputingGpu ClusterGpu Computing
Emerging GPGPU architectures, along with programming models like CUDA and OpenCL, offer a cost-effective platform for many applications by providing high thread level parallelism at lower energy budgets. Unfortunately, for many general-purpose applications, available hardware resources of a GPGPU are not efficiently utilized, leading to lost opportunity in improving performance. A major cause of this is the inefficiency of current warp scheduling policies in tolerating long memory latencies.
| Year | Citations | |
|---|---|---|
Page 1
Page 1