Publication | Closed Access
GPUdrive: Reconsidering Storage Accesses for GPU Acceleration
10
Citations
9
References
2014
Year
Unknown Venue
EngineeringGpu-accelerated Data-intensive ApplicationsTraditional Storage LatencyGpu BenchmarkingComputer ArchitecturePower OptimizationEmbedded SystemsHardware SystemsGpu ComputingStorage SystemsHigh-performance ArchitectureComputing SystemsParallel ComputingCompilersGpu AccelerationComputer EngineeringStorage Access BandwidthComputer ScienceGpu ArchitectureParallel Programming
GPU-accelerated data-intensive applications demonstrate in excess of ten-fold speedups over CPU-only approaches. However, file-driven data movement between the CPU and the GPU can degrade performance and energy efficiencies by an order of magnitude as a result of traditional storage latency and ineffectual memory management. In this paper, we first analyze these two critical performance bottlenecks in GPU-accelerated data processing. We then study design considerations to reduce the overheads imposed by file-driven data movements in GPU computing. To address these issues, we prototype a low cost and low power all-flash array designed specifically for stream-based, I/O-rich workloads inherent in GPUs. As preliminary evaluation results, we demonstrate that our early-stage all-flash array solution can eliminate 60% ∼ 90% performance discrepancy between memory-level GPU data transfer rates and storage access bandwidth by removing unnecessary data copies, memory management, and user/kernel-mode switching in the current system software stack. In addition, our allflash array prototype consumes less dynamic power than the baseline storage array by 49%, on average.
| Year | Citations | |
|---|---|---|
Page 1
Page 1