Publication | Closed Access
GPGPUs: How to combine high computational power with high reliability
36
Citations
0
References
2014
Year
EngineeringGpu BenchmarkingGpgpu ReliabilityComputer ArchitectureHigh Computational PowerGpu ComputingHardware SecurityRadiation ExperimentsReliability EngineeringParallel ComputingPower ManagementReliabilityComputer EngineeringComputer ScienceGpu ClusterComputational ScienceGpu ArchitectureParallel ProgrammingHpc DatacentersGpu Virtualization
GPGPUs are used increasingly in several domains, from gaming to different kinds of computationally intensive applications. In many applications GPGPU reliability is becoming a serious issue, and several research activities are focusing on its evaluation. This paper offers an overview of some major results in the area. First, it shows and analyzes the results of some experiments assessing GPGPU reliability in HPC datacenters. Second, it provides some recent results derived from radiation experiments about the reliability of GPGPUs. Third, it describes the characteristics of an advanced fault-injection environment, allowing effective evaluation of the resiliency of applications running on GPGPUs.