Publication | Closed Access
DRAM errors in the wild
Citations: 555 | References: 21 | Year: 2009
Unknown Venue
Hardware Security, Reliability Engineering, Engineering, Hardware Reliability, Real DRAM Failures, Mem Testing, Software Testing, Cloud Computing, In-memory Database, Computer Architecture, Computer Engineering, Memory Errors, Computer Science, Parallel Computing, DRAM Errors, Memory Management, Memory Architecture, Hardware Failure
Errors in dynamic random access memory (DRAM) are a common form of hardware failure in modern compute clusters. Failures are costly both in terms of hardware replacement costs and service disruption. While a large body of work exists on DRAM in laboratory conditions, little has been reported on real DRAM failures in large production clusters. In this paper, we analyze measurements of memory errors in a large fleet of commodity servers over a period of 2.5 years. The collected data covers multiple vendors, DRAM capacities and technologies, and comprises many millions of DIMM days.