Publication | Open Access
Metastable failures in distributed systems
Citations: 25
References: 9
Year: 2021
Unknown Venue
Reliability, Software Maintenance, Reliability Engineering, Engineering, Metastable Failures, Fault-tolerant Network, Software Testing, A Failure Pattern, Systems Engineering, Fault Tolerance, Software Engineering, Distributed Systems, Computer Science, High Availability, Black Swan Events, Fault-tolerant Messaging, Data Management, Failure Detection
We describe metastable failures, a failure pattern in distributed systems. Currently, metastable failures manifest themselves as black swan events: they are outliers because nothing in the past points to their possibility, they have a severe impact, and they are much easier to explain in hindsight than to predict. Although instances of metastable failures can look different on the surface, deeper analysis shows that they can be understood within the same framework.