Publication | Closed Access
Analysis and modeling of correlated failures in multicomputer systems
Citations: 77
References: 16
Year: 1992
Cluster Computing, Availability, Engineering, Computer Architecture, System Reliability, Dependable System Architecture, Network Survivability, C-dependent Model, Reliability Engineering, Systems Engineering, Correlated Failures, Failure Detection, Dependability Analysis, Reliability, Networked Computer Systems, Distributed Systems, Computer Science, Dependability Modelling, P-dependent Model, Fault Management, Software Testing, Fault Injection
Based on measurements from two DEC VAX-cluster multicomputer systems, the issue of correlated failures is addressed. In particular, the characteristics of correlated failures, their impact on dependability, and their modelling are discussed. The data show that most correlated failures are related to errors in shared resources and propagate from one machine to another. Comparisons between measurement-based models and analytical models that assume failure independence show that the impact of correlated failures on dependability is significant. Two validated models, the c-dependent model and the p-dependent model, are developed to evaluate the dependability of systems with correlated failures.