Publication | Closed Access
System monitoring with metric-correlation models
44
Citations
12
References
2009
Year
Unknown Venue
Software MaintenanceEngineeringMeasurementService MonitoringN MetricsSoftware EngineeringSystem MetricManagement MetricsSoftware AnalysisCorrelated MetricsMetric-correlation ModelsReliability EngineeringEmpirical Software Engineering ResearchData ScienceSystems EngineeringStatisticsQuantitative ManagementSoftware MeasurementSignal ProcessingSoftware DesignRegression TestingProgram AnalysisSoftware TestingSoftware MetricProcess ControlBusinessMonitoringSystem MonitoringSystem SoftwareData Modeling
Correlations among management metrics in software systems allow errors to be detected and their cause localized. Prior research shows that linear models can capture many of these correlations. However, our research shows that several factors may prevent linear models from accurately describing correlations, even if the underlying relationship is linear. Two common phenomena we have observed are relationships that evolve, typically with time, and heterogeneous variance of the correlated metrics. Two-variable linear models proposed thus far fail to capture these phenomena, and thus fail to describe system dynamics correctly. Often, these phenomena are caused by a missing variable. However, searching for three-variable correlations is O(n3) for n metrics, which is costly for systems with many metrics. In this paper we address the above challenges by improving on two-variable Ordinary Least Squares regression models. We validate our models using a realistic Java-Enterprise-Edition application. Using fault-injection experiments we show that our improved models capture system behavior accurately. We detect errors within 8 sample periods on average from the injection of the fault, which is less than half the time required by the current linear-model approach.
| Year | Citations | |
|---|---|---|
Page 1
Page 1