Publication | Closed Access
Alert Detection in System Logs
88
Citations
15
References
2008
Year
Unknown Venue
Anomaly DetectionEngineeringWarning SystemDiagnosisSoftware EngineeringPresent NodeinfoSoftware AnalysisData ScienceData MiningSystems EngineeringLog ManagementIntrusion Detection SystemKnowledge DiscoveryComputer ScienceAlert DetectionLog AnalysisProgram AnalysisSoftware TestingIntrusion DetectionEvent-driven MonitoringBig Data
We present Nodeinfo, an unsupervised algorithm for anomaly detection in system logs. We demonstrate Nodeinfo's effectiveness on data from four of the world's most powerful supercomputers: using logs representing over 746 million processor-hours, in which anomalous events called alerts were manually tagged for scoring, we aim to automatically identify the regions of the log containing those alerts. We formalize the alert detection task in these terms, describe how Nodeinfo uses the information entropy of message terms to identify alerts, and present an online version of this algorithm, which is now in production use. This is the first work to investigate alert detection on (several) publicly-available supercomputer system logs, thereby providing a reproducible performance baseline.
| Year | Citations | |
|---|---|---|
Page 1
Page 1