Publication | Closed Access
Log Message Anomaly Detection with Oversampling
18
Citations
16
References
2020
Year
Unknown Venue
Anomaly DetectionMachine LearningEngineeringFeature ExtractionInformation ForensicsText MiningData ScienceData MiningPattern RecognitionClass ImbalanceManagementLog ManagementImbalanced DataMachine Learning ModelPredictive AnalyticsOutlier DetectionKnowledge DiscoveryComputer ScienceSignal ProcessingLog AnalysisNovelty Detection
Imbalanced data is a significant challenge in classification with machine learning algorithms. This is particularly important with log message data as negative logs are sparse so this data is typically imbalanced. In this paper, a model to generate text log messages is proposed which employs a SeqGAN network. An Autoencoder is used for feature extraction and anomaly detection is done using a GRU network. The proposed model is evaluated with three imbalanced log data sets, namely BGL, OpenStack, and Thunderbird. Results are presented which show that appropriate oversampling and data balancing improves anomaly detection accuracy.
| Year | Citations | |
|---|---|---|
Page 1
Page 1