Publication | Closed Access
Machine Learning Predictions of Runtime and IO Traffic on High-End Clusters
47
Citations
4
References
2016
Year
Unknown Venue
Cluster ComputingInternet Traffic AnalysisEngineeringMachine LearningMachine Learning ToolIo TrafficData ScienceData MiningTraffic PredictionDecision Tree LearningInternet Of ThingsMachine Learning PredictionsPerformance PredictionHigh-performance Data AnalyticsJob SchedulerPredictive AnalyticsKnowledge DiscoveryComputer ScienceTraffic MonitoringHigh-end ClustersCloud ComputingNetwork Traffic MeasurementDecision TreesRandom ForestBig Data
We use supervised machine learning algorithms (i.e., Decision Trees, Random Forest, and K-nearest Neighbors) to predict performance characteristics such as runtime and IO traffic of batch jobs on high-end clusters, using only user job scripts as input. We show that decision trees outperform other algorithms and accurately predict the runtime of 73% of jobs within a error tolerance of 10 minutes, which is a 51% improvement over the user requested runtime.
| Year | Citations | |
|---|---|---|
Page 1
Page 1