Publication | Closed Access
An empirical comparison of supervised learning algorithms
Citations: 2.7K
References: 19
Year: 2006
Venue: ICML 2006
Topics: Engineering, Machine Learning, Classification Method, Data Science, Data Mining, Pattern Recognition, Platt Scaling, Decision Tree Learning, Statistics, Supervised Learning, Predictive Analytics, Supervised Learning Algorithms, Knowledge Discovery, Intelligent Classification, Computer Science, Statistical Learning Theory, Data Classification, Classifier System, Decision Trees
Summary: Supervised learning methods have proliferated over the last decade, yet the most recent comprehensive empirical evaluation was the Statlog Project of the early 1990s. This study conducts a large-scale empirical comparison of ten supervised learning methods: SVMs, neural nets, logistic regression, naive Bayes, memory-based learning, random forests, decision trees, bagged trees, boosted trees, and boosted stumps. The authors evaluate these methods across a variety of performance criteria and assess how calibrating the models with Platt Scaling and Isotonic Regression affects their performance.
Abstract: A number of supervised learning methods have been introduced in the last decade. Unfortunately, the last comprehensive empirical evaluation of supervised learning was the Statlog Project in the early 1990s. We present a large-scale empirical comparison between ten supervised learning methods: SVMs, neural nets, logistic regression, naive Bayes, memory-based learning, random forests, decision trees, bagged trees, boosted trees, and boosted stumps. We also examine the effect that calibrating the models via Platt Scaling and Isotonic Regression has on their performance. An important aspect of our study is the use of a variety of performance criteria to evaluate the learning methods.
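The two calibration methods named in the abstract can be illustrated with a minimal sketch: Platt Scaling fits a logistic function to a classifier's raw scores on held-out data, while Isotonic Regression fits a monotone step function to the same scores. The dataset, the choice of a linear SVM as the uncalibrated base model, and the scikit-learn estimators below are illustrative assumptions, not the paper's experimental setup.

```python
# Sketch of Platt Scaling and Isotonic Regression for calibrating a
# classifier's scores into probabilities (illustrative, not the paper's setup).
import numpy as np
from sklearn.datasets import make_classification
from sklearn.isotonic import IsotonicRegression
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.svm import LinearSVC

# Synthetic binary classification data; hold out a calibration split.
X, y = make_classification(n_samples=1000, random_state=0)
X_train, X_cal, y_train, y_cal = train_test_split(X, y, random_state=0)

# Uncalibrated base model: SVM margins are scores, not probabilities.
svm = LinearSVC(max_iter=10000).fit(X_train, y_train)
scores = svm.decision_function(X_cal)

# Platt Scaling: logistic regression sigma(a*s + b) on held-out scores.
platt = LogisticRegression().fit(scores.reshape(-1, 1), y_cal)
platt_probs = platt.predict_proba(scores.reshape(-1, 1))[:, 1]

# Isotonic Regression: nonparametric monotone map from scores to probabilities.
iso = IsotonicRegression(out_of_bounds="clip").fit(scores, y_cal)
iso_probs = iso.predict(scores)
```

In practice the calibrator must be fit on data not used to train the base model, as above; otherwise both methods overfit the training scores.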
| Year | Citations |
|---|---|
| 2001 | 119.3K |
| 1999 | 26.9K |
| 1996 | 16.6K |
| 1998 | 10.5K |
| 1999 | 4.9K |
| 2006 | 4.3K |
| 1999 | 2.6K |
| 1990 | 1.6K |
| 2005 | 1.5K |
| 2000 | 1.1K |