Publication | Open Access
Learning Optimal Classification Trees Using a Binary Linear Program Formulation
130
Citations
15
References
2019
Year
Mathematical ProgrammingArtificial IntelligenceNew FormulationEngineeringMachine LearningComputational ComplexityClassification MethodData ScienceData MiningPattern RecognitionDecision TreeOptimal Classification TreesDecision Tree LearningMathematical Optimization FormulationsCombinatorial OptimizationSupervised LearningComputational Learning TheoryPredictive AnalyticsKnowledge DiscoveryLarge Scale OptimizationComputer ScienceDeep LearningOptimal Classification TreeInteger ProgrammingData ClassificationClassifier SystemLinear Programming
We provide a new formulation for the problem of learning the optimal classification tree of a given depth as a binary linear program. A limitation of previously proposed Mathematical Optimization formulations is that they create constraints and variables for every row in the training data. As a result, the running time of the existing Integer Linear programming (ILP) formulations increases dramatically with the size of data. In our new binary formulation, we aim to circumvent this problem by making the formulation size largely independent from the training data size. We show experimentally that our formulation achieves better performance than existing formulations on both small and large problem instances within shorter running time.
| Year | Citations | |
|---|---|---|
Page 1
Page 1