Open AccessPosted Content
Learning Optimal Classification Trees: Strong Max-Flow Formulations.
TL;DR: This work proposes a flow-based MIP formulation for optimal binary classification trees that has a stronger linear programming relaxation and exploits the structure and max-flow/min-cut duality to derive a Benders' decomposition method, which scales to larger instances.
read more
Abstract: We consider the problem of learning optimal binary classification trees. Literature on the topic has burgeoned in recent years, motivated both by the empirical suboptimality of heuristic approaches and the tremendous improvements in mixed-integer programming (MIP) technology. Yet, existing approaches from the literature do not leverage the power of MIP to its full extent. Indeed, they rely on weak formulations, resulting in slow convergence and large optimality gaps. To fill this gap in the literature, we propose a flow-based MIP formulation for optimal binary classification trees that has a stronger linear programming relaxation. Our formulation presents an attractive decomposable structure. We exploit this structure and max-flow/min-cut duality to derive a Benders' decomposition method, which scales to larger instances. We conduct extensive computational experiments on standard benchmark datasets on which we show that our proposed approaches are 50 times faster than state-of-the art MIP-based techniques and improve out of sample performance up to 13.8%.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
•Posted Content
Interpretable Machine Learning: Fundamental Principles and 10 Grand Challenges
TL;DR: In this paper, the authors provide fundamental principles for interpretable ML and dispel common misunderstandings that dilute the importance of this crucial topic, and identify 10 technical challenge areas in interpretable machine learning and provide history and background on each problem.
479
•Proceedings Article
Decision Trees for Decision-Making under the Predict-then-Optimize Framework
Adam N. Elmachtoub,Jason Cheuk Nam Liang,Ryan McNellis +2 more
- 12 Jul 2020
TL;DR: In this article, a Smart Predict-then-Optimize (SPO) loss is proposed to measure the suboptimality of the decisions induced by the predicted input parameters, as opposed to measuring loss using input parameter prediction error.
Mathematical Optimization in Classification and Regression Trees
TL;DR: In this paper, the authors review recent contributions within the Continuous Optimization and the Mixed-Integer Linear Optimization paradigms to develop novel formulations in this research area and compare those in terms of the nature of the decision variables and the constraints required, as well as the optimization algorithms proposed.
•Proceedings Article
A Scalable MIP-based Method for Learning Optimal Multivariate Decision Trees
Haoran Zhu,Pavankumar Murali,Dzung T. Phan,Lam M. Nguyen,Jayant R. Kalagnanam +4 more
- 01 Jan 2020
TL;DR: A novel MIP formulation, based on a 1-norm support vector machine model, to train a multivariate ODT for classification problems and is able to routinely handle large data-sets with more than 7,000 sample points and outperform heuristics methods and other MIP based techniques.
Journal Article
MurTree: Optimal Decision Trees via Dynamic Programming and Search
Emir Demirović,Anna Lukina,Emmanuel Hebrard,Jeffrey Chan,James Bailey,Christopher Leckie,Kotagiri Ramamohanarao,Peter J. Stuckey,Luc De Raedt +8 more
TL;DR: This work provides a novel algorithm for learning optimal classification trees based on dynamic programming and search and shows in a detailed experimental study that this approach uses only a fraction of the time required by the state-of-the-art and can handle datasets with tens of thousands of instances.
15
References
•Book
C4.5: Programs for Machine Learning
J. Ross Quinlan
- 15 Oct 1992
TL;DR: A complete guide to the C4.5 system as implemented in C for the UNIX environment, which starts from simple core learning methods and shows how they can be elaborated and extended to deal with typical problems such as missing data and over hitting.
27.2K
•Book
Classification and regression trees
Leo Breiman
- 01 Jan 1983
TL;DR: The methodology used to construct tree structured rules is the focus of a monograph as mentioned in this paper, covering the use of trees as a data analysis method, and in a more mathematical framework, proving some of their fundamental properties.
22.7K
Classification and Regression by randomForest
Andy Liaw,Matthew C. Wiener +1 more
- 01 Jan 2007
TL;DR: random forests are proposed, which add an additional layer of randomness to bagging and are robust against overfitting, and the randomForest package provides an R interface to the Fortran programs by Breiman and Cutler.
Induction of Decision Trees
TL;DR: In this paper, an approach to synthesizing decision trees that has been used in a variety of systems, and it describes one such system, ID3, in detail, is described, and a reported shortcoming of the basic algorithm is discussed.
Classification and regression trees
TL;DR: This article gives an introduction to the subject of classification and regression trees by reviewing some widely available algorithms and comparing their capabilities, strengths, and weakness in two examples.