Learning Optimal Classification Trees: Strong Max-Flow Formulations.

Open Access • Posted Content

- 21 Feb 2020

30

TL;DR: This work proposes a flow-based MIP formulation for optimal binary classification trees that has a stronger linear programming relaxation and exploits the structure and max-flow/min-cut duality to derive a Benders' decomposition method, which scales to larger instances.
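
The flow idea in the summary above can be illustrated with a minimal sketch. It assumes a simplified setting that is not taken from the paper itself: a fixed, complete binary tree with heap-indexed nodes, binary features, and one unit-capacity flow network per datapoint. The names `flow_graph`, `max_flow`, `branch_feature`, and `leaf_label` are hypothetical. Under these assumptions, a datapoint can send one unit of flow from source to sink exactly when the tree classifies it correctly, so the number of correct predictions equals the sum of the per-datapoint max-flow values (and, by max-flow/min-cut duality, of the min-cut values).

```python
from collections import deque


def flow_graph(branch_feature, leaf_label, x, y):
    """Build the unit-capacity network for one datapoint (x, y).

    Assumed encoding (illustrative only): node n has children 2n and 2n+1;
    a feature value of 0 opens the left edge and 1 opens the right edge;
    a leaf connects to the sink only if its label equals y.
    """
    cap = {"s": {1: 1}}
    for n, f in branch_feature.items():
        cap[n] = {2 * n: int(x[f] == 0), 2 * n + 1: int(x[f] == 1)}
    for leaf, label in leaf_label.items():
        cap[leaf] = {"t": int(label == y)}
    return cap


def max_flow(capacity, source, sink):
    """Edmonds-Karp max flow on a dict-of-dicts capacity graph."""
    # Residual graph with zero-capacity reverse edges added.
    residual = {u: dict(nbrs) for u, nbrs in capacity.items()}
    for u, nbrs in capacity.items():
        for v in nbrs:
            residual.setdefault(v, {}).setdefault(u, 0)
    flow = 0
    while True:
        # Breadth-first search for a shortest augmenting path.
        parent = {source: None}
        queue = deque([source])
        while queue and sink not in parent:
            u = queue.popleft()
            for v, c in residual[u].items():
                if c > 0 and v not in parent:
                    parent[v] = u
                    queue.append(v)
        if sink not in parent:
            return flow
        # Find the bottleneck capacity along the path, then augment.
        bottleneck = float("inf")
        v = sink
        while parent[v] is not None:
            bottleneck = min(bottleneck, residual[parent[v]][v])
            v = parent[v]
        v = sink
        while parent[v] is not None:
            u = parent[v]
            residual[u][v] -= bottleneck
            residual[v][u] += bottleneck
            v = u
        flow += bottleneck


# A depth-2 tree that computes XOR of two binary features (toy example).
branch_feature = {1: 0, 2: 1, 3: 1}
leaf_label = {4: 0, 5: 1, 6: 1, 7: 0}
data = [((0, 0), 0), ((0, 1), 1), ((1, 0), 1), ((1, 1), 1)]

flows = [max_flow(flow_graph(branch_feature, leaf_label, x, y), "s", "t")
         for x, y in data]
num_correct = sum(flows)
```

In this toy dataset the last point is mislabeled relative to XOR, so its network has no open source-to-sink path. The sketch only evaluates a fixed tree; in a MIP formulation the edge capacities would instead depend on decision variables for the branching features and leaf labels.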

Citations

•Posted Content

Interpretable Machine Learning: Fundamental Principles and 10 Grand Challenges

Cynthia Rudin, +5 more

- 20 Mar 2021

- arXiv: Learning

TL;DR: The authors provide fundamental principles for interpretable ML, dispel common misunderstandings that dilute the importance of this topic, and identify 10 technical challenge areas in interpretable machine learning, with history and background on each problem.

479

•Proceedings Article

Decision Trees for Decision-Making under the Predict-then-Optimize Framework

Adam N. Elmachtoub, +2 more

- 12 Jul 2020

TL;DR: In this article, a Smart Predict-then-Optimize (SPO) loss is proposed to measure the suboptimality of the decisions induced by the predicted input parameters, as opposed to measuring loss using input parameter prediction error.

101

•Journal Article•10.1007/S11750-021-00594-1

Mathematical Optimization in Classification and Regression Trees

Emilio Carrizosa, +2 more

- 17 Mar 2021

- Top

TL;DR: In this paper, the authors review recent contributions within the Continuous Optimization and the Mixed-Integer Linear Optimization paradigms to develop novel formulations in this research area and compare those in terms of the nature of the decision variables and the constraints required, as well as the optimization algorithms proposed.

100

•Proceedings Article

A Scalable MIP-based Method for Learning Optimal Multivariate Decision Trees

Haoran Zhu, +4 more

- 01 Jan 2020

TL;DR: This work proposes a novel MIP formulation, based on a 1-norm support vector machine model, to train a multivariate ODT for classification problems; it routinely handles large datasets with more than 7,000 sample points and outperforms heuristic methods and other MIP-based techniques.

32

•Journal Article

MurTree: Optimal Decision Trees via Dynamic Programming and Search

Emir Demirović, +8 more

- Journal of Machine Learning Research

TL;DR: This work provides a novel algorithm for learning optimal classification trees based on dynamic programming and search and shows in a detailed experimental study that this approach uses only a fraction of the time required by the state-of-the-art and can handle datasets with tens of thousands of instances.

15

References

•Book

C4.5: Programs for Machine Learning

J. Ross Quinlan

- 15 Oct 1992

TL;DR: A complete guide to the C4.5 system as implemented in C for the UNIX environment, which starts from simple core learning methods and shows how they can be elaborated and extended to deal with typical problems such as missing data and overfitting.

27.2K

•Book

Classification and regression trees

Leo Breiman

- 01 Jan 1983

TL;DR: This monograph focuses on the methodology used to construct tree-structured rules, covering the use of trees as a data analysis method and, in a more mathematical framework, proving some of their fundamental properties.

22.7K

Classification and Regression by randomForest

Andy Liaw, +1 more

- 01 Jan 2007

TL;DR: Random forests add an additional layer of randomness to bagging and are robust against overfitting; the randomForest package provides an R interface to the Fortran programs by Breiman and Cutler.

20.1K

•Journal Article•10.1023/A:1022643204877

Induction of Decision Trees

J. R. Quinlan

- 25 Mar 1986

- Machine Learning

TL;DR: This paper summarizes an approach to synthesizing decision trees that has been used in a variety of systems, describes one such system, ID3, in detail, and discusses a reported shortcoming of the basic algorithm.

18.8K

•Journal Article•10.1002/WIDM.8

Classification and regression trees

Wei-Yin Loh

- 01 Jan 2011

- Wiley Interdisciplinary Reviews-Data Min...

TL;DR: This article gives an introduction to the subject of classification and regression trees by reviewing some widely available algorithms and comparing their capabilities, strengths, and weaknesses in two examples.

18.7K

Related Papers (5)

Learning optimal classification trees using a binary linear program formulation

Optimal Sparse Decision Trees

Optimal classification trees

Optimal decision trees for categorical data via integer programming

Strong mixed-integer programming formulations for trained neural networks