A boosted decision tree approach using Bayesian hyper-parameter optimization for credit scoring

doi:10.1016/J.ESWA.2017.02.017

Journal Article10.1016/J.ESWA.2017.02.017

A boosted decision tree approach using Bayesian hyper-parameter optimization for credit scoring

Yufei Xia, +3 more

- 15 Jul 2017

- Expert Systems With Applications

- Vol. 78, Iss: 78, pp 225-241

699

TL;DR: A sequential ensemble credit scoring model based on a variant of gradient boosting machine (i.e., extreme gradient boosting (XGBoost) is proposed, which demonstrates that Bayesian hyper-parameter optimization performs better than random search, grid search, and manual search.

Abstract: Credit scoring is an effective tool for banks to properly guide decision profitably on granting loans. Ensemble methods, which according to their structures can be divided into parallel and sequential ensembles, have been recently developed in the credit scoring domain. These methods have proven their superiority in discriminating borrowers accurately. However, among the ensemble models, little consideration has been provided to the following: (1) highlighting the hyper-parameter tuning of base learner despite being critical to well-performed ensemble models; (2) building sequential models (i.e., boosting, as most have focused on developing the same or different algorithms in parallel); and (3) focusing on the comprehensibility of models. This paper aims to propose a sequential ensemble credit scoring model based on a variant of gradient boosting machine (i.e., extreme gradient boosting (XGBoost)). The model mainly comprises three steps. First, data pre-processing is employed to scale the data and handle missing values. Second, a model-based feature selection system based on the relative feature importance scores is utilized to remove redundant variables. Third, the hyper-parameters of XGBoost are adaptively tuned with Bayesian hyper-parameter optimization and used to train the model with selected feature subset. Several hyper-parameter optimization methods and baseline classifiers are considered as reference points in the experiment. Results demonstrate that Bayesian hyper-parameter optimization performs better than random search, grid search, and manual search. Moreover, the proposed model outperforms baseline models on average over four evaluation measures: accuracy, error rate, the area under the curve (AUC) H measure (AUC-H measure), and Brier score. The proposed model also provides feature importance scores and decision chart, which enhance the interpretability of credit scoring model.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Pattern Recognition and Machine Learning

Christopher M. Bishop

- 01 Jan 2006

TL;DR: Probability distributions of linear models for regression and classification are given in this article, along with a discussion of combining models and combining models in the context of machine learning and classification.

...read moreread less

10.1K

•Journal Article•10.1016/J.NEUCOM.2020.07.061

On Hyperparameter Optimization of Machine Learning Algorithms: Theory and Practice

Li Yang, +1 more

- 20 Nov 2020

- Neurocomputing

TL;DR: This survey paper will help industrial users, data analysts, and researchers to better develop machine learning models by identifying the proper hyper-parameter configurations effectively and introducing several state-of-the-art optimization techniques.

...read moreread less

2K

•Journal Article•10.1007/S10462-020-09896-5

A comparative analysis of gradient boosting algorithms

Candice Bentéjac, +2 more

- 01 Mar 2021

- Artificial Intelligence Review

TL;DR: A comprehensive comparison between XGBoost, LightGBM, CatBoost, random forests and gradient boosting has been performed and indicates that CatBoost obtains the best results in generalization accuracy and AUC in the studied datasets although the differences are small.

...read moreread less

1.3K

•Journal Article•10.1016/J.GSF.2020.03.007

Prediction of undrained shear strength using extreme gradient boosting and random forest based on Bayesian optimization

Wengang Zhang, +4 more

- 01 Jan 2021

- Geoscience frontiers

TL;DR: Novel data-driven extreme gradient boosting (XGBoost) and random forest ensemble learning methods are applied for capturing the relationships between the USS and various basic soil parameters to predict undrained shear strength of soft clays.

...read moreread less

682

•Journal Article•10.1016/J.ESWA.2017.04.003

An up-to-date comparison of state-of-the-art classification algorithms

Chongsheng Zhang, +3 more

- 01 Oct 2017

- Expert Systems With Applications

TL;DR: It is found that Stochastic Gradient Boosting Trees (GBDT) matches or exceeds the prediction performance of Support Vector Machines and Random Forests, while being the fastest algorithm in terms of prediction efficiency.

...read moreread less

418

...

Expand

References

•Journal Article•10.1023/A:1010933404324

Random Forests

Leo Breiman

- 01 Oct 2001

TL;DR: Internal estimates monitor error, strength, and correlation and these are used to show the response to increasing the number of features used in the forest, and are also applicable to regression.

...read moreread less

113.1K

Journal Article•10.1145/1961189.1961199

LIBSVM: A library for support vector machines

Chih-Chung Chang, +1 more

- 06 May 2011

- ACM Transactions on Intelligent Systems ...

TL;DR: Issues such as solving SVM optimization problems theoretical convergence multiclass classification probability estimates and parameter selection are discussed in detail.

...read moreread less

46.3K

•Journal Article•10.1023/A:1022627411411

Support-Vector Networks

Corinna Cortes, +1 more

- 15 Sep 1995

- Machine Learning

TL;DR: High generalization ability of support-vector networks utilizing polynomial input transformations is demonstrated and the performance of the support- vector network is compared to various classical learning algorithms that all took part in a benchmark study of Optical Character Recognition.

...read moreread less

42K

•Proceedings Article•10.1145/2939672.2939785

XGBoost: A Scalable Tree Boosting System

Tianqi Chen, +1 more

- 09 Mar 2016

- arXiv: Learning

TL;DR: This paper proposes a novel sparsity-aware algorithm for sparse data and weighted quantile sketch for approximate tree learning and provides insights on cache access patterns, data compression and sharding to build a scalable tree boosting system called XGBoost.

...read moreread less

32.8K

...

Expand

A boosted decision tree approach using Bayesian hyper-parameter optimization for credit scoring

Chat with Paper

AI Agents for this Paper

Citations

Pattern Recognition and Machine Learning

On Hyperparameter Optimization of Machine Learning Algorithms: Theory and Practice

A comparative analysis of gradient boosting algorithms

Prediction of undrained shear strength using extreme gradient boosting and random forest based on Bayesian optimization

An up-to-date comparison of state-of-the-art classification algorithms

References

Random Forests

Scikit-learn: Machine Learning in Python

LIBSVM: A library for support vector machines

Support-Vector Networks

XGBoost: A Scalable Tree Boosting System

Related Papers (5)

Benchmarking state-of-the-art classification algorithms for credit scoring: An update of research

Random Forests

Greedy function approximation: A gradient boosting machine.

XGBoost: A Scalable Tree Boosting System

SMOTE: synthetic minority over-sampling technique