A data driven ensemble classifier for credit scoring analysis

doi:10.1016/J.ESWA.2009.05.059

Journal Article10.1016/J.ESWA.2009.05.059

A data driven ensemble classifier for credit scoring analysis

Nan-Chen Hsieh, +1 more

- 01 Jan 2010

- Expert Systems With Applications

- Vol. 37, Iss: 1, pp 534-545

173

TL;DR: This study focuses on predicting whether a credit applicant can be categorized as good, bad or borderline from information initially supplied, and introduces the concept of class-wise classification as a preprocessing step in order to obtain an efficient ensemble classifier.

Abstract: This study focuses on predicting whether a credit applicant can be categorized as good, bad or borderline from information initially supplied. This is essentially a classification task for credit scoring. Given its importance, many researchers have recently worked on an ensemble of classifiers. However, to the best of our knowledge, unrepresentative samples drastically reduce the accuracy of the deployment classifier. Few have attempted to preprocess the input samples into more homogeneous cluster groups and then fit the ensemble classifier accordingly. For this reason, we introduce the concept of class-wise classification as a preprocessing step in order to obtain an efficient ensemble classifier. This strategy would work better than a direct ensemble of classifiers without the preprocessing step. The proposed ensemble classifier is constructed by incorporating several data mining techniques, mainly involving optimal associate binning to discretize continuous values; neural network, support vector machine, and Bayesian network are used to augment the ensemble classifier. In particular, the Markov blanket concept of Bayesian network allows for a natural form of feature selection, which provides a basis for mining association rules. The learned knowledge is represented in multiple forms, including causal diagram and constrained association rules. The data driven nature of the proposed system distinguishes it from existing hybrid/ensemble credit scoring systems.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.1057/PALGRAVE.JORS.2601545

Benchmarking state-of-the-art classification algorithms for credit scoring

Bart Baesens, +5 more

- 09 Jun 2003

- Journal of the Operational Research Soci...

TL;DR: It is found that both the LS-SVM and neural network classifiers yield a very good performance, but also simple classifiers such as logistic regression and linear discriminant analysis perform very well for credit scoring.

...read moreread less

1.3K

•Journal Article•10.1016/J.EJOR.2015.05.030

Benchmarking state-of-the-art classification algorithms for credit scoring: An update of research

Stefan Lessmann, +4 more

- 16 Nov 2015

- European Journal of Operational Research

TL;DR: The study of Baesens et al. (2003) is updated and several novel classification algorithms to the state-of-the-art in credit scoring are compared, providing an independent assessment of recent scoring methods and offering a new baseline to which future approaches can be compared.

...read moreread less

948

•Journal Article•10.1155/2015/431047

Data mining for the Internet of Things: literature review and challenges

Feng Chen, +5 more

- 01 Jan 2015

- International Journal of Distributed Sen...

TL;DR: A systematic way to review data mining in knowledge view, technique view, and application view, including classification, clustering, association analysis, time series analysis and outlier analysis is given.

...read moreread less

567

Journal Article•10.1007/S00521-010-0362-Z

A comparative survey of artificial intelligence applications in finance: artificial neural networks, expert system and hybrid intelligent systems

Arash Bahrammirzaee

- 01 Nov 2010

- Neural Computing and Applications

TL;DR: Comparative research review of three famous artificial intelligent techniques in financial market shows that accuracy of these artificial intelligent methods is superior to that of traditional statistical methods in dealing with financial problems, especially regarding nonlinear patterns.

...read moreread less

542

Journal Article•10.1016/J.INS.2017.10.017

Imbalanced enterprise credit evaluation with DTE-SBD

Jie Sun, +3 more

- 01 Jan 2018

- Information Sciences

TL;DR: A new DT ensemble model for imbalanced enterprise credit evaluation based on the synthetic minority over-sampling technique and the Bagging ensemble learning algorithm with differentiated sampling rates is proposed, which is named as DTE-SBD (Decision Tree Ensemble based on SMOTE, Bagging and DSR).

...read moreread less

360

...

Expand

References

•Book

The Nature of Statistical Learning Theory

Vladimir Vapnik

- 01 Jan 1995

TL;DR: Setting of the learning problem consistency of learning processes bounds on the rate of convergence ofLearning processes controlling the generalization ability of learning process constructing learning algorithms what is important in learning theory?

...read moreread less

46K

•Journal Article•10.1214/AOS/1176344136

Estimating the Dimension of a Model

Gideon Schwarz

- 01 Mar 1978

- Annals of Statistics

TL;DR: In this paper, the problem of selecting one of a number of models of different dimensions is treated by finding its Bayes solution, and evaluating the leading terms of its asymptotic expansion.

...read moreread less

45K

Estimating the dimension of a model

Gideon Schwarz

- 01 Jan 2005

TL;DR: In this paper, the problem of selecting one of a number of models of different dimensions is treated by finding its Bayes solution, and evaluating the leading terms of its asymptotic expansion.

...read moreread less

40.6K

Journal Article•10.1111/J.1540-6261.1968.TB00843.X

Financial ratios, discriminant analysis and the prediction of corporate bankruptcy

Edward I. Altman

- 01 Sep 1968

- Journal of Finance

TL;DR: In this paper, a set of financial and economic ratios are investigated in a bankruptcy prediction context wherein a multiple discriminant statistical methodology is employed, and the data used in the study are limited to manufacturing corporations, where an initial sample of sixty-six firms is utilized to establish a function which best discriminates between companies in two mutually exclusive groups: bankrupt and nonbankrupt firms.

...read moreread less

13.2K

Journal Article•10.1109/34.667881

On combining classifiers

Josef Kittler, +3 more

- 01 Mar 1998

- IEEE Transactions on Pattern Analysis an...

TL;DR: A common theoretical framework for combining classifiers which use distinct pattern representations is developed and it is shown that many existing schemes can be considered as special cases of compound classification where all the pattern representations are used jointly to make a decision.

...read moreread less

5.8K

...

Expand

A data driven ensemble classifier for credit scoring analysis

Chat with Paper

AI Agents for this Paper

Citations

Benchmarking state-of-the-art classification algorithms for credit scoring

Benchmarking state-of-the-art classification algorithms for credit scoring: An update of research

Data mining for the Internet of Things: literature review and challenges

A comparative survey of artificial intelligence applications in finance: artificial neural networks, expert system and hybrid intelligent systems

Imbalanced enterprise credit evaluation with DTE-SBD

References

The Nature of Statistical Learning Theory

Estimating the Dimension of a Model

Estimating the dimension of a model

Financial ratios, discriminant analysis and the prediction of corporate bankruptcy

On combining classifiers

Related Papers (5)

Credit scoring with a data mining approach based on support vector machines

A comparative assessment of ensemble learning for credit scoring

Using neural network ensembles for bankruptcy prediction and credit scoring

Neural network credit scoring models

Bagging predictors