A Hybrid Sampling SVM Approach to Imbalanced Data Classification

doi:10.1155/2014/972786

Open AccessJournal Article10.1155/2014/972786

A Hybrid Sampling SVM Approach to Imbalanced Data Classification

Qiang Wang

- 12 Jun 2014

- Abstract and Applied Analysis

- Vol. 2014, pp 1-7

60

TL;DR: A hybrid sampling SVM approach is proposed combining an oversampled technique and an undersampling technique for addressing the imbalanced data classification problem and generates a balanced training dataset to replace the original imbalanced training dataset.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.1016/J.PATCOG.2018.03.008

Handling data irregularities in classification: Foundations, trends, and future challenges

Swagatam Das, +2 more

- 01 Sep 2018

- Pattern Recognition

TL;DR: This article provides a bird's eye view of data irregularities, beginning with a taxonomy and characterization of various distribution-based and feature-based irregularities, and discusses the notable and recent approaches that have been taken to make the existing stand-alone as well as ensemble classifiers robust against such irregularities.

...read moreread less

198

Journal Article•10.1016/J.NEUNET.2015.06.005

Near-Bayesian Support Vector Machines for imbalanced data classification with equal or unequal misclassification costs

Shounak Datta, +1 more

- 01 Oct 2015

- Neural Networks

TL;DR: A Near-Bayesian Support Vector Machine (NBSVM) is proposed for such imbalanced classification problems, by combining the philosophies of decision boundary shift and unequal regularization costs.

...read moreread less

164

Book Chapter•10.1007/978-981-10-6602-3_3

Comparing the Behavior of Oversampling and Undersampling Approach of Class Imbalance Learning by Combining Class Imbalance Problem with Noise

Prabhjot Kaur, +1 more

- 01 Jan 2018

TL;DR: This paper compares the oversampling and undersampling approaches of class imbalance learning in noisy environment and tries to find out which is the better approach in such case.

...read moreread less

136

Journal Article•10.1016/J.ASOC.2018.07.003

A robust fuzzy least squares twin support vector machine for class imbalance learning

Bharat Richhariya, +1 more

- 01 Oct 2018

- Applied Soft Computing

TL;DR: This paper proposes a robust fuzzy least squares twin support vector machine for class imbalance learning termed as RFLSTSVM-CIL using 2-norm of the slack variables which makes the optimization problem strongly convex.

...read moreread less

76

Journal Article•10.1080/07421222.2017.1394056

A Data-Mining Approach to Identification of Risk Factors in Safety Management Systems

Donghui Shi, +3 more

- 02 Oct 2017

- Journal of Management Information System...

TL;DR: A data-mining approach to incident risk factor identification and analysis using data from the Aviation Safety Reporting System is presented, in an attempt to overcome obstacles related to labor intensive manual identification of risk factors as well as incomplete data.

...read moreread less

70

...

Expand

References

•Book

The Nature of Statistical Learning Theory

Vladimir Vapnik

- 01 Jan 1995

TL;DR: Setting of the learning problem consistency of learning processes bounds on the rate of convergence ofLearning processes controlling the generalization ability of learning process constructing learning algorithms what is important in learning theory?

...read moreread less

46K

•Journal Article•10.1613/JAIR.953

SMOTE: synthetic minority over-sampling technique

Nitesh V. Chawla, +3 more

- 01 Jan 2002

- Journal of Artificial Intelligence Resea...

TL;DR: In this article, a method of over-sampling the minority class involves creating synthetic minority class examples, which is evaluated using the area under the Receiver Operating Characteristic curve (AUC) and the ROC convex hull strategy.

...read moreread less

27.7K

•Proceedings Article•10.1145/130385.130401

A training algorithm for optimal margin classifiers

Bernhard E. Boser, +2 more

- 01 Jul 1992

TL;DR: A training algorithm that maximizes the margin between the training patterns and the decision boundary is presented, applicable to a wide variety of the classification functions, including Perceptrons, polynomials, and Radial Basis Functions.

...read moreread less

13.1K

•Journal Article•10.1613/JAIR.953

SMOTE: Synthetic Minority Over-sampling Technique

Nitesh V. Chawla, +3 more

- 09 Jun 2011

- arXiv: Artificial Intelligence

TL;DR: In this article, a method of over-sampling the minority class involves creating synthetic minority class examples, which is evaluated using the area under the Receiver Operating Characteristic curve (AUC) and the ROC convex hull strategy.

...read moreread less

11.5K

Journal Article•10.1109/TKDE.2008.239

Learning from Imbalanced Data

Haibo He, +1 more

- 01 Sep 2009

- IEEE Transactions on Knowledge and Data ...

TL;DR: A critical review of the nature of the problem, the state-of-the-art technologies, and the current assessment metrics used to evaluate learning performance under the imbalanced learning scenario is provided.

...read moreread less

8.2K