A Hybrid Feature Selection Algorithm For Classification Unbalanced Data Processsing

doi:10.1109/SMARTIOT.2018.00055

Proceedings Article10.1109/SMARTIOT.2018.00055

A Hybrid Feature Selection Algorithm For Classification Unbalanced Data Processsing

Xue Zhang, +3 more

- 01 Aug 2018

- pp 269-275

17

TL;DR: A hybrid feature selection algorithm is proposed to process the two classification unbalanced data problem and multi classification problem and its results show that the area under receiver operating characteristic curve for two classifications and the accuracy rate forMulti classification problem have been improved compared with other models.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.1002/DAC.4812

A fog computing data reduce level to enhance the cloud of things performance

Tarek Moulahi, +6 more

- 10 Apr 2021

- International Journal of Communication S...

TL;DR: It is believed that data reduction can provide a blueprint for avoiding unnecessary data storage and processing and demonstrates the efficacy of the reduced ML models.

...read moreread less

14

Proceedings Article•10.1109/IWCMC.2019.8766790

Enhancing cloud of things performance by avoiding unnecessary data through artificial intelligence tools

Sami Mahfoudhi, +2 more

- 24 Jun 2019

TL;DR: This research focuses on how to use artificial intelligence to segregate unnecessary data collected by things, to avoid unnecessary charging of storage and processing resources of things as well as of cloud.

...read moreread less

12

Journal Article•10.1016/j.compbiomed.2023.107075

An intent classification method for questions in "Treatise on Febrile diseases" based on TinyBERT-CNN fusion model

Helong Yu, +5 more

- 01 May 2023

- Computers in Biology and Medicine

TL;DR: In this paper , a knowledge distillation-based bidirectional Transformer encoder combined with a convolutional neural network model (TinyBERT-CNN) was used for the task of question intent classification in "Treatise on Febrile Diseases", which used TinyBERT as an embedding and encoding layer to obtain the global vector information of the text and then completed the intent classification by feeding the encoded feature information into the CNN.

...read moreread less

11

Book Chapter•10.1007/978-3-030-79150-6_6

Comparative Study of Embedded Feature Selection Methods on Microarray Data

Hind Hamla, +1 more

- 25 Jun 2021

TL;DR: In this paper, the authors compared the performance of five embedded feature selection methods namely decision tree, random forest, lasso, ridge, and SVM-RFE in the classification of microarray data.

...read moreread less

7

Journal Article•10.1007/s13042-022-01663-y

Feature selection based on a hybrid simplified particle swarm optimization algorithm with maximum separation and minimum redundancy

Liqin Sun, +3 more

- 21 Oct 2022

- International Journal of Machine Learnin...

TL;DR: A hybrid simplified PSO-based feature selection algorithm with the elite strategy (HECSPSO) is proposed, which can achieve a feature subset with better performance, and is a highly competitive algorithm for feature selection.

...read moreread less

7

...

Expand

References

Journal Article•10.1145/1007730.1007735

A study of the behavior of several methods for balancing machine learning training data

Gustavo E. A. P. A. Batista, +2 more

- 01 Jun 2004

- Sigkdd Explorations

TL;DR: This work performs a broad experimental evaluation involving ten methods, three of them proposed by the authors, to deal with the class imbalance problem in thirteen UCI data sets, and shows that, in general, over-sampling methods provide more accurate results than under-sampled methods considering the area under the ROC curve (AUC).

...read moreread less

3.9K

UCI Repository of Machine Learning Databases

P. M. Murphy

- 01 Jan 1994

1.7K

Journal Article•10.1145/1007730.1007734

Mining with rarity: a unifying framework

Gary M. Weiss

- 01 Jun 2004

- Sigkdd Explorations

TL;DR: It is demonstrated that rare classes and rare cases are very similar phenomena---both forms of rarity are shown to cause similar problems during data mining and benefit from the same remediation methods.

...read moreread less

1.5K

Journal Article•10.1109/TC.1977.1674939

A Branch and Bound Algorithm for Feature Subset Selection

Narendra, +1 more

- 01 Sep 1977

- IEEE Transactions on Computers

TL;DR: In this paper, a branch and bound-based feature subset selection algorithm is proposed to select the best subset of m features from an n-feature set without exhaustive search, which is computationally computationally unfeasible.

...read moreread less

1.4K

Journal Article•10.1016/J.JNCA.2015.11.016

A survey of network anomaly detection techniques

Mohiuddin Ahmed, +2 more

- 01 Jan 2016

- Journal of Network and Computer Applicat...

TL;DR: This paper presents an in-depth analysis of four major categories of anomaly detection techniques which include classification, statistical, information theory and clustering and evaluates effectiveness of different categories of techniques.

...read moreread less

1.4K