Journal Article10.1007/S13042-016-0524-0
A selective neural network ensemble classification for incomplete data
24
TL;DR: Experiments show that SNNE outperforms the NNE-based algorithms compared and can greatly reduce the running time when dealing with datasets with larger number of missing values.
read more
Abstract: Neural network ensemble (NNE) is a simple and effective method to deal with incomplete data for classification. However, with the increase in the number of missing values, the number of incomplete feature combinations (feature subsets) grown rapidly which makes the NNE method very time-consuming and the accuracy is also need to be improved. In this paper, we propose a selective neural network ensemble (SNNE) classification for incomplete data. The SNNE first obtains all the available feature subsets of the incomplete dataset and then applies mutual information to measure the importance (relevance) degree of each feature subset. After that, an optimization process is applied to remove the feature subsets by satisfying the following condition: there is at least a feature subset contained in the removed feature subset and the difference of their importance degree is smaller than a given threshold δ. Finally, the rest of the feature subsets were used to train a group of neural networks and the classification for a given sample is decided by weighted majority voting of all available components in the ensemble. Experimental results show that δ = 0.05 is reasonable in our study. It can improve the efficiency of the algorithm without loss the algorithm accuracy. Experiments also show that SNNE outperforms the NNE-based algorithms compared. In addition, it can greatly reduce the running time when dealing with datasets with larger number of missing values.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Missing value imputation: a review and analysis of the literature (2006–2017)
TL;DR: This paper aims at reviewing and analyzing related studies carried out in recent decades, from the experimental design perspective, and identifying limitations in the existing body of literature based upon which some directions for future research can be gleaned.
473
Optimal resource allocation using reinforcement learning for IoT content-centric services
Keke Gai,Meikang Qiu,Meikang Qiu +2 more
TL;DR: A novel approach is proposed, which utilizes the mechanism of Reinforcement Learning (RL) to obtain high accurate QoE in resource allocations in IoT and utilizes the satisfactory level of Quality of Experience (QoE) to achieve intelligent content-centric services.
206
Predicting breast cancer biopsy outcomes from BI-RADS findings using random forests with chi-square and MI features
TL;DR: A comparative analysis indicates that the proposed model is superior in terms of various efficiency indicators to the RF classifier and all standard models used in the study, and points out that the suggested approach is a comparable approach to different classification models present in the relevant literature.
44
Efficient extreme learning machine via very sparse random projection
Chuangquan Chen,Chi-Man Vong,Chi-Man Wong,Weiru Wang,Pak Kin Wong +4 more
- 19 Mar 2018
TL;DR: An efficient ELM via very sparse random projection (VSRP) called VSRP-ELM is proposed for training with large N and L, which adds a novel compression layer between the hidden layer and output layer, which compresses the dimension of thehidden layer from N×L to N×k under projection with random sparse-Bernoulli matrix.
25
Incremental multiple kernel extreme learning machine and its application in Robo-advisors
Jingming Xue,Qiang Liu,Miaomiao Li,Xinwang Liu,Yongkai Ye,Siqi Wang,Jianping Yin +6 more
- 01 Jun 2018
TL;DR: An incremental multiple kernel extreme learning machine (IMK-ELM) model, which initializes a generic training database and then tunes itself to the classification task, and is able to efficiently deal with large-scale tasks like Robo-advisors.
23
References
A mathematical theory of communication
TL;DR: This final installment of the paper considers the case where the signals or the messages or both are continuously variable, in contrast with the discrete nature assumed until now.
74.4K
•Journal Article
The mathematical theory of communication
Claude E. Shannon,Warren Weaver +1 more
TL;DR: The Mathematical Theory of Communication (MTOC) as discussed by the authors was originally published as a paper on communication theory more than fifty years ago and has since gone through four hardcover and sixteen paperback printings.
36.2K
•Book
Statistical Analysis with Missing Data
Roderick J. A. Little,Donald B. Rubin +1 more
- 01 Jan 1987
TL;DR: This work states that maximum Likelihood for General Patterns of Missing Data: Introduction and Theory with Ignorable Nonresponse and large-Sample Inference Based on Maximum Likelihood Estimates is likely to be high.
18.3K
Bagging predictors
Leo Breiman
- 01 Aug 1996
TL;DR: Tests on real and simulated data sets using classification and regression trees and subset selection in linear regression show that bagging can give substantial gains in accuracy.