Book Chapter10.1007/978-3-030-77102-7_6
P System as a Computing Tool for Embedded Feature Selection and Classification Method for Microarray Cancer Data
Ravie Chandren Muniyandi,Naeimeh Elkhani +1 more
- 14 Sep 2020
- pp 94-125
TL;DR: In this article, a multi-objective binary particle swarm optimization (MObPSO) algorithm was proposed to select informative genes from microarray data, where the kernel P system (kP) was used as the variant of the P system to improve the performance of the intelligent algorithm.
read more
Abstract: Selection of relevant genes is the crucial task for sample classification in microarray data, where researchers try to identify the smallest possible set of genes that can still achieve good predictive performance. Due to the problem of higher risk of overfitting in wrapper methods and sensitivity of the best embedded way to filter out factor that leads to unstable model and significantly different gene subsets, in this paper, we propose a novel model for evaluating and improving techniques for selecting informative genes from microarray data. This model inspired by membrane computing and used the kernel P system (kP) as the variant of the P system to improve the performance of the intelligent algorithm, multi-objective binary particle swarm optimization (MObPSO). The proposed model consists of two main parts. First, kP-MObPSO, which resembles a wrapper type feature selection, and the second part that improves the results of the first part through an embedded feature selection and classification idea based on the kP system. Division, rewriting, and input/output rules are used to make interaction among the genes inside and between the particles. The proposed model applied to the colorectal and breast dataset contains 100 genes with six attributes. The embedded part of the model extracts the marker gene sets indicate more stability and reliability based on ROC measure as well as better error rate in comparison to the wrapper part of the model. In the paper, the lowest error rate by an embedded model is displayed as 0.1111 for breast cancer and 0.0769 for colorectal data.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
References
A Tutorial on Support Vector Machines for Pattern Recognition
TL;DR: There are several arguments which support the observed high accuracy of SVMs, which are reviewed and numerous examples and proofs of most of the key theorems are given.
Gene Selection for Cancer Classification using Support Vector Machines
TL;DR: In this article, a Support Vector Machine (SVM) method based on recursive feature elimination (RFE) was proposed to select a small subset of genes from broad patterns of gene expression data, recorded on DNA micro-arrays.
Correlation-based Feature Selection for Machine Learning
Mark Hall
- 01 Jan 1998
TL;DR: This thesis addresses the problem of feature selection for machine learning through a correlation based approach with CFS (Correlation based Feature Selection), an algorithm that couples this evaluation formula with an appropriate correlation measure and a heuristic search strategy.
Support vector machine classification and validation of cancer tissue samples using microarray expression data
Terrence S. Furey,Nello Cristianini,Nigel Duffy,David W. Bednarski,Michèl Schummer,David Haussler +5 more
TL;DR: A new method to analyse tissue samples using support vector machines for mis-labeled or questionable tissue results and shows that other machine learning methods also perform comparably to the SVM on many of those datasets.
Colorectal cancer statistics, 2014
TL;DR: Progress in reducing colorectal cancer death rates can be accelerated by improving access to and use of screening and standard treatment in all populations, including the most current data on incidence, survival, and mortality rates and trends.
2.6K