P System as a Computing Tool for Embedded Feature Selection and Classification Method for Microarray Cancer Data

doi:10.1007/978-3-030-77102-7_6

P System as a Computing Tool for Embedded Feature Selection and Classification Method for Microarray Cancer Data

- 14 Sep 2020

- pp 94-125

TL;DR: In this article, a multi-objective binary particle swarm optimization (MObPSO) algorithm was proposed to select informative genes from microarray data, where the kernel P system (kP) was used as the variant of the P system to improve the performance of the intelligent algorithm.

Abstract: Selection of relevant genes is the crucial task for sample classification in microarray data, where researchers try to identify the smallest possible set of genes that can still achieve good predictive performance. Due to the problem of higher risk of overfitting in wrapper methods and sensitivity of the best embedded way to filter out factor that leads to unstable model and significantly different gene subsets, in this paper, we propose a novel model for evaluating and improving techniques for selecting informative genes from microarray data. This model inspired by membrane computing and used the kernel P system (kP) as the variant of the P system to improve the performance of the intelligent algorithm, multi-objective binary particle swarm optimization (MObPSO). The proposed model consists of two main parts. First, kP-MObPSO, which resembles a wrapper type feature selection, and the second part that improves the results of the first part through an embedded feature selection and classification idea based on the kP system. Division, rewriting, and input/output rules are used to make interaction among the genes inside and between the particles. The proposed model applied to the colorectal and breast dataset contains 100 genes with six attributes. The embedded part of the model extracts the marker gene sets indicate more stability and reliability based on ROC measure as well as better error rate in comparison to the wrapper part of the model. In the paper, the lowest error rate by an embedded model is displayed as 0.1111 for breast cancer and 0.0769 for colorectal data.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

References

Journal Article•10.1023/A:1009715923555

A Tutorial on Support Vector Machines for Pattern Recognition

Christopher John Burges

- 01 Jun 1998

- Data Mining and Knowledge Discovery

TL;DR: There are several arguments which support the observed high accuracy of SVMs, which are reviewed and numerous examples and proofs of most of the key theorems are given.

...read moreread less

17.8K

•Journal Article•10.1023/A:1012487302797

Gene Selection for Cancer Classification using Support Vector Machines

Isabelle Guyon, +3 more

- 11 Mar 2002

- Machine Learning

TL;DR: In this article, a Support Vector Machine (SVM) method based on recursive feature elimination (RFE) was proposed to select a small subset of genes from broad patterns of gene expression data, recorded on DNA micro-arrays.

...read moreread less

9.5K

Correlation-based Feature Selection for Machine Learning

Mark Hall

- 01 Jan 1998

TL;DR: This thesis addresses the problem of feature selection for machine learning through a correlation based approach with CFS (Correlation based Feature Selection), an algorithm that couples this evaluation formula with an appropriate correlation measure and a heuristic search strategy.

...read moreread less

4.1K

•Journal Article•10.1093/BIOINFORMATICS/16.10.906

Support vector machine classification and validation of cancer tissue samples using microarray expression data

Terrence S. Furey, +5 more

- 01 Oct 2000

- Bioinformatics

TL;DR: A new method to analyse tissue samples using support vector machines for mis-labeled or questionable tissue results and shows that other machine learning methods also perform comparably to the SVM on many of those datasets.

...read moreread less

2.7K

•Journal Article•10.3322/CAAC.21220

Colorectal cancer statistics, 2014

Rebecca L. Siegel, +2 more

- 01 Mar 2014

- CA: A Cancer Journal for Clinicians

TL;DR: Progress in reducing colorectal cancer death rates can be accelerated by improving access to and use of screening and standard treatment in all populations, including the most current data on incidence, survival, and mortality rates and trends.

...read moreread less

2.6K