A review of feature selection techniques in bioinformatics

doi:10.1093/BIOINFORMATICS/BTM344

Open AccessJournal Article10.1093/BIOINFORMATICS/BTM344

A review of feature selection techniques in bioinformatics

Yvan Saeys, +2 more

- 10 Sep 2007

- Bioinformatics

- Vol. 23, Iss: 19, pp 2507-2517

5.3K

TL;DR: A basic taxonomy of feature selection techniques is provided, providing their use, variety and potential in a number of both common as well as upcoming bioinformatics applications.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Book

Applied Predictive Modeling

Max Kuhn, +1 more

- 17 May 2013

TL;DR: This research presents a novel and scalable approach called “Smartfitting” that automates the very labor-intensive and therefore time-heavy and therefore expensive and expensive process of designing and implementing statistical models for regression models.

...read moreread less

5.9K

Journal Article•10.1016/J.ISPRSJPRS.2016.01.011

Random forest in remote sensing: A review of applications and future directions

Mariana Belgiu, +1 more

- 01 Apr 2016

- Isprs Journal of Photogrammetry and Remo...

TL;DR: This review has revealed that RF classifier can successfully handle high data dimensionality and multicolinearity, being both fast and insensitive to overfitting.

...read moreread less

5.2K

•Journal Article•10.1145/3136625

Feature Selection: A Data Perspective

Jundong Li, +6 more

- 29 Jan 2016

- arXiv: Learning

TL;DR: Feature selection, as a data preprocessing strategy, has proven to be effective and efficient in preparing data (especially high-dimensional data) for various data mining and machine learning problems.

...read moreread less

2.2K

•Journal Article•10.1145/3136625

Feature Selection: A Data Perspective

Jundong Li, +6 more

- 06 Dec 2017

- ACM Computing Surveys

TL;DR: This survey revisits feature selection research from a data perspective and reviews representative feature selection algorithms for conventional data, structured data, heterogeneous data and streaming data, and categorizes them into four main groups: similarity- based, information-theoretical-based, sparse-learning-based and statistical-based.

...read moreread less

2.2K

•Journal Article•10.1371/JOURNAL.PONE.0012776

Inferring Regulatory Networks from Expression Data Using Tree-Based Methods

Vân Anh Huynh-Thu, +3 more

- 28 Sep 2010

- PLOS ONE

TL;DR: This article presents GENIE3, a new algorithm for the inference of GRNs that was best performer in the DREAM4 In Silico Multifactorial challenge and compares favorably with existing algorithms to decipher the genetic regulatory network of Escherichia coli.

...read moreread less

2K

...

Expand

References

•Journal Article•10.1093/NAR/27.23.4636

Improved microbial gene identification with GLIMMER

Arthur L. Delcher, +4 more

- 01 Dec 1999

- Nucleic Acids Research

TL;DR: Significant technical improvements to GLIMMER are reported that improve its accuracy still further, and a comprehensive evaluation demonstrates that the accuracy of the system is likely to be higher than previously recognized.

...read moreread less

2.5K

•Journal Article•10.1038/73432

Systematic variation in gene expression patterns in human cancer cell lines.

Douglas T. Ross, +17 more

- 01 Mar 2000

- Nature Genetics

TL;DR: Using cDNA microarrays to explore the variation in expression of approximately 8,000 unique genes among the 60 cell lines used in the National Cancer Institute's screen for anti-cancer drugs provided a novel molecular characterization of this important group of human cell lines and their relationships to tumours in vivo.

...read moreread less

2.3K

•Book

Feature Selection for Knowledge Discovery and Data Mining

Huan Liu, +1 more

- 31 Jul 1998

TL;DR: Feature Selection for Knowledge Discovery and Data Mining offers an overview of the methods developed since the 1970's and provides a general framework in order to examine these methods and categorize them and suggests guidelines for how to use different methods under various circumstances.

...read moreread less

2.2K

•Journal Article

Efficient Feature Selection via Analysis of Relevance and Redundancy

Lei Yu, +1 more

- 01 Dec 2004

- Journal of Machine Learning Research

TL;DR: It is shown that feature relevance alone is insufficient for efficient feature selection of high-dimensional data, and a new framework is introduced that decouples relevance analysis and redundancy analysis.

...read moreread less

2.2K

•Journal Article•10.1198/016214501753382129

Empirical Bayes analysis of a microarray experiment

Bradley Efron, +3 more

- 01 Dec 2001

- Journal of the American Statistical Asso...

TL;DR: A simple nonparametric empirical Bayes model is introduced, which is used to guide the efficient reduction of the data to a single summary statistic per gene, and also to make simultaneous inferences concerning which genes were affected by the radiation.

...read moreread less

2K