A review of feature selection techniques in bioinformatics
5.3K
TL;DR: A basic taxonomy of feature selection techniques is provided, and their use, variety and potential are discussed for a number of common and upcoming bioinformatics applications.
Abstract: Feature selection techniques have become an apparent need in many bioinformatics applications. In addition to the large pool of techniques that have already been developed in the machine learning and data mining fields, specific applications in bioinformatics have led to a wealth of newly proposed techniques.
In this article, we make the interested reader aware of the possibilities of feature selection, providing a basic taxonomy of feature selection techniques, and discussing their use, variety and potential in a number of both common as well as upcoming bioinformatics applications.
Contact: yvan.saeys@psb.ugent.be
Supplementary information: http://bioinformatics.psb.ugent.be/supplementary_data/yvsae/fsreview
Citations
Learning from class-imbalanced data
TL;DR: An in-depth review of rare event detection from an imbalanced learning perspective and a comprehensive taxonomy of the existing application domains of imbalanced learning are provided.
2K
•Proceedings Article
Efficient and Robust Feature Selection via Joint ℓ2,1-Norms Minimization
Feiping Nie, Heng Huang, Xiao Cai, Chris Ding
- 06 Dec 2010
TL;DR: A new robust feature selection method is proposed that emphasizes joint ℓ2,1-norm minimization on both the loss function and the regularization, and it has been applied to both genomic and proteomic biomarker discovery.
Wisdom of crowds for robust gene network inference
Daniel Marbach, James C. Costello, Robert Küffner, Nicole M. Vega, Robert J. Prill, Diogo M. Camacho, Kyle R. Allison, Andrej Aderhold, Richard Bonneau, Yukun Chen, James J. Collins, Francesca Cordero, Martin Crane, Frank Dondelinger, Mathias Drton, Roberto Esposito, Rina Foygel, Alberto de la Fuente, Jan Gertheiss, Pierre Geurts, Alex Greenfield, Marco Grzegorczyk, Anne-Claire Haury, Benjamin Holmes, Torsten Hothorn, Dirk Husmeier, Vân Anh Huynh-Thu, Alexandre Irrthum, Manolis Kellis, Guy Karlebach, Sophie Lèbre, Vincenzo De Leo, Aviv Madar, Subramani Mani, Fantine Mordelet, Harry Ostrer, Zhengyu Ouyang, Ravi Pandya, Tobias Petri, Andrea Pinna, Christopher S. Poultney, Serena Rezny, Heather J. Ruskin, Yvan Saeys, Ron Shamir, Alina Sîrbu, Mingzhou Song, Nicola Soranzo, Alexander Statnikov, Gustavo Stolovitzky, Nicci Vega, Paola Vera-Licona, Jean-Philippe Vert, Alessia Visconti, Haizhou Wang, Louis Wehenkel, Lukas Windhager, Yang Zhang, Ralf Zimmer
- 01 Jul 2012
TL;DR: A comprehensive blind assessment of over 30 network inference methods on Escherichia coli, Staphylococcus aureus, Saccharomyces cerevisiae and in silico microarray data defines the performance, data requirements and inherent biases of different inference approaches, and provides guidelines for algorithm application and development.
1.5K
A review of variable selection methods in Partial Least Squares Regression
TL;DR: A review of available methods for variable selection within one of the many modeling approaches for high-throughput data, Partial Least Squares Regression, intended to give an understanding of the characteristics of the methods and a basis for selecting an appropriate method for one's own use.
1.4K
Machine Learning for Medical Imaging.
TL;DR: Deep learning has started to be used; this method has the benefit that it does not require image feature identification and calculation as a first step; rather, features are identified as part of the learning process.
1.3K
References
•Book
Adaptation in natural and artificial systems
John H. Holland
- 01 Jan 1975
TL;DR: A founding work in the area of adaptation and modification, which aims to mimic biological optimization, and some (non-GA) branches of AI.
•Book
Data Mining: Practical Machine Learning Tools and Techniques
Ian H. Witten, Eibe Frank, Mark Hall
- 25 Oct 1999
TL;DR: This highly anticipated third edition of the most acclaimed work on data mining and machine learning will teach you everything you need to know about preparing inputs, interpreting outputs, evaluating results, and the algorithmic methods at the heart of successful data mining.
25.4K
An introduction to variable and feature selection
Isabelle Guyon, André Elisseeff
TL;DR: The contributions of this special issue cover a wide range of aspects of variable selection: providing a better definition of the objective function, feature construction, feature ranking, multivariate feature selection, efficient search methods, and feature validity assessment methods.
Molecular classification of cancer: class discovery and class prediction by gene expression monitoring.
Todd R. Golub, Donna K. Slonim, Pablo Tamayo, Christine Huard, Michelle Gaasenbeek, Jill P. Mesirov, Hilary A. Coller, Mignon L. Loh, James R. Downing, Michael A. Caligiuri, Clara D. Bloomfield, Eric S. Lander
TL;DR: A generic approach to cancer classification based on gene expression monitoring by DNA microarrays is described and applied to human acute leukemias as a test case and suggests a general strategy for discovering and predicting cancer classes for other types of cancer, independent of previous biological knowledge.