Variable selection method for quantitative trait analysis based on parallel genetic algorithm.
TL;DR: This study uses parallel genetic algorithm (PGA) to identify genetic and environmental factors in genetic association studies of complex human diseases and shows that PGA is able to choose the variables correctly and is also an easy‐to‐use variable selection tool.
read more
Abstract: Selection of important genetic and environmental factors is of strong interest in quantitative trait analyses. In this study, we use parallel genetic algorithm (PGA) to identify genetic and environmental factors in genetic association studies of complex human diseases. Our method can take account of both multiple markers across the genome and environmental factors, and also can be used to do fine mapping based on the results of haplotype analysis to select the markers that are associated with the quantitative traits. Using both simulated and real examples, we show that PGA is able to choose the variables correctly and is also an easy-to-use variable selection tool.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
•Journal Article
Darwinian evolution in parallel universes: A parallel genetic algorithm for variable selection
Mu Zhu,Hugh A. Chipman +1 more
TL;DR: This article first demonstrates that the GA is actually not a particularly effective variable selection tool, and then proposes a very simple modification which is to run a number of GAs in parallel without allowing each GA to fully converge, and to consolidate the information from all the individual GA in the end.
39
Higher order interactions: detection of epistasis using machine learning and evolutionary computation.
TL;DR: Here it is shown that evolutionary algorithms are a useful tool in high-dimensional analyses designed to identify gene-gene interactions in current large-scale genotypic data.
14
Big data challenges in genomics
Hongyan Xu
- 01 Jan 2020
TL;DR: The big data challenges and opportunities in genomics research are discussed and possible solutions for these challenges are discussed, which can serve as the basis for future research.
5
Selection of a stroke risk model based on transcranial Doppler ultrasound velocity
TL;DR: The parallel genetic algorithm (PGA) is used to select a stroke risk model with TCD velocity as the response variable and it is shown that the PGA is an easy-to-use computationally variable selection tool.
2
An Improved Diploid Genetic Algorithm for Head-Independency Hydro Plant Scheduling
Li He,Jiaping Liao,Tao Huang,Hui Li +3 more
- 17 Sep 2011
TL;DR: DGA has a better performance than HGA in terms of preserving the diversity and an improved method of mutation operator for DGA was developed to deal with the nonlinear optimization problem for hydro scheduling in power market.
1
References
Genetic algorithms in search, optimization and machine learning
David E. Goldberg
- 01 Jan 1989
TL;DR: This book brings together the computer techniques, mathematical tools, and research results that will enable both students and practitioners to apply genetic algorithms to problems in many fields.
58.6K
•Book
Genetic algorithms in search, optimization, and machine learning
David E. Goldberg
- 01 Sep 1988
TL;DR: In this article, the authors present the computer techniques, mathematical tools, and research results that will enable both students and practitioners to apply genetic algorithms to problems in many fields, including computer programming and mathematics.
Generalized Cross-Validation as a Method for Choosing a Good Ridge Parameter
TL;DR: The generalized cross-validation (GCV) method as discussed by the authors is a generalized version of Allen's PRESS, which can be used in subset selection and singular value truncation, and even to choose from among mixtures of these methods.
Variable selection via Gibbs sampling
TL;DR: In this paper, the Gibbs sampler is used to indirectly sample from the multinomial posterior distribution on the set of possible subset choices to identify the promising subsets by their more frequent appearance in the Gibbs sample.
3.1K
Optimization of Control Parameters for Genetic Algorithms
John J. Grefenstette
- 01 Jan 1986
TL;DR: GA's are shown to be effective for both levels of the systems optimization problem and are applied to the second level task of identifying efficient GA's for a set of numerical optimization problems.
3.1K