Mixed linear model approach adapted for genome-wide association studies.
Zhiwu Zhang, Elhan S. Ersoz, Chao-Qiang Lai, Rory J. Todhunter, Hemant K. Tiwari, Michael A. Gore, Peter J. Bradbury, Jianming Yu, Donna K. Arnett, Jose M. Ordovas, Edward S. Buckler
TL;DR: The authors report a compression approach, 'compressed MLM', that decreases the effective sample size of large genome-wide association datasets by clustering individuals into groups, and a complementary approach, 'population parameters previously determined' (P3D), that eliminates the need to re-compute variance components.
Abstract: Mixed linear model (MLM) methods have proven useful in controlling for population structure and relatedness within genome-wide association studies. However, MLM-based methods can be computationally challenging for large datasets. We report a compression approach, called ‘compressed MLM’, that decreases the effective sample size of such datasets by clustering individuals into groups. We also present a complementary approach, ‘population parameters previously determined’ (P3D), that eliminates the need to re-compute variance components. We applied these two methods both independently and combined in selected genetic association datasets from human, dog and maize. The joint implementation of these two methods markedly reduced computing time and either maintained or improved statistical power. We used simulations to demonstrate the usefulness in controlling for substructure in genetic association datasets for a range of species and genetic architectures. We have made these methods available within an implementation of the software program TASSEL.
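The compression idea in the abstract, clustering individuals into groups so that the model works with fewer units, can be illustrated with a minimal sketch. It assumes a small toy kinship matrix and a greedy average-linkage merge; the function name `compress_kinship`, the clustering rule, and the choice to average kinship within and between groups are illustrative assumptions, not the TASSEL implementation.

```python
from itertools import product


def compress_kinship(kinship, n_groups):
    """Greedy average-linkage grouping of individuals (illustrative assumption).

    Returns the groups (lists of individual indices) and a group-level
    kinship matrix whose entries are mean pairwise kinships.
    """
    groups = [[i] for i in range(len(kinship))]  # one group per individual

    def mean_kinship(a, b):
        # Average kinship over all pairs (i in a, j in b).
        return sum(kinship[i][j] for i, j in product(a, b)) / (len(a) * len(b))

    while len(groups) > n_groups:
        # Find the two groups with the highest mean between-group kinship.
        p, q = max(
            ((p, q) for p in range(len(groups)) for q in range(p + 1, len(groups))),
            key=lambda pq: mean_kinship(groups[pq[0]], groups[pq[1]]),
        )
        merged = groups[p] + groups[q]
        groups = [g for idx, g in enumerate(groups) if idx not in (p, q)]
        groups.append(merged)

    groups = sorted(sorted(g) for g in groups)
    group_kinship = [[mean_kinship(a, b) for b in groups] for a in groups]
    return groups, group_kinship


# Toy kinship matrix (hypothetical values): individuals 0 and 1 are close
# relatives, as are individuals 2 and 3.
kinship = [
    [1.0, 0.8, 0.1, 0.0],
    [0.8, 1.0, 0.0, 0.1],
    [0.1, 0.0, 1.0, 0.7],
    [0.0, 0.1, 0.7, 1.0],
]

groups, group_kinship = compress_kinship(kinship, n_groups=2)
print(groups)  # [[0, 1], [2, 3]]
```

In this sketch the four individuals collapse into two groups, so a model fitted on the group-level kinship matrix would work with a 2 x 2 matrix instead of a 4 x 4 one. P3D is complementary: under that approach the variance components would be estimated once and then reused for every marker test rather than re-estimated each time.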
Citations
Identification of genetic variants associated with maize flowering time using an extremely large multi-genetic background population.
Yongxiang Li, Chunhui Li, Peter J. Bradbury, Xiaolei Liu, Fei Lu, Cinta Romay, Jeffrey C. Glaubitz, Xun Wu, Bo Peng, Yunsu Shi, Yanchun Song, Dengfeng Zhang, Edward S. Buckler, Zhiwu Zhang, Yu Li, Tianyu Wang
TL;DR: To detect more maize flowering time variants, an extremely large multi-genetic background population containing more than 8000 lines was evaluated under multiple Sino-United States environments, and nearly 1 million single-nucleotide polymorphisms (SNPs) were used in the analyses.
137
A high-throughput, field-based phenotyping technology for tall biomass crops
TL;DR: Phenobot 1.0, an auto-steered and self-propelled field-based high-throughput phenotyping platform for tall, dense-canopy crops such as sorghum, was developed and tested, and it proved robust in obtaining ground-based high-throughput plant architecture parameters of sorghum, a tall and densely planted crop species.
135
A group VII ethylene response factor gene, ZmEREB180, coordinates waterlogging tolerance in maize seedlings.
TL;DR: This study provides direct evidence that a ZmERFVII gene is involved in waterlogging tolerance; the finding could be applied directly to breed waterlogging-tolerant maize cultivars and improves the understanding of waterlogging stress.
134
A genome-wide association study of 23 agronomic traits in Chinese wheat landraces.
TL;DR: This study should be helpful in elucidating the underlying genetic mechanisms of complex agronomic traits and performing marker-assisted selection in wheat.
134
A Genomic Resource for the Development, Improvement, and Exploitation of Sorghum for Bioenergy
Zachary Brenton, Elizabeth A. Cooper, Mathew T Myers, Richard Boyles, Nadia Shakoor, Kelsey J. Zielinski, Bradley L. Rauh, William C. Bridges, Geoffrey P. Morris, Stephen Kresovich
TL;DR: The value of sorghum as a functional model for candidate gene discovery for bioenergy Andropogoneae is demonstrated by performing genome-wide association analysis for two contrasting phenotypes representing key components of structural and non-structural carbohydrates.
134
References
Inference of population structure using multilocus genotype data
TL;DR: Pritchard et al. proposed a model-based clustering method that uses multilocus genotype data to infer population structure and assign individuals to populations; it can be applied to most commonly used genetic markers, provided that they are not closely linked.
Data clustering: a review
TL;DR: An overview of pattern clustering methods from a statistical pattern recognition perspective is presented, with a goal of providing useful advice and references to fundamental concepts accessible to the broad community of clustering practitioners.
TASSEL: software for association mapping of complex traits in diverse samples
Peter J. Bradbury, Zhiwu Zhang, Dallas E. Kroon, Terry M. Casstevens, Yogesh Ramdoss, Edward S. Buckler
TL;DR: TASSEL (Trait Analysis by aSSociation, Evolution and Linkage) implements general linear model and mixed linear model approaches for controlling population and family structure and allows for linkage disequilibrium statistics to be calculated and visualized graphically.
7.2K
A unified mixed-model method for association mapping that accounts for multiple levels of relatedness
Jianming Yu, Gaël Pressoir, William H. Briggs, Irie Vroh Bi, Masanori Yamasaki, John Doebley, Michael D. McMullen, Brandon S. Gaut, Dahlia M. Nielsen, James B. Holland, Stephen Kresovich, Edward S. Buckler
TL;DR: A unified mixed-model approach to account for multiple levels of relatedness simultaneously as detected by random genetic markers is developed and provides a powerful complement to currently available methods for association mapping.
4.1K