Top 243 Nature Genetics papers published in 2019

Journal Article•10.1038/S41588-018-0312-8•

Tumor mutational load predicts survival after immunotherapy across multiple cancer types.

Robert M. Samstein¹, Chung-Han Lee¹,², Alexander N. Shoushtari¹,², Matthew D. Hellmann¹,², Ronglai Shen¹, Yelena Y. Janjigian¹,², David Barron¹, Ahmet Zehir¹, Emmet Jordan¹, Antonio Omuro¹, Thomas Kaley¹, Sviatoslav M. Kendall¹, Robert J. Motzer¹,², A. Ari Hakimi¹, Martin H. Voss¹,², Paul Russo¹, Jonathan E. Rosenberg¹,², Gopa Iyer¹,², Bernard H. Bochner¹, Dean F. Bajorin¹,², Hikmat Al-Ahmadie¹, Jamie E. Chaft¹,², Charles M. Rudin¹,², Gregory J. Riely¹,², Shrujal S. Baxi¹,², Alan L. Ho¹,², Richard J. Wong¹, David G. Pfister¹,², Jedd D. Wolchok¹,², Christopher A. Barker¹, Philip H. Gutin¹, Cameron Brennan¹, Viviane Tabar¹, Ingo K. Mellinghoff¹, Lisa M. DeAngelis¹, Charlotte E. Ariyan¹, Nancy Y. Lee¹, William D. Tap¹,², Mrinal M. Gounder¹,², Sandra P. D'Angelo¹,², Leonard B. Saltz¹,², Zsofia K. Stadler¹,², Howard I. Scher¹,², José Baselga¹,², Pedram Razavi¹,², Christopher A. Klebanoff¹,², Rona Yaeger¹,², Neil H. Segal¹,², Geoffrey Y. Ku¹,², Ronald P. DeMatteo¹, Marc Ladanyi¹, Naiyer A. Rizvi³, Michael F. Berger¹, Nadeem Riaz¹, David B. Solit¹, Timothy A. Chan¹, Luc G. T. Morris¹•Institutions (3)

Memorial Sloan Kettering Cancer Center¹, Cornell University², Columbia University Medical Center³

14 Jan 2019-Nature Genetics

TL;DR: Analysis of advanced cancer patients treated with immune-checkpoint inhibitors shows that tumor mutational burden, as assessed by targeted next-generation sequencing, predicts survival after immunotherapy across multiple cancer types.

Abstract: Immune checkpoint inhibitor (ICI) treatments benefit some patients with metastatic cancers, but predictive biomarkers are needed. Findings in selected cancer types suggest that tumor mutational burden (TMB) may predict clinical response to ICI. To examine this association more broadly, we analyzed the clinical and genomic data of 1,662 advanced cancer patients treated with ICI, and 5,371 non-ICI-treated patients, whose tumors underwent targeted next-generation sequencing (MSK-IMPACT). Among all patients, higher somatic TMB (highest 20% in each histology) was associated with better overall survival. For most cancer histologies, an association between higher TMB and improved survival was observed. The TMB cutpoints associated with improved survival varied markedly between cancer types. These data indicate that TMB is associated with improved survival in patients receiving ICI across a wide variety of cancer types, but that there may not be one universal definition of high TMB.
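
The "highest 20% in each histology" definition in the abstract above can be illustrated with a minimal sketch. The tuple layout, the function name `flag_high_tmb`, the tie handling and the TMB values are all assumptions made for illustration; this is not the authors' pipeline. It shows why a per-histology cutoff yields different absolute TMB thresholds in different cancer types.

```python
from collections import defaultdict

def flag_high_tmb(patients, top_fraction=0.20):
    """Flag patients whose TMB is in the top `top_fraction` of their own histology.

    `patients` is a list of (patient_id, histology, tmb) tuples (hypothetical layout).
    Returns {patient_id: bool}.
    """
    by_histology = defaultdict(list)
    for pid, histology, tmb in patients:
        by_histology[histology].append((tmb, pid))

    flags = {}
    for rows in by_histology.values():
        rows.sort(reverse=True)  # highest TMB first
        n_high = max(1, round(len(rows) * top_fraction))
        cutoff = rows[n_high - 1][0]  # TMB of the last patient inside the top group
        for tmb, pid in rows:
            flags[pid] = tmb >= cutoff  # ties at the cutoff all count as high (assumption)
    return flags

# Made-up values: the same rule gives a cutoff of 60 in one histology and 4 in the other.
patients = [
    ("m1", "melanoma", 2), ("m2", "melanoma", 5), ("m3", "melanoma", 9),
    ("m4", "melanoma", 30), ("m5", "melanoma", 60),
    ("g1", "glioma", 1), ("g2", "glioma", 1), ("g3", "glioma", 2),
    ("g4", "glioma", 3), ("g5", "glioma", 4),
]
flags = flag_high_tmb(patients)
```

In this toy input a glioma patient with TMB 4 is flagged as high while a melanoma patient with TMB 30 is not, which mirrors the abstract's point that there may not be one universal definition of high TMB.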

3,638 citations

Journal Article•10.1038/S41588-019-0358-2•

Genetic meta-analysis of diagnosed Alzheimer’s disease identifies new risk loci and implicates Aβ, tau, immunity and lipid processing

Cohorts for Heart¹, Genetic², Genetic³, Genetic⁴, Environmental Risk in Ad, Polygenic Defining Genetic, Perades⁵•Institutions (5)

Cardiff University¹, University of Lille², Pasteur Institute³, French Institute of Health and Medical Research⁴, Erasmus University Rotterdam⁵

01 Mar 2019-Nature Genetics

TL;DR: Pathway analysis implicates immunity, lipid metabolism, tau binding proteins, and amyloid precursor protein (APP) metabolism, showing that genetic variants affecting APP and Aβ processing are associated not only with early-onset autosomal dominant Alzheimer’s disease but also with LOAD.

Abstract: Risk for late-onset Alzheimer’s disease (LOAD), the most prevalent dementia, is partially driven by genetics. To identify LOAD risk loci, we performed a large genome-wide association meta-analysis of clinically diagnosed LOAD (94,437 individuals). We confirm 20 previous LOAD risk loci and identify five new genome-wide loci (IQCK, ACE, ADAM10, ADAMTS1, and WWOX), two of which (ADAM10, ACE) were identified in a recent genome-wide association (GWAS)-by-familial-proxy of Alzheimer’s or dementia. Fine-mapping of the human leukocyte antigen (HLA) region confirms the neurological and immune-mediated disease haplotype HLA-DR15 as a risk factor for LOAD. Pathway analysis implicates immunity, lipid metabolism, tau binding proteins, and amyloid precursor protein (APP) metabolism, showing that genetic variants affecting APP and Aβ processing are associated not only with early-onset autosomal dominant Alzheimer’s disease but also with LOAD. Analyses of risk genes and pathways show enrichment for rare variants (P = 1.32 × 10⁻⁷), indicating that additional rare variants remain to be identified. We also identify important genetic correlations between LOAD and traits such as family history of dementia and education.

2,572 citations

Journal Article•10.1038/S41588-019-0379-X•

Clinical use of current polygenic risk scores may exacerbate health disparities.

Alicia R. Martin¹, Masahiro Kanai, Yoichiro Kamatani², Yukinori Okada³, Benjamin M. Neale¹,⁴, Mark J. Daly•Institutions (4)

Harvard University¹, Kyoto University², Osaka University³, Broad Institute⁴

29 Mar 2019-Nature Genetics

TL;DR: To realize the full and equitable potential of polygenic risk scores, greater diversity must be prioritized in genetic studies, and summary statistics must be publicly disseminated to ensure that health disparities are not increased for those individuals already most underserved.

Abstract: Polygenic risk scores (PRS) are poised to improve biomedical outcomes via precision medicine. However, the major ethical and scientific challenge surrounding clinical implementation of PRS is that those available today are several times more accurate in individuals of European ancestry than other ancestries. This disparity is an inescapable consequence of Eurocentric biases in genome-wide association studies, thus highlighting that (unlike clinical biomarkers and prescription drugs, which may individually work better in some populations but do not ubiquitously perform far better in European populations) clinical uses of PRS today would systematically afford greater improvement for European-descent populations. Early diversifying efforts show promise in leveling this vast imbalance, even when non-European sample sizes are considerably smaller than the largest studies to date. To realize the full and equitable potential of PRS, greater diversity must be prioritized in genetic studies, and summary statistics must be publicly disseminated to ensure that health disparities are not increased for those individuals already most underserved.

2,348 citations

Journal Article•10.1038/S41588-018-0311-9•

Genome-wide meta-analysis identifies new loci and functional pathways influencing Alzheimer's disease risk.

Iris E. Jansen¹, Jeanne E. Savage¹, Kyoko Watanabe¹, Julien Bryois², Dylan M. Williams², Stacy Steinberg³, Julia Sealock⁴, Ida K. Karlsson²,⁵, Sara Hägg², Lavinia Athanasiu⁶,⁷, Nicola Voyle⁸, Petroula Proitsi⁸, Aree Witoelar⁷, Sven Stringer¹, Dag Aarsland⁸,⁹, Ina S. Almdahl⁶,⁷,¹⁰, Fred Andersen¹¹, Sverre Bergh¹², Francesco Bettella⁷, Sigurbjorn Bjornsson, Anne Brækhus⁶, Geir Bråthen¹³, Christiaan de Leeuw¹, Rahul S. Desikan¹⁴, Srdjan Djurovic⁶,⁷, Logan Dumitrescu¹⁵, Tormod Fladby⁷,¹⁰, Timothy J. Hohman¹⁵, Palmi V. Jonsson¹⁶, Steven J. Kiddle¹⁷, Arvid Rongve¹⁸, Ingvild Saltvedt¹³, Sigrid Botne Sando¹³, Geir Selbæk⁷, Maryam Shoai, Nathan G. Skene²,¹⁹, Jon Snaedal, Eystein Stordal¹³,²⁰, Ingun Ulstein⁶, Yunpeng Wang⁷, Linda R. White¹³, John Hardy, Jens Hjerling-Leffler², Patrick F. Sullivan²,²¹, Wiesje M. van der Flier¹, Richard Dobson, Lea K. Davis¹⁵, Hreinn Stefansson³, Kari Stefansson³, Nancy L. Pedersen², Stephan Ripke²²,²³,²⁴, Ole A. Andreassen⁷, Danielle Posthuma¹,²⁵•Institutions (25)

01 Mar 2019-Nature Genetics

TL;DR: A large genome-wide association study of clinically diagnosed AD and AD-by-proxy identifies new loci and functional pathways that contribute to AD risk and adds novel insights into the neurobiology of AD.

Abstract: Alzheimer's disease (AD) is highly heritable and recent studies have identified over 20 disease-associated genomic loci. Yet these only explain a small proportion of the genetic variance, indicating that undiscovered loci remain. Here, we performed a large genome-wide association study of clinically diagnosed AD and AD-by-proxy (71,880 cases, 383,378 controls). AD-by-proxy, based on parental diagnoses, showed strong genetic correlation with AD (rg = 0.81). Meta-analysis identified 29 risk loci, implicating 215 potential causative genes. Associated genes are strongly expressed in immune-related tissues and cell types (spleen, liver, and microglia). Gene-set analyses indicate biological mechanisms involved in lipid-related processes and degradation of amyloid precursor proteins. We show strong genetic correlations with multiple health-related outcomes, and Mendelian randomization results suggest a protective effect of cognitive ability on AD risk. These results are a step forward in identifying the genetic factors that contribute to AD risk and add novel insights into the neurobiology of AD.

2,143 citations

Journal Article•10.1038/S41588-019-0344-8•

Identification of common genetic risk variants for autism spectrum disorder

Bupgen¹•Institutions (1)

Harvard University¹

01 Mar 2019-Nature Genetics

TL;DR: A genome-wide association meta-analysis of 18,381 autism spectrum disorder cases and 27,969 controls identifies five risk loci, and the authors find quantitative and qualitative polygenic heterogeneity across ASD subtypes.

Abstract: Autism spectrum disorder (ASD) is a highly heritable and heterogeneous group of neurodevelopmental phenotypes diagnosed in more than 1% of children. Common genetic variants contribute substantially to ASD susceptibility, but to date no individual variants have been robustly associated with ASD. With a marked sample-size increase from a unique Danish population resource, we report a genome-wide association meta-analysis of 18,381 individuals with ASD and 27,969 controls that identified five genome-wide-significant loci. Leveraging GWAS results from three phenotypes with significantly overlapping genetic architectures (schizophrenia, major depression, and educational attainment), we identified seven additional loci shared with other traits at equally strict significance levels. Dissecting the polygenic architecture, we found both quantitative and qualitative polygenic heterogeneity across ASD subtypes. These results highlight biological insights, particularly relating to neuronal function and corticogenesis, and establish that GWAS performed at scale will be much more productive in the near term in ASD.

2,110 citations

Journal Article•10.1038/S41588-018-0269-7•

Discovery of the first genome-wide significant risk loci for attention deficit/hyperactivity disorder

Ditte Demontis¹,², Raymond K. Walters³,⁴, Joanna Martin³,⁵,⁶, Manuel Mattheisen, Thomas Damm Als¹,², Esben Agerbo¹,², Gisli Baldursson, Rich Belliveau³, Jonas Bybjerg-Grauholm¹,⁷, Marie Bækvad-Hansen¹,⁷, Felecia Cerrato³, Kimberly Chambert³, Claire Churchhouse³,⁴, Ashley Dumont³, Nicholas Eriksson, Michael J. Gandal, Jacqueline I. Goldstein³,⁴, Katrina L. Grasby⁸, Jakob Grove, Olafur O Gudmundsson⁹,¹⁰, Christine Søholm Hansen¹,⁷,¹¹, Mads E. Hauberg¹,², Mads V. Hollegaard¹,⁷, Daniel P. Howrigan³,⁴, Hailiang Huang³,⁴, Julian Maller³, Alicia R. Martin³,⁴, Nicholas G. Martin⁸, Jennifer L. Moran³, Jonatan Pallesen¹,², Duncan S. Palmer³,⁴, Carsten Bøcker Pedersen¹,², Marianne Giørtz Pedersen¹,², Timothy Poterba³,⁴, Jesper Buchhave Poulsen¹,⁷, Stephan Ripke³,⁴,¹², Elise B. Robinson⁴, F. Kyle Satterstrom³,⁴, Hreinn Stefansson⁹, Christine Stevens³, Patrick Turley³,⁴, G. Bragi Walters⁹,¹⁰, Hyejung Won¹³,¹⁴, Margaret J. Wright¹⁵, Ole A. Andreassen¹⁶, Philip Asherson¹⁷, Christie L. Burton¹⁸, Dorret I. Boomsma¹⁹, Bru Cormand, Søren Dalsgaard², Barbara Franke²⁰, Joel Gelernter²¹,²², Daniel H. Geschwind¹³,¹⁴, Hakon Hakonarson²³, Jan Haavik²⁴,²⁵, Henry R. Kranzler²¹,²⁶, Jonna Kuntsi¹⁷, Kate Langley⁶, Klaus-Peter Lesch²⁷,²⁸,²⁹, Christel M. Middeldorp¹⁵,¹⁹, Andreas Reif³⁰, Luis Augusto Rohde³¹, Panos Roussos, Russell Schachar¹⁸, Pamela Sklar³², Edmund J.S. Sonuga-Barke¹⁷, Patrick F. Sullivan⁵,³³, Anita Thapar⁶, Joyce Y. Tung, Irwin D. Waldman³⁴, Sarah E. Medland⁸, Kari Stefansson⁹,¹⁰, Merete Nordentoft¹,³⁵, David M. Hougaard¹,⁷, Thomas Werge¹,¹¹,³⁵, Ole Mors¹,³⁶, Preben Bo Mortensen, Mark J. Daly, Stephen V. Faraone³⁷, Anders D. Børglum¹,², Benjamin M. Neale³,⁴•Institutions (37)

Lundbeck¹, Aarhus University², Broad Institute³, Harvard University⁴, Karolinska Institutet⁵, Cardiff University⁶, Statens Serum Institut⁷, QIMR Berghofer Medical Research Institute⁸, deCODE genetics⁹, University of Iceland¹⁰, Mental Health Services¹¹, Charité¹², University of California, Los Angeles¹³, Semel Institute for Neuroscience and Human Behavior¹⁴, University of Queensland¹⁵, Oslo University Hospital¹⁶, King's College London¹⁷, University of Toronto¹⁸, VU University Amsterdam¹⁹, Radboud University Nijmegen²⁰, Veterans Health Administration²¹, Yale University²², Children's Hospital of Philadelphia²³, University of Bergen²⁴, Haukeland University Hospital²⁵, University of Pennsylvania²⁶, Maastricht University²⁷, University of Würzburg²⁸, I.M. Sechenov First Moscow State Medical University²⁹, Goethe University Frankfurt³⁰, Universidade Federal do Rio Grande do Sul³¹, Icahn School of Medicine at Mount Sinai³², University of North Carolina at Chapel Hill³³, Emory University³⁴, University of Copenhagen³⁵, Aarhus University Hospital³⁶, State University of New York Upstate Medical University³⁷

01 Jan 2019-Nature Genetics

TL;DR: A genome-wide association meta-analysis of 20,183 individuals diagnosed with ADHD and 35,191 controls identifies variants surpassing genome-wide significance in 12 independent loci and implicates neurodevelopmental pathways and conserved regions of the genome as being involved in underlying ADHD biology.

Abstract: Attention deficit/hyperactivity disorder (ADHD) is a highly heritable childhood behavioral disorder affecting 5% of children and 2.5% of adults. Common genetic variants contribute substantially to ADHD susceptibility, but no variants have been robustly associated with ADHD. We report a genome-wide association meta-analysis of 20,183 individuals diagnosed with ADHD and 35,191 controls that identifies variants surpassing genome-wide significance in 12 independent loci, finding important new information about the underlying biology of ADHD. Associations are enriched in evolutionarily constrained genomic regions and loss-of-function intolerant genes and around brain-expressed regulatory marks. Analyses of three replication studies (a cohort of individuals diagnosed with ADHD, a self-reported ADHD sample and a meta-analysis of quantitative measures of ADHD symptoms in the population) support these findings while highlighting study-specific differences in genetic overlap with educational attainment. Strong concordance with GWAS of quantitative population measures of ADHD symptoms supports that clinical diagnosis of ADHD is an extreme expression of continuous heritable traits.

2,091 citations

Journal Article•10.1038/S41588-018-0307-5•

Association studies of up to 1.2 million individuals yield new insights into the genetic etiology of tobacco and alcohol use

Hunt All-In Psychiatry¹•Institutions (1)

Pennsylvania State University¹

01 Feb 2019-Nature Genetics

TL;DR: Evidence is reported for the involvement of many systems in tobacco and alcohol use, including genes involved in nicotinic, dopaminergic, and glutamatergic neurotransmission; the results provide a solid starting point for evaluating the effects of these loci in model organisms and with more precise substance use measures.

Abstract: Tobacco and alcohol use are leading causes of mortality that influence risk for many complex diseases and disorders¹. They are heritable²,³ and etiologically related⁴,⁵ behaviors that have been resistant to gene discovery efforts⁶⁻¹¹. In sample sizes up to 1.2 million individuals, we discovered 566 genetic variants in 406 loci associated with multiple stages of tobacco use (initiation, cessation, and heaviness) as well as alcohol use, with 150 loci evidencing pleiotropic association. Smoking phenotypes were positively genetically correlated with many health conditions, whereas alcohol use was negatively correlated with these conditions, such that increased genetic risk for alcohol use is associated with lower disease risk. We report evidence for the involvement of many systems in tobacco and alcohol use, including genes involved in nicotinic, dopaminergic, and glutamatergic neurotransmission. The results provide a solid starting point to evaluate the effects of these loci in model organisms and more precise substance use measures.

1,834 citations

Journal Article•10.1038/S41588-019-0397-8•

Genome-wide association study identifies 30 loci associated with bipolar disorder

Eli A. Stahl¹,², Gerome Breen³, Andreas J. Forstner +339 more•Institutions (107)

01 May 2019-Nature Genetics

TL;DR: Genome-wide analysis identifies 30 loci associated with bipolar disorder, allowing for comparisons of shared genes and pathways with other psychiatric disorders, including schizophrenia and depression.

Abstract: Bipolar disorder is a highly heritable psychiatric disorder. We performed a genome-wide association study (GWAS) including 20,352 cases and 31,358 controls of European descent, with follow-up analysis of 822 variants with P < 1 × 10⁻⁴ in an additional 9,412 cases and 137,760 controls. Eight of the 19 variants that were genome-wide significant (P < 5 × 10⁻⁸) in the discovery GWAS were not genome-wide significant in the combined analysis, consistent with small effect sizes and limited power but also with genetic heterogeneity. In the combined analysis, 30 loci were genome-wide significant, including 20 newly identified loci. The significant loci contain genes encoding ion channels, neurotransmitter transporters and synaptic components. Pathway analysis revealed nine significantly enriched gene sets, including regulation of insulin secretion and endocannabinoid signaling. Bipolar I disorder is strongly genetically correlated with schizophrenia, driven by psychosis, whereas bipolar II disorder is more strongly correlated with major depressive disorder. These findings address key clinical questions and provide potential biological mechanisms for bipolar disorder.

1,467 citations

Journal Article•10.1038/S41588-019-0350-X•

Causal relationships among the gut microbiome, short-chain fatty acids and metabolic diseases.

Serena Sanna¹, Natalie R. van Zuydam²,³, Anubha Mahajan², Alexander Kurilshikov¹, Arnau Vich Vila¹, Urmo Võsa¹, Zlatan Mujagic⁴, Ad A.M. Masclee⁴, Daisy Jonkers⁴, Marije Oosting⁵, Leo A. B. Joosten⁵, Mihai G. Netea⁵, Lude Franke¹, Alexandra Zhernakova¹, Jingyuan Fu¹, Cisca Wijmenga⁶, Mark I. McCarthy²,⁷•Institutions (7)

University Medical Center Groningen¹, University of Oxford², Wellcome Trust Centre for Human Genetics³, Maastricht University⁴, Radboud University Nijmegen⁵, University of Oslo⁶, John Radcliffe Hospital⁷

18 Feb 2019-Nature Genetics

TL;DR: Evidence of a causal effect of the gut microbiome on metabolic traits is shown, and the use of Mendelian randomization (MR) is supported as a means to elucidate causal relationships from microbiome-wide association findings.

Abstract: Microbiome-wide association studies on large population cohorts have highlighted associations between the gut microbiome and complex traits, including type 2 diabetes (T2D) and obesity¹. However, the causal relationships remain largely unresolved. We leveraged information from 952 normoglycemic individuals for whom genome-wide genotyping, gut metagenomic sequence and fecal short-chain fatty acid (SCFA) levels were available², then combined this information with genome-wide-association summary statistics for 17 metabolic and anthropometric traits. Using bidirectional Mendelian randomization (MR) analyses to assess causality³, we found that the host-genetic-driven increase in gut production of the SCFA butyrate was associated with improved insulin response after an oral glucose-tolerance test (P = 9.8 × 10⁻⁵), whereas abnormalities in the production or absorption of another SCFA, propionate, were causally related to an increased risk of T2D (P = 0.004). These data provide evidence of a causal effect of the gut microbiome on metabolic traits and support the use of MR as a means to elucidate causal relationships from microbiome-wide association findings.

1,216 citations

Journal Article•10.1038/S41588-019-0481-0•

A global overview of pleiotropy and genetic architecture in complex traits

Kyoko Watanabe¹, Sven Stringer¹, Oleksandr Frei², Maša Umićević Mirkov¹, Christiaan de Leeuw¹, Tinca J. C. Polderman¹, Sophie van der Sluis¹, Ole A. Andreassen²,³, Benjamin M. Neale⁴,⁵, Danielle Posthuma•Institutions (5)

VU University Amsterdam¹, University of Oslo², Oslo University Hospital³, Broad Institute⁴, Harvard University⁵

19 Aug 2019-Nature Genetics

TL;DR: It is shown that trait-associated loci cover more than half of the genome, and 90% of these overlap with loci from multiple traits, which provides insights into how genetic variation contributes to trait variation.

Abstract: After a decade of genome-wide association studies (GWASs), fundamental questions in human genetics, such as the extent of pleiotropy across the genome and variation in genetic architecture across traits, are still unanswered. The current availability of hundreds of GWASs provides a unique opportunity to address these questions. We systematically analyzed 4,155 publicly available GWASs. For a subset of well-powered GWASs on 558 traits, we provide an extensive overview of pleiotropy and genetic architecture. We show that trait-associated loci cover more than half of the genome, and 90% of these overlap with loci from multiple traits. We find that potential causal variants are enriched in coding and flanking regions, as well as in regulatory elements, and show variation in polygenicity and discoverability of traits. Our results provide insights into how genetic variation contributes to trait variation. All GWAS results can be queried and visualized at the GWAS ATLAS resource (https://atlas.ctglab.nl).

1,189 citations

Journal Article•10.1038/S41588-019-0538-0•

Activity-by-contact model of enhancer-promoter regulation from thousands of CRISPR perturbations.

Charles P. Fulco¹,², Joseph Nasser¹, Thouis R. Jones¹, Glen Munson¹, Drew T. Bergman¹, Vidya Subramanian¹, Sharon R. Grossman¹,³, Rockwell Anyoha¹, Benjamin R. Doughty¹, Tejal A. Patwardhan¹, Tung T. Nguyen¹, Michael Kane¹, Elizabeth M. Perez¹, Neva C. Durand, Caleb A. Lareau¹, Elena K. Stamenova¹, Erez Lieberman Aiden, Eric S. Lander¹,²,³, Jesse M. Engreitz¹,²•Institutions (3)

Broad Institute¹, Harvard University², Massachusetts Institute of Technology³

03 Jul 2019-Nature Genetics

TL;DR: A simple activity-by-contact model substantially outperformed previous methods at predicting the complex connections in the CRISPR dataset and allows systematic mapping of enhancer–gene connections in a given cell type, on the basis of chromatin-state measurements.

Abstract: Enhancer elements in the human genome control how genes are expressed in specific cell types and harbor thousands of genetic variants that influence risk for common diseases¹⁻⁴. Yet, we still do not know how enhancers regulate specific genes, and we lack general rules to predict enhancer-gene connections across cell types⁵,⁶. We developed an experimental approach, CRISPRi-FlowFISH, to perturb enhancers in the genome, and we applied it to test >3,500 potential enhancer-gene connections for 30 genes. We found that a simple activity-by-contact model substantially outperformed previous methods at predicting the complex connections in our CRISPR dataset. This activity-by-contact model allows us to construct genome-wide maps of enhancer-gene connections in a given cell type, on the basis of chromatin state measurements. Together, CRISPRi-FlowFISH and the activity-by-contact model provide a systematic approach to map and predict which enhancers regulate which genes, and will help to interpret the functions of the thousands of disease risk variants in the noncoding genome.
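
The abstract does not spell out the activity-by-contact calculation. As a rough sketch of the idea the name suggests, each candidate element's score for a gene can be taken as its activity multiplied by its contact frequency with the gene's promoter, divided by the sum of that product over all candidate elements near the gene. The function name `abc_scores`, the input layout and the numbers below are hypothetical, and the choice of activity and contact measurements is left open here.

```python
def abc_scores(elements):
    """Compute activity-by-contact style scores for candidate elements near one gene.

    `elements` maps an element name to (activity, contact) for that gene
    (hypothetical layout). Each score is activity * contact divided by the
    sum of activity * contact over all candidates, so the scores sum to 1.
    """
    products = {name: activity * contact
                for name, (activity, contact) in elements.items()}
    total = sum(products.values())
    if total == 0:
        # No element has any signal; return zeros rather than dividing by zero.
        return {name: 0.0 for name in products}
    return {name: p / total for name, p in products.items()}

# Made-up numbers: e3 is weakly active but in frequent contact with the promoter.
candidates = {"e1": (10.0, 0.5), "e2": (4.0, 0.25), "e3": (2.0, 2.0)}
scores = abc_scores(candidates)
```

With these made-up inputs the products are 5, 1 and 4, so the relative scores are 0.5, 0.1 and 0.4. A weakly active element in frequent contact can therefore outrank a more active element that rarely contacts the promoter.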

Journal Article•10.1038/S41588-019-0439-2•

Genome-wide association study identifies eight risk loci and implicates metabo-psychiatric origins for anorexia nervosa

Hunna J. Watson¹,²,³, Zeynep Yilmaz³ +255 more•Institutions (99)

01 Aug 2019-Nature Genetics

TL;DR: The genetic architecture of anorexia nervosa mirrors its clinical presentation, showing significant genetic correlations with psychiatric disorders, physical activity, and metabolic (including glycemic), lipid and anthropometric traits, independent of the effects of common variants associated with body-mass index.

Abstract: Characterized primarily by a low body-mass index, anorexia nervosa is a complex and serious illness¹, affecting 0.9–4% of women and 0.3% of men²⁻⁴, with twin-based heritability estimates of 50–60%⁵. Mortality rates are higher than those in other psychiatric disorders⁶, and outcomes are unacceptably poor⁷. Here we combine data from the Anorexia Nervosa Genetics Initiative (ANGI)⁸,⁹ and the Eating Disorders Working Group of the Psychiatric Genomics Consortium (PGC-ED) and conduct a genome-wide association study of 16,992 cases of anorexia nervosa and 55,525 controls, identifying eight significant loci. The genetic architecture of anorexia nervosa mirrors its clinical presentation, showing significant genetic correlations with psychiatric disorders, physical activity, and metabolic (including glycemic), lipid and anthropometric traits, independent of the effects of common variants associated with body-mass index. These results further encourage a reconceptualization of anorexia nervosa as a metabo-psychiatric disorder. Elucidating the metabolic component is a critical direction for future research, and paying attention to both psychiatric and metabolic components may be key to improving outcomes.

Journal Article•10.1038/S41588-019-0371-5•

Gossypium barbadense and Gossypium hirsutum genomes provide insights into the origin and evolution of allotetraploid cotton.

Yan Hu¹,², Jiedan Chen¹, Lei Fang¹,², Zhiyuan Zhang¹, Wei Ma¹, Yongchao Niu, Longzhen Ju², Jieqiong Deng², Ting Zhao¹,², Jinmin Lian, Kobi Baruch, David D. Fang³, Xia Liu, Yong-Ling Ruan¹,⁴, Mehboob-ur Rahman⁵, Jinlei Han⁶, Kai Wang⁶, Qiong Wang², Huaitong Wu², Gaofu Mei², Yihao Zang², Zegang Han², Chenyu Xu², Weijuan Shen², Duofeng Yang², Zhanfeng Si¹, Fan Dai¹, Liangfeng Zou, Fei Huang, Bai Yulin, Yu-Gao Zhang, Avital Brodt, Hilla Ben-Hamo, Xiefei Zhu², Baoliang Zhou², Xueying Guan¹,², Shuijin Zhu¹, Xiao-Ya Chen⁷, Tianzhen Zhang¹,²•Institutions (7)

Zhejiang University¹, Nanjing Agricultural University², United States Department of Agriculture³, University of Newcastle⁴, National Institute for Biotechnology and Genetic Engineering⁵, Fujian Agriculture and Forestry University⁶, Chinese Academy of Sciences⁷

18 Mar 2019-Nature Genetics

TL;DR: High-quality de novo–assembled genomes of two cultivated allotetraploid cotton species and whole-genome comparative analyses provide insights into the evolution of cotton genomes and improvement of fiber quality and resilience to stress.

Abstract: Allotetraploid cotton is an economically important natural-fiber-producing crop worldwide. After polyploidization, Gossypium hirsutum L. evolved to produce a higher fiber yield and to better survive harsh environments than Gossypium barbadense, which produces superior-quality fibers. The global genetic and molecular bases for these interspecies divergences were unknown. Here we report high-quality de novo-assembled genomes for these two cultivated allotetraploid species with pronounced improvement in repetitive-DNA-enriched centromeric regions. Whole-genome comparative analyses revealed that species-specific alterations in gene expression, structural variations and expanded gene families were responsible for speciation and the evolutionary history of these species. These findings help to elucidate the evolution of cotton genomes and their domestication history. The information generated not only should enable breeders to improve fiber quality and resilience to ever-changing environmental conditions but also can be translated to other crops for better understanding of their domestication history and use in improvement.

Journal Article•10.1038/S41588-019-0385-Z•

Opportunities and challenges for transcriptome-wide association studies.

Michael Wainberg¹, Nasa Sinnott-Armstrong¹, Nicholas Mancuso², Alvaro N. Barbeira³, David A. Knowles⁴, David E. Golan¹, Raili Ermel⁵, Arno Ruusalepp⁵, Thomas Quertermous¹, Ke Hao⁶, Johan Björkegren, Hae Kyung Im³, Bogdan Pasaniuc², Manuel A. Rivas¹, Anshul Kundaje¹•Institutions (6)

Stanford University¹, University of California, Los Angeles², University of Chicago³, Columbia University⁴, Tartu University Hospital⁵, Icahn School of Medicine at Mount Sinai⁶

29 Mar 2019-Nature Genetics

TL;DR: Properties of TWAS are explored as a potential approach to prioritize causal genes at GWAS loci, by using simulations and case studies of literature-curated candidate causal genes for schizophrenia, low-density-lipoprotein cholesterol and Crohn’s disease.

Abstract: Transcriptome-wide association studies (TWAS) integrate genome-wide association studies (GWAS) and gene expression datasets to identify gene-trait associations. In this Perspective, we explore properties of TWAS as a potential approach to prioritize causal genes at GWAS loci, by using simulations and case studies of literature-curated candidate causal genes for schizophrenia, low-density-lipoprotein cholesterol and Crohn's disease. We explore risk loci where TWAS accurately prioritizes the likely causal gene as well as loci where TWAS prioritizes multiple genes, some likely to be non-causal, owing to sharing of expression quantitative trait loci (eQTL). TWAS is especially prone to spurious prioritization with expression data from non-trait-related tissues or cell types, owing to substantial cross-cell-type variation in expression levels and eQTL strengths. Nonetheless, TWAS prioritizes candidate causal genes more accurately than simple baselines. We suggest best practices for causal-gene prioritization with TWAS and discuss future opportunities for improvement. Our results showcase the strengths and limitations of using eQTL datasets to determine causal genes at GWAS loci.

Journal Article•10.1038/S41588-018-0295-5•

A primer on deep learning in genomics.

[...]

James Zou¹, Mikael Huss², Abubakar Abid¹, Pejman Mohammadi³, Ali Torkamani⁴, Ali Torkamani³, Amalio Telenti⁴, Amalio Telenti³•Institutions (4)

Stanford University¹, Karolinska Institutet², Scripps Health³, Scripps Research Institute⁴

01 Jan 2019-Nature Genetics

TL;DR: A perspective and primer on deep learning applications for genome analysis and successful applications in the fields of regulatory genomics, variant calling and pathogenicity scores are provided.

Abstract: Deep learning methods are a class of machine learning techniques capable of identifying highly complex patterns in large datasets. Here, we provide a perspective and primer on deep learning applications for genome analysis. We discuss successful applications in the fields of regulatory genomics, variant calling and pathogenicity scores. We include general guidance for how to effectively use deep learning methods as well as a practical guide to tools and resources. This primer is accompanied by an interactive online tutorial.
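
The abstract does not say how sequence data are fed to a model. As a minimal sketch, assuming the common convention of one-hot encoding DNA before passing it to a sequence-based network, the input representation could look like this (the function name and the A, C, G, T channel order are choices made for this illustration, not taken from the paper):

```python
# Illustrative sketch: one-hot encoding of a DNA sequence, a common input
# representation for sequence-based deep learning models.
# ALPHABET order and the function name are assumptions of this example.
ALPHABET = "ACGT"


def one_hot_encode(sequence):
    """Return one 4-element row per base; unknown bases (e.g. N) become all zeros."""
    rows = []
    for base in sequence.upper():
        row = [0, 0, 0, 0]
        index = ALPHABET.find(base)
        if index >= 0:
            row[index] = 1
        rows.append(row)
    return rows


# Example usage: print each base next to its encoded row.
for base, row in zip("ACGTN", one_hot_encode("ACGTN")):
    print(base, row)
```

Encoding ambiguous bases as all-zero rows is one of several possible conventions; others spread the weight uniformly across the four channels.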

Journal Article•10.1038/S41588-018-0333-3•

Genome-wide analysis of insomnia in 1,331,010 individuals identifies new risk loci and functional pathways

[...]

Philip R. Jansen¹, Kyoko Watanabe¹, Sven Stringer¹, Nathan G. Skene², Julien Bryois², Anke R. Hammerschlag¹, C. A. de Leeuw¹, Jeroen S. Benjamins³, Ana B. Muñoz-Manchado², Mats Nagel¹, Mats Nagel⁴, Jeanne E. Savage¹, Henning Tiemeier⁵, Tonya White⁵, Joyce Y. Tung, David A. Hinds, Vacic, Xin Wang, Patrick F. Sullivan⁶, S. van der Sluis¹, August B. Smit¹, Jens Hjerling-Leffler², E. J. W. Van Someren⁷, Danielle Posthuma⁴, Danielle Posthuma¹•Institutions (7)

VU University Amsterdam¹, Karolinska Institutet², Utrecht University³, VU University Medical Center⁴, Erasmus University Medical Center⁵, UCL Institute of Neurology⁶, Royal Netherlands Academy of Arts and Sciences⁷

25 Feb 2019-Nature Genetics

TL;DR: A large genetic association sample is used to detect novel loci and gain insight into the pathways, tissue and cell types involved in insomnia complaints, identifying 202 loci implicating 956 genes through positional, expression quantitative trait loci, and chromatin mapping.

Abstract: Insomnia is the second most prevalent mental disorder, with no sufficient treatment available. Despite substantial heritability, insight into the associated genes and neurobiological pathways remains limited. Here, we use a large genetic association sample (n = 1,331,010) to detect novel loci and gain insight into the pathways, tissue and cell types involved in insomnia complaints. We identify 202 loci implicating 956 genes through positional, expression quantitative trait loci, and chromatin mapping. The meta-analysis explained 2.6% of the variance. We show gene set enrichments for the axonal part of neurons, cortical and subcortical tissues, and specific cell types, including striatal, hypothalamic, and claustrum neurons. We found considerable genetic correlations with psychiatric traits and sleep duration, and modest correlations with other sleep-related traits. Mendelian randomization identified the causal effects of insomnia on depression, diabetes, and cardiovascular disease, and the protective effects of educational attainment and intracranial volume. Our findings highlight key brain areas and cell types implicated in insomnia, and provide new treatment targets.

Journal Article•10.1038/S41588-019-0407-X•

A catalog of genetic loci associated with kidney function from analyses of a million individuals

[...]

V. A. Million Veteran Program¹•Institutions (1)

University of Freiburg¹

31 May 2019-Nature Genetics

TL;DR: Pathway and enrichment analyses, including mouse models with renal phenotypes, support the kidney as the main target organ and provide a comprehensive priority list of molecular targets for translational research.

Abstract: Chronic kidney disease (CKD) is responsible for a public health burden with multi-systemic complications. Through trans-ancestry meta-analysis of genome-wide association studies of estimated glomerular filtration rate (eGFR) and independent replication (n = 1,046,070), we identified 264 associated loci (166 new). Of these, 147 were likely to be relevant for kidney function on the basis of associations with the alternative kidney function marker blood urea nitrogen (n = 416,178). Pathway and enrichment analyses, including mouse models with renal phenotypes, support the kidney as the main target organ. A genetic risk score for lower eGFR was associated with clinically diagnosed CKD in 452,264 independent individuals. Colocalization analyses of associations with eGFR among 783,978 European-ancestry individuals and gene expression across 46 human tissues, including tubulo-interstitial and glomerular kidney compartments, identified 17 genes differentially expressed in kidney. Fine-mapping highlighted missense driver variants in 11 genes and kidney-specific regulatory variants. These results provide a comprehensive priority list of molecular targets for translational research.
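
The abstract does not spell out how the genetic risk score was built. A minimal sketch, assuming the usual construction as a weighted sum of effect-allele dosages (the function name, dosages and weights below are hypothetical and not taken from the study):

```python
# Illustrative sketch: a weighted genetic risk score for one individual.
# All names and numbers are hypothetical, not values from the paper.
def genetic_risk_score(dosages, weights):
    """Sum of effect-allele dosages (0 to 2 per variant) times per-variant weights."""
    if len(dosages) != len(weights):
        raise ValueError("dosages and weights must have the same length")
    return sum(d * w for d, w in zip(dosages, weights))


# Example usage with three made-up variants.
example_dosages = [0, 1, 2]          # copies of the effect allele carried
example_weights = [0.10, 0.20, 0.30]  # per-allele effect sizes from a GWAS
print(genetic_risk_score(example_dosages, example_weights))
```

In practice the weights would be per-allele effect estimates from the discovery meta-analysis, and the score would be computed in samples independent of that discovery set.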

Journal Article•10.1038/S41588-018-0302-X•

An atlas of genetic influences on osteoporosis in humans and mice

[...]

John A. Morris¹, John P. Kemp², John P. Kemp³, Scott E. Youlten⁴, Laetitia Laurent¹, John G. Logan⁵, Ryan C. Chai⁴, Nicholas A. Vulpescu⁶, Vincenzo Forgetta¹, Aaron Kleinman, Sindhu T. Mohanty⁴, C. Marcelo Sergio⁴, Julian M.W. Quinn⁴, Loan Nguyen-Yamamoto⁷, Aimee Lee Luco⁷, Jinchu Vijay¹, Marie-Michelle Simon¹, Albena Pramatarova¹, Carolina Medina-Gomez⁸, Katerina Trajanoska⁸, Elena J. Ghirardello⁵, Natalie C. Butterfield⁵, Katharine F. Curry⁵, Victoria D. Leitch⁵, Penny C. Sparkes⁵, Anne-Tounsia Adoum⁵, Naila S. Mannan⁵, Davide Komla-Ebri⁵, Andrea S. Pollard⁵, Hannah F. Dewhurst⁵, Thomas A D Hassall³, Michael-John G. Beltejar⁹, Douglas J. Adams¹⁰, Suzanne M. Vaillancourt¹, Stephen Kaptoge¹¹, Paul A. Baldock⁴, Cyrus Cooper¹², Cyrus Cooper¹³, Cyrus Cooper¹⁴, Jonathan Reeve¹³, Evangelia E. Ntzani¹⁵, Evangelia E. Ntzani¹⁶, Evangelos Evangelou⁵, Evangelos Evangelou¹⁶, Claes Ohlsson¹⁷, David Karasik, Fernando Rivadeneira⁸, Douglas P. Kiel, Jonathan H Tobias², Celia L Gregson², Nicholas C. Harvey¹⁴, Nicholas C. Harvey¹², Elin Grundberg¹⁸, Elin Grundberg¹, David Goltzman⁷, David J. Adams¹⁹, Christopher J. Lelliott¹⁹, David A. Hinds, Cheryl L. Ackert-Bicknell⁹, Yi-Hsiang Hsu, Matthew T. Maurano⁶, Peter I. Croucher⁴, Graham R. Williams⁵, J. H. Duncan Bassett⁵, David M. Evans³, David M. Evans², J. Brent Richards•Institutions (19)

McGill University¹, University of Bristol², University of Queensland³, Garvan Institute of Medical Research⁴, Imperial College London⁵, New York University⁶, McGill University Health Centre⁷, Erasmus University Rotterdam⁸, University of Rochester⁹, Anschutz Medical Campus¹⁰, University of Cambridge¹¹, University Hospital Southampton NHS Foundation Trust¹², University of Oxford¹³, University of Southampton¹⁴, Brown University¹⁵, University of Ioannina¹⁶, University of Gothenburg¹⁷, Children's Mercy Hospital¹⁸, Wellcome Trust Sanger Institute¹⁹

01 Feb 2019-Nature Genetics

TL;DR: This genetic atlas provides evidence linking associated SNPs to causal genes, offers new insight into osteoporosis pathophysiology, and highlights opportunities for drug development.

Abstract: Osteoporosis is a common aging-related disease diagnosed primarily using bone mineral density (BMD). We assessed genetic determinants of BMD as estimated by heel quantitative ultrasound in 426,824 individuals, identifying 518 genome-wide significant loci (301 novel), explaining 20% of its variance. We identified 13 bone fracture loci, all associated with estimated BMD (eBMD), in ~1.2 million individuals. We then identified target genes enriched for genes known to influence bone density and strength (maximum odds ratio (OR) = 58, P = 1 × 10⁻⁷⁵) from cell-specific features, including chromatin conformation and accessible chromatin sites. We next performed rapid-throughput skeletal phenotyping of 126 knockout mice with disruptions in predicted target genes and found an increased abnormal skeletal phenotype frequency compared to 526 unselected lines (P < 0.0001). In-depth analysis of one gene, DAAM2, showed a disproportionate decrease in bone strength relative to mineralization. This genetic atlas provides evidence linking associated SNPs to causal genes, offers new insight into osteoporosis pathophysiology, and highlights opportunities for drug development.

Journal Article•10.1038/S41588-019-0381-3•

Durum wheat genome highlights past domestication signatures and future improvement targets

[...]

Marco Maccaferri¹, Marco Maccaferri², Neil S. Harris³, Sven Twardziok, Raj K. Pasam, Heidrun Gundlach, Manuel Spannagl, Danara Ormanbekova¹, Thomas Lux, Verena M. Prade, Sara Giulia Milner⁴, Axel Himmelbach⁴, Martin Mascher⁴, Paolo Bagnaresi², Primetta Faccioli², Paolo Cozzi⁵, Massimiliano Lauria⁵, Barbara Lazzari⁵, Alessandra Stella⁵, Andrea Manconi⁵, Matteo Gnocchi⁵, Marco Moscatelli⁵, Raz Avni⁶, Jasline Deek⁶, Sezgi Biyiklioglu⁷, Elisabetta Frascaroli¹, Simona Corneti¹, Silvio Salvi¹, Gabriella Sonnante⁵, Francesca Desiderio², Caterina Marè², Cristina Crosatti², E. Mica², Hakan Özkan⁸, Benjamin Kilian, Pasquale De Vita², Daniela Marone², Reem Joukhadar⁹, Elisabetta Mazzucotelli², Domenica Nigro¹⁰, Agata Gadaleta¹⁰, Shiaoman Chao⁸, Justin D. Faris⁸, Arthur T. O. Melo¹¹, Michael O. Pumphrey¹², Nicola Pecchioni², Luciano Milanesi⁵, Krystalee Wiebe¹³, Jennifer Ens¹³, Ron MacLachlan¹³, John M. Clarke¹³, Andrew G. Sharpe¹³, Chu Shin Koh¹³, Kevin Y. H. Liang³, Gregory J. Taylor³, Ron Knox¹⁴, Hikmet Budak⁷, Anna M. Mastrangelo², Steven S. Xu⁸, Nils Stein⁴, Iago Hale¹¹, Assaf Distelfeld⁶, Matthew J. Hayden⁹, Roberto Tuberosa¹, Sean Walkowiak¹³, Klaus F. X. Mayer¹⁵, Aldo Ceriotti⁵, Curtis J. Pozniak¹³, Luigi Cattivelli²•Institutions (15)

University of Bologna¹, Council for Agricultural Research and Economics (CREA)², University of Alberta³, Leibniz Association⁴, National Research Council⁵, Tel Aviv University⁶, Montana State University⁷, United States Department of Agriculture⁸, La Trobe University⁹, University of Bari¹⁰, University of New Hampshire¹¹, Washington State University¹², University of Saskatchewan¹³, Agriculture and Agri-Food Canada¹⁴, Technische Universität München¹⁵

08 Apr 2019-Nature Genetics

TL;DR: The assembly of the genome of durum wheat cultivar Svevo enables genome-wide genetic diversity analyses highlighting modifications imposed by thousands of years of empirical selection and breeding.

Abstract: The domestication of wild emmer wheat led to the selection of modern durum wheat, grown mainly for pasta production. We describe the 10.45 gigabase (Gb) assembly of the genome of durum wheat cultivar Svevo. The assembly enabled genome-wide genetic diversity analyses revealing the changes imposed by thousands of years of empirical selection and breeding. Regions exhibiting strong signatures of genetic divergence associated with domestication and breeding were widespread in the genome with several major diversity losses in the pericentromeric regions. A locus on chromosome 5B carries a gene encoding a metal transporter (TdHMA3-B1) with a non-functional variant causing high accumulation of cadmium in grain. The high-cadmium allele, widespread among durum cultivars but undetected in wild emmer accessions, increased in frequency from domesticated emmer to modern durum wheat. The rapid cloning of TdHMA3-B1 rescues a wild beneficial allele and demonstrates the practical use of the Svevo genome for wheat improvement. Genome assembly of durum wheat cultivar Svevo enables genome-wide genetic diversity analyses highlighting modifications imposed by thousands of years of empirical selection and breeding.

Journal Article•10.1038/S41588-019-0405-Z•

The genome sequence of segmental allotetraploid peanut Arachis hypogaea

[...]

David J. Bertioli¹, Jerry Jenkins, Josh Clevenger¹, Olga Dudchenko², Dongying Gao¹, Guillermo Seijo³, Soraya C. M. Leal-Bertioli¹, Longhui Ren⁴, Andrew Farmer⁵, Manish K. Pandey⁶, Sergio Sebastián Samoluk³, Brian Abernathy¹, Gaurav Agarwal¹, Carolina Ballén-Taborda¹, Connor Cameron⁵, Jacqueline D. Campbell⁴, Carolina Chavarro¹, Annapurna Chitikineni⁶, Ye Chu¹, Sudhansu Dash⁵, Moaine El Baidouri⁷, Moaine El Baidouri⁸, Baozhu Guo⁹, Wei Huang⁴, Kyung Do Kim¹⁰, Kyung Do Kim¹, Walid Korani¹, Sophie Lanciano¹¹, Sophie Lanciano⁸, Christopher Lui², Marie Mirouze¹¹, Marie Mirouze⁸, Márcio C. Moretzsohn¹², Melanie Pham², Jin Hee Shin¹, Jin Hee Shin¹⁰, Kenta Shirasawa, Senjuti Sinharoy, Avinash Sreedasyam, Nathan T. Weeks⁹, Xinyou Zhang¹³, Zheng Zheng¹³, Ziqi Sun¹³, Lutz Froenicke¹⁴, Erez Lieberman Aiden², Richard W Michelmore¹⁴, Rajeev K. Varshney⁶, C. Corley Holbrook⁹, Ethalinda K. S. Cannon⁴, Brian E. Scheffler⁹, Jane Grimwood, Peggy Ozias-Akins¹, Steven B. Cannon⁹, Scott A. Jackson¹, Jeremy Schmutz¹⁵•Institutions (15)

University of Georgia¹, Baylor College of Medicine², Instituto de Botánica del Nordeste³, Iowa State University⁴, National Center for Genome Resources⁵, International Crops Research Institute for the Semi-Arid Tropics⁶, Centre national de la recherche scientifique⁷, University of Perpignan⁸, United States Department of Agriculture⁹, LG Chem¹⁰, University of Montpellier¹¹, Empresa Brasileira de Pesquisa Agropecuária¹², Crops Research Institute¹³, University of California, Davis¹⁴, Joint Genome Institute¹⁵

01 May 2019-Nature Genetics

TL;DR: The genome sequence of segmental allotetraploid peanut is reported and suggests that diversity generated by genetic deletions and homeologous recombination helped to favor the domestication of Arachis hypogaea over its diploid relatives.

Abstract: Like many other crops, the cultivated peanut (Arachis hypogaea L.) is of hybrid origin and has a polyploid genome that contains essentially complete sets of chromosomes from two ancestral species. Here we report the genome sequence of peanut and show that after its polyploid origin, the genome has evolved through mobile-element activity, deletions and by the flow of genetic information between corresponding ancestral chromosomes (that is, homeologous recombination). Uniformity of patterns of homeologous recombination at the ends of chromosomes favors a single origin for cultivated peanut and its wild counterpart A. monticola. However, through much of the genome, homeologous recombination has created diversity. Using new polyploid hybrids made from the ancestral species, we show how this can generate phenotypic changes such as spontaneous changes in the color of the flowers. We suggest that diversity generated by these genetic mechanisms helped to favor the domestication of the polyploid A. hypogaea over other diploid Arachis species cultivated by humans.

Journal Article•10.1038/S41588-019-0402-2•

The genome of cultivated peanut provides insight into legume karyotypes, polyploid evolution and crop domestication

[...]

Weijian Zhuang¹, Chen Hua¹, Meng Yang, Jianping Wang¹, Jianping Wang², Manish K. Pandey³, Zhang Chong¹, Wen Chi Chang⁴, Liangsheng Zhang¹, Xingtan Zhang¹, Tang Ronghua, Vanika Garg³, Xingjun Wang, Haibao Tang¹, Chi Nga Chow⁴, Jinpeng Wang⁵, Ye Deng¹, Depeng Wang, Aamir W. Khan³, Aamir W. Khan⁶, Qiang Yang¹, Tiecheng Cai¹, Prasad Bajaj³, Kangcheng Wu¹, Baozhu Guo¹, Baozhu Guo⁷, Xinyou Zhang, Jingjing Li, Fan Liang, Jiang Hu, Boshou Liao⁸, Shengyi Liu⁸, Annapurna Chitikineni³, Hansong Yan¹, Yixiong Zheng¹, Yixiong Zheng⁹, Shihua Shan, Qinzheng Liu¹, Dongyang Xie¹, Zhenyi Wang⁵, Shahid Ali Khan¹, Niaz Ali¹, Chuanzhi Zhao, Xinguo Li, Ziliang Luo², Shubiao Zhang¹, Ruirong Zhuang¹, Ze Peng², Shuaiyin Wang¹, Gandeka Mamadou¹, Yuhui Zhuang¹, Yuhui Zhuang¹⁰, Zifan Zhao², Weichang Yu¹¹, Faqian Xiong, Weipeng Quan, Mei Yuan, Yu Li¹, Huasong Zou¹, Han Xia, Li Zha¹, Junpeng Fan, Jigao Yu⁵, Wenping Xie¹, Jiaqing Yuan⁵, Kun Chen¹, Shanshan Zhao¹, Wenting Chu¹, Yuting Chen¹, Pengchuan Sun⁵, Fanbo Meng⁵, Tao Zhuo¹, Yuhao Zhao⁵, Chunjuan Li, Guohao He¹², Yongli Zhao¹², Congcong Wang⁹, P. B. KaviKishor¹³, Rong Long Pan¹, Rong Long Pan¹⁴, Andrew H. Paterson¹⁵, Andrew H. Paterson⁵, Xiyin Wang⁵, Ray Ming¹, Ray Ming¹⁶, Rajeev K. Varshney³, Rajeev K. Varshney⁶•Institutions (16)

Fujian Agriculture and Forestry University¹, University of Florida², International Crops Research Institute for the Semi-Arid Tropics³, National Cheng Kung University⁴, North China University of Science and Technology⁵, University of Western Australia⁶, Agricultural Research Service⁷, Crops Research Institute⁸, Zhongkai University of Agriculture and Engineering⁹, Tsinghua University¹⁰, Shenzhen University¹¹, Tuskegee University¹², Osmania University¹³, National Tsing Hua University¹⁴, Plant Genome Mapping Laboratory¹⁵, University of Illinois at Urbana–Champaign¹⁶

01 May 2019-Nature Genetics

TL;DR: High-quality genome sequence of cultivated peanut provides insights into genome evolution and the genetic mechanisms underlying seed size and leaf resistance in peanut, providing a cornerstone for functional genomics and peanut improvement.

Abstract: High oil and protein content make tetraploid peanut a leading oil and food legume. Here we report a high-quality peanut genome sequence, comprising 2.54 Gb with 20 pseudomolecules and 83,709 protein-coding gene models. We characterize gene functional groups implicated in seed size evolution, seed oil content, disease resistance and symbiotic nitrogen fixation. The peanut B subgenome has more genes and general expression dominance, temporally associated with long-terminal-repeat expansion in the A subgenome that also raises questions about the A-genome progenitor. The polyploid genome provided insights into the evolution of Arachis hypogaea and other legume chromosomes. Resequencing of 52 accessions suggests that independent domestications formed peanut ecotypes. Whereas 0.42–0.47 million years ago (Ma) polyploidy constrained genetic variation, the peanut genome sequence aids mapping and candidate-gene discovery for traits such as seed size and color, foliar disease resistance and others, also providing a cornerstone for functional genomics and peanut improvement.

Journal Article•10.1038/S41588-018-0318-2•

Molecular landmarks of tumor hypoxia across cancer types.

[...]

Vinayak Bhandari¹, Vinayak Bhandari², Christianne Hoey¹, Christianne Hoey³, Lydia Y Liu², Lydia Y Liu¹, Emilie Lalonde², Emilie Lalonde¹, Jessica Ray³, Jessica Ray¹, Julie Livingstone², Robert Lesurf², Yu-Jia Shiah², Tina Vujcic³, Xiaoyong Huang³, Shadrielle Melijah G. Espiritu², Lawrence E. Heisler², Fouad Yousif², Vincent Huang², Takafumi N. Yamaguchi², Cindy Q. Yao², Veronica Y. Sabelnykova², Michael Fraser², Melvin L.K. Chua⁴, Theodorus van der Kwast⁵, Stanley K. Liu¹, Stanley K. Liu³, Paul C. Boutros, Robert G. Bristow•Institutions (5)

University of Toronto¹, Ontario Institute for Cancer Research², Sunnybrook Health Sciences Centre³, National University of Singapore⁴, University Health Network⁵

14 Jan 2019-Nature Genetics

TL;DR: It is established that tumor hypoxia may drive aggressive molecular features across cancers and shape the clinical trajectory of individual tumors.

Abstract: Many primary-tumor subregions have low levels of molecular oxygen, termed hypoxia. Hypoxic tumors are at elevated risk for local failure and distant metastasis, but the molecular hallmarks of tumor hypoxia remain poorly defined. To fill this gap, we quantified hypoxia in 8,006 tumors across 19 tumor types. In ten tumor types, hypoxia was associated with elevated genomic instability. In all 19 tumor types, hypoxic tumors exhibited characteristic driver-mutation signatures. We observed widespread hypoxia-associated dysregulation of microRNAs (miRNAs) across cancers and functionally validated miR-133a-3p as a hypoxia-modulated miRNA. In localized prostate cancer, hypoxia was associated with elevated rates of chromothripsis, allelic loss of PTEN and shorter telomeres. These associations are particularly enriched in polyclonal tumors, representing a constellation of features resembling tumor nimbosus, an aggressive cellular phenotype. Overall, this work establishes that tumor hypoxia may drive aggressive molecular features across cancers and shape the clinical trajectory of individual tumors.

Journal Article•10.1038/S41588-019-0356-4•

Origin and evolution of the octoploid strawberry genome.

[...]

Patrick P. Edger¹, Thomas J. Poorten², Robert VanBuren¹, Michael A. Hardigan², Marivi Colle¹, Michael R. McKain³, Ronald D. Smith⁴, Scott J. Teresi⁴, Andrew D. L. Nelson⁵, Ching Man Wai¹, Elizabeth I. Alger¹, Kevin A. Bird¹, Alan E. Yocca¹, Nathan Pumplin², Shujun Ou¹, Gil Ben-Zvi, Avital Brodt, Kobi Baruch, Thomas Swale, Lily Shiue, Charlotte B. Acharya², Glenn S. Cole², Jeffrey P. Mower⁶, Kevin L. Childs¹, Ning Jiang¹, Eric Lyons⁵, Michael Freeling⁷, Joshua R. Puzey⁴, Steven J. Knapp²•Institutions (7)

Michigan State University¹, University of California, Davis², University of Alabama³, College of William & Mary⁴, University of Arizona⁵, University of Nebraska–Lincoln⁶, University of California, Berkeley⁷

25 Feb 2019-Nature Genetics

TL;DR: A near-complete chromosome-scale assembly for cultivated octoploid strawberry (Fragaria × ananassa) is reported and the origin and evolutionary processes that shaped this complex allopolyploid are uncovered, providing a useful resource for genome-wide analyses and molecular breeding.

Abstract: Cultivated strawberry emerged from the hybridization of two wild octoploid species, both descendants from the merger of four diploid progenitor species into a single nucleus more than 1 million years ago. Here we report a near-complete chromosome-scale assembly for cultivated octoploid strawberry (Fragaria × ananassa) and uncovered the origin and evolutionary processes that shaped this complex allopolyploid. We identified the extant relatives of each diploid progenitor species and provide support for the North American origin of octoploid strawberry. We examined the dynamics among the four subgenomes in octoploid strawberry and uncovered the presence of a single dominant subgenome with significantly greater gene content, gene expression abundance, and biased exchanges between homoeologous chromosomes, as compared with the other subgenomes. Pathway analysis showed that certain metabolomic and disease-resistance traits are largely controlled by the dominant subgenome. These findings and the reference genome should serve as a powerful platform for future evolutionary studies and enable molecular breeding in strawberry.

Journal Article•10.1038/S41588-019-0512-X•

Comparative genetic architectures of schizophrenia in East Asian and European populations

[...]

Max Lam, Chia-Yen Chen, Zhiqiang Li¹, Alicia R. Martin², Alicia R. Martin³, Julien Bryois⁴, Xixian Ma⁵, Helena Gaspar⁶, Masashi Ikeda⁷, Beben Benyamin⁸, Beben Benyamin⁹, Brielin C. Brown¹⁰, Ruize Liu³, Ruize Liu², Wei Zhou¹, Lili Guan¹¹, Yoichiro Kamatani¹², Sung-Wan Kim¹³, Michiaki Kubo, Agung A.A.A Kusumawardhani¹⁴, Chih-Min Liu¹⁵, Hong Ma¹¹, Sathish Periyasamy⁹, Atsushi Takahashi, Zhida Xu¹⁶, Hao Yu¹¹, Feng Zhu¹⁷, Wei J. Chen¹⁵, Stephen V. Faraone¹⁸, Stephen J. Glatt¹⁹, Lin He¹, Lin He²⁰, Steven E. Hyman³, Steven E. Hyman², Hai-Gwo Hwu¹⁵, Steven A. McCarroll², Steven A. McCarroll³, Benjamin M. Neale², Benjamin M. Neale³, Pamela Sklar²¹, Dieter B. Wildenauer²², Xin Yu¹¹, Dai Zhang¹¹, Bryan J. Mowry⁹, Jimmy Lee, Peter Holmans²³, Shuhua Xu, Patrick F. Sullivan²⁴, Stephan Ripke³, Stephan Ripke², Stephan Ripke²⁵, Michael Conlon O'Donovan²³, Mark J. Daly, Shengying Qin²⁶, Shengying Qin¹, Pak C. Sham²⁷, Pak C. Sham²⁸, Nakao Iwata⁷, Kyung Sue Hong²⁹, Sibylle G. Schwab³⁰, Sibylle G. Schwab³¹, Weihua Yue, Ming T. Tsuang³², Jianjun Liu³³, Jianjun Liu³⁴, Xiancang Ma¹⁷, René S. Kahn²¹, Yongyong Shi, Hailiang Huang², Hailiang Huang³•Institutions (34)

Shanghai Jiao Tong University¹, Broad Institute², Harvard University³, Karolinska Institutet⁴, CAS-MPG Partner Institute for Computational Biology⁵, King's College London⁶, Fujita Health University⁷, University of South Australia⁸, University of Queensland⁹, Columbia University¹⁰, Peking University¹¹, University of Tokyo¹², Chonnam National University¹³, University of Indonesia¹⁴, National Taiwan University¹⁵, Utrecht University¹⁶, Xi'an Jiaotong University¹⁷, State University of New York System¹⁸, State University of New York Upstate Medical University¹⁹, Jinan University²⁰, Icahn School of Medicine at Mount Sinai²¹, University of Western Australia²², Cardiff University²³, University of North Carolina at Chapel Hill²⁴, Charité²⁵, Jining Medical University²⁶, Li Ka Shing Faculty of Medicine, University of Hong Kong²⁷, University of Hong Kong²⁸, Samsung Medical Center²⁹, Illawarra Health & Medical Research Institute³⁰, University of Wollongong³¹, University of California, San Diego³², National University of Singapore³³, Genome Institute of Singapore³⁴

31 Dec 2019-Nature Genetics

TL;DR: The largest study to date of East Asian participants is reported, identifying 21 genome-wide-significant associations in 19 genetic loci associated with schizophrenia and highlighting the importance of including sufficient samples of major ancestral groups to ensure their generalizability across populations.

Abstract: Schizophrenia is a debilitating psychiatric disorder with approximately 1% lifetime risk globally. Large-scale schizophrenia genetic studies have reported primarily on European ancestry samples, potentially missing important biological insights. Here, we report the largest study to date of East Asian participants (22,778 schizophrenia cases and 35,362 controls), identifying 21 genome-wide-significant associations in 19 genetic loci. Common genetic variants that confer risk for schizophrenia have highly similar effects between East Asian and European ancestries (genetic correlation = 0.98 ± 0.03), indicating that the genetic basis of schizophrenia and its biology are broadly shared across populations. A fixed-effect meta-analysis including individuals from East Asian and European ancestries identified 208 significant associations in 176 genetic loci (53 novel). Trans-ancestry fine-mapping reduced the sets of candidate causal variants in 44 loci. Polygenic risk scores had reduced performance when transferred across ancestries, highlighting the importance of including sufficient samples of major ancestral groups to ensure their generalizability across populations.
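
The abstract names a fixed-effect meta-analysis without giving the estimator. A minimal sketch, assuming the standard inverse-variance-weighted form for a single variant (the function name and the per-cohort numbers are hypothetical, not results from the study):

```python
import math


# Illustrative sketch: inverse-variance-weighted fixed-effect meta-analysis
# of one variant's effect across cohorts. Inputs are hypothetical.
def fixed_effect_meta(betas, standard_errors):
    """Return the pooled effect, its standard error and the z-score."""
    if not betas or len(betas) != len(standard_errors):
        raise ValueError("need equally sized, non-empty inputs")
    weights = [1.0 / se ** 2 for se in standard_errors]  # weight = 1 / variance
    total_weight = sum(weights)
    pooled_beta = sum(w * b for w, b in zip(weights, betas)) / total_weight
    pooled_se = math.sqrt(1.0 / total_weight)
    return pooled_beta, pooled_se, pooled_beta / pooled_se


# Example usage: two made-up cohorts with equal precision.
beta, se, z = fixed_effect_meta([0.10, 0.20], [0.05, 0.05])
print(beta, se, z)
```

Under this model each cohort is assumed to estimate the same underlying effect, which fits the abstract's finding of highly similar effects across ancestries; a random-effects model would relax that assumption.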

Journal Article•10.1038/S41588-019-0410-2•

The tomato pan-genome uncovers new genes and a rare allele regulating fruit flavor.

[...]

Lei Gao¹, Itay Gonda², Itay Gonda¹, Honghe Sun¹, Qiyue Ma¹, Kan Bao¹, Denise M. Tieman³, Elizabeth A. Burzynski-Chang⁴, Tara Fish⁵, Kaitlin A. Stromberg¹, Gavin L. Sacks⁴, Theodore W. Thannhauser⁵, Majid R. Foolad⁶, María José Díez⁷, José Blanca⁷, Joaquín Cañizares⁷, Yimin Xu¹, Esther van der Knaap⁸, Sanwen Huang, Harry J. Klee³, James J. Giovannoni⁵, James J. Giovannoni¹, Zhangjun Fei¹, Zhangjun Fei⁵•Institutions (8)

Boyce Thompson Institute for Plant Research¹, Agricultural Research Organization, Volcani Center², University of Florida³, Cornell University⁴, United States Department of Agriculture⁵, Pennsylvania State University⁶, Polytechnic University of Valencia⁷, University of Georgia⁸

13 May 2019-Nature Genetics

TL;DR: A tomato pan-genome constructed using genome sequences of 725 phylogenetically and geographically representative accessions captures 4,873 genes absent from the reference genome and identifies a rare allele of TomLoxC regulating fruit flavor.

Abstract: Modern tomatoes have narrow genetic diversity limiting their improvement potential. We present a tomato pan-genome constructed using genome sequences of 725 phylogenetically and geographically representative accessions, revealing 4,873 genes absent from the reference genome. Presence/absence variation analyses reveal substantial gene loss and intense negative selection of genes and promoters during tomato domestication and improvement. Lost or negatively selected genes are enriched for important traits, especially disease resistance. We identify a rare allele in the TomLoxC promoter selected against during domestication. Quantitative trait locus mapping and analysis of transgenic plants reveal a role for TomLoxC in apocarotenoid production, which contributes to desirable tomato flavor. In orange-stage fruit, accessions harboring both the rare and common TomLoxC alleles (heterozygotes) have higher TomLoxC expression than those homozygous for either and are resurgent in modern tomatoes. The tomato pan-genome adds depth and completeness to the reference genome, and is useful for future biological discovery and breeding.

Journal Article•10.1038/S41588-018-0286-6•

Discovery of common and rare genetic risk variants for colorectal cancer

[...]

Jeroen R. Huyghe¹, Stephanie A. Bien¹, Tabitha A. Harrison¹, Hyun Min Kang² +221 more•Institutions (68)

01 Jan 2019-Nature Genetics

TL;DR: Genome-wide association analyses based on whole-genome sequencing and imputation identify 40 new risk variants for colorectal cancer, including a strongly protective low-frequency variant at CHD1 and loci implicating signaling and immune function in disease etiology.

Abstract: To further dissect the genetic architecture of colorectal cancer (CRC), we performed whole-genome sequencing of 1,439 cases and 720 controls, imputed discovered sequence variants and Haplotype Reference Consortium panel variants into genome-wide association study data, and tested for association in 34,869 cases and 29,051 controls. Findings were followed up in an additional 23,262 cases and 38,296 controls. We discovered a strongly protective 0.3% frequency variant signal at CHD1. In a combined meta-analysis of 125,478 individuals, we identified 40 new independent signals at P < 5 × 10⁻⁸, bringing the number of known independent signals for CRC to ~100. New signals implicate lower-frequency variants, Kruppel-like factors, Hedgehog signaling, Hippo-YAP signaling, long noncoding RNAs and somatic drivers, and support a role for immune function. Heritability analyses suggest that CRC risk is highly polygenic, and larger, more comprehensive studies enabling rare variant analysis will improve understanding of biology underlying this risk and influence personalized screening strategies and drug development.

Journal Article•10.1038/S41588-019-0403-1•

Maternal and fetal genetic effects on birth weight and their relevance to cardio-metabolic risk factors

Nicole M. Warrington¹, Robin N Beaumont², Momoko Horikoshi³, Felix R. Day⁴ +242 more•Institutions (79)

01 May 2019-Nature Genetics

TL;DR: An expanded GWAS of birth weight and subsequent analysis using structural equation modeling and Mendelian randomization decomposes maternal and fetal genetic contributions and causal links between birth weight, blood pressure and glycemic traits.

Abstract: Birth weight variation is influenced by fetal and maternal genetic and non-genetic factors, and has been reproducibly associated with future cardio-metabolic health outcomes. In expanded genome-wide association analyses of own birth weight (n = 321,223) and offspring birth weight (n = 230,069 mothers), we identified 190 independent association signals (129 of which are novel). We used structural equation modeling to decompose the contributions of direct fetal and indirect maternal genetic effects, then applied Mendelian randomization to illuminate causal pathways. For example, both indirect maternal and direct fetal genetic effects drive the observational relationship between lower birth weight and higher later blood pressure: maternal blood pressure-raising alleles reduce offspring birth weight, but only direct fetal effects of these alleles, once inherited, increase later offspring blood pressure. Using maternal birth weight-lowering genotypes to proxy for an adverse intrauterine environment provided no evidence that it causally raises offspring blood pressure, indicating that the inverse birth weight-blood pressure association is attributable to genetic effects, and not to intrauterine programming.

Journal Article•10.1038/S41588-018-0315-5•

PAX5-driven subtypes of B-progenitor acute lymphoblastic leukemia

Zhaohui Gu, Michelle L. Churchman, Kathryn G. Roberts, Ian Moore, Xin Zhou, Joy Nakitandwe, Kohei Hagiwara, Stephane Pelletier, Sebastien Gingras¹, Hartmut Berns, Debbie Payne-Turner, Ashley Hill, Ilaria Iacobucci, Lei Shi, Stanley Pounds, Cheng Cheng, Deqing Pei, Chunxu Qu, Scott Newman, Meenakshi Devidas², Yunfeng Dai², Shalini C. Reshmi³, Julie M. Gastier-Foster³, Elizabeth A. Raetz⁴, Michael J. Borowitz⁵, Brent L. Wood⁶, William L. Carroll⁴, Patrick A. Zweidler-McKay⁷, Karen R. Rabin⁸, Leonard A. Mattano, Kelly W. Maloney⁹, Alessandro Rambaldi, Orietta Spinelli, Jerald P. Radich¹⁰, Mark D. Minden¹¹, Jacob M. Rowe¹², Selina M. Luger¹³, Mark R. Litzow¹⁴, Martin S. Tallman¹⁵, Janis Racevskis¹⁶, Yanming Zhang¹⁵, Ravi Bhatia¹⁷, Jessica Kohlschmidt¹⁸, Krzysztof Mrózek¹⁸, Clara D. Bloomfield¹⁸, Wendy Stock¹⁹, Steven M. Kornblau²⁰, Hagop M. Kantarjian²⁰, Marina Konopleva²⁰, Williams E. Evans, Sima Jeha, Ching-Hon Pui, Jun J. Yang, Elisabeth Paietta¹⁶, James R. Downing, Mary V. Relling, Jinghui Zhang, Mignon L. Loh²¹, Stephen P. Hunger¹³, Charles G. Mullighan - Show less +56 more•Institutions (21)

01 Feb 2019-Nature Genetics

TL;DR: Analysis of 1,988 cases of B-cell acute lymphoblastic leukemia characterizes 23 subtypes defined by genomic features and shows that two of the subtypes have frequent PAX5 alterations, demonstrating the utility of transcriptome sequencing to classify B-ALL.

Abstract: Recent genomic studies have identified chromosomal rearrangements defining new subtypes of B-progenitor acute lymphoblastic leukemia (B-ALL), however many cases lack a known initiating genetic alteration. Using integrated genomic analysis of 1,988 childhood and adult cases, we describe a revised taxonomy of B-ALL incorporating 23 subtypes defined by chromosomal rearrangements, sequence mutations or heterogeneous genomic alterations, many of which show marked variation in prevalence according to age. Two subtypes have frequent alterations of the B lymphoid transcription-factor gene PAX5. One, PAX5alt (7.4%), has diverse PAX5 alterations (rearrangements, intragenic amplifications or mutations); a second subtype is defined by PAX5 p.Pro80Arg and biallelic PAX5 alterations. We show that p.Pro80Arg impairs B lymphoid development and promotes the development of B-ALL with biallelic Pax5 alteration in vivo. These results demonstrate the utility of transcriptome sequencing to classify B-ALL and reinforce the central role of PAX5 as a checkpoint in B lymphoid maturation and leukemogenesis.

Journal Article•10.1038/S41588-018-0262-1•

Comparative genomics of the major parasitic worms

Tim A. Day¹•Institutions (1)

Iowa State University¹

01 Jan 2019-Nature Genetics

TL;DR: A broad comparative study of 81 genomes of parasitic and non-parasitic worms identifies gene family births and hundreds of expanded gene families at key nodes in the phylogeny that are relevant to parasitism and proteins historically targeted for drug development.

Abstract: Parasitic nematodes (roundworms) and platyhelminths (flatworms) cause debilitating chronic infections of humans and animals, decimate crop production and are a major impediment to socioeconomic development. Here we report the broadest comparative study to date of the genomes of parasitic and non-parasitic worms, involving 81 genomes. We have identified gene family births and hundreds of expanded gene families at key nodes in the phylogeny that are relevant to parasitism. Examples include gene families that modulate host immune responses, enable parasite migration through host tissues or allow the parasite to feed. We reveal extensive lineage-specific differences in core metabolism and protein families historically targeted for drug development. From an in silico screen, we have identified and prioritised new potential drug targets and compounds for testing. This comparative genomics resource provides a much needed boost for the research community to understand and combat parasitic worms.

Journal Article•10.1038/S41588-019-0530-8•

A resource-efficient tool for mixed model association analysis of large-scale data

Longda Jiang¹, Zhili Zheng¹,², Ting Qi¹, Kathryn E. Kemper¹, Naomi R. Wray¹, Peter M. Visscher¹, Jian Yang¹,² - Show less +5 more•Institutions (2)

University of Queensland¹, Wenzhou Medical College²

25 Nov 2019-Nature Genetics

TL;DR: An MLM-based tool (fastGWA) is developed that controls for population stratification by principal components and for relatedness by a sparse genetic relationship matrix for GWA analyses of biobank-scale data.

Abstract: The genome-wide association study (GWAS) has been widely used as an experimental design to detect associations between genetic variants and a phenotype. Two major confounding factors, population stratification and relatedness, could potentially lead to inflated GWAS test statistics and hence to spurious associations. Mixed linear model (MLM)-based approaches can be used to account for sample structure. However, genome-wide association (GWA) analyses in biobank samples such as the UK Biobank (UKB) often exceed the capability of most existing MLM-based tools especially if the number of traits is large. Here, we develop an MLM-based tool (fastGWA) that controls for population stratification by principal components and for relatedness by a sparse genetic relationship matrix for GWA analyses of biobank-scale data. We demonstrate by extensive simulations that fastGWA is reliable, robust and highly resource-efficient. We then apply fastGWA to 2,173 traits on array-genotyped and imputed samples from 456,422 individuals and to 2,048 traits on whole-exome-sequenced samples from 46,191 individuals in the UKB.
