Association mapping

Papers published on a yearly basis

Papers

Journal Article•10.1093/BIOINFORMATICS/BTM308•

TASSEL: software for association mapping of complex traits in diverse samples

Peter J. Bradbury¹, Zhiwu Zhang¹, Dallas E. Kroon¹, Terry M. Casstevens¹, Yogesh Ramdoss¹, Edward S. Buckler¹

United States Department of Agriculture¹

01 Oct 2007-Bioinformatics

TL;DR: TASSEL (Trait Analysis by aSSociation, Evolution and Linkage) implements general linear model and mixed linear model approaches for controlling population and family structure and allows for linkage disequilibrium statistics to be calculated and visualized graphically.

Abstract: Summary: Association analyses that exploit the natural diversity of a genome to map at very high resolutions are becoming increasingly important. In most studies, however, researchers must contend with the confounding effects of both population and family structure. TASSEL (Trait Analysis by aSSociation, Evolution and Linkage) implements general linear model and mixed linear model approaches for controlling population and family structure. For result interpretation, the program allows for linkage disequilibrium statistics to be calculated and visualized graphically. Database browsing and data importation is facilitated by integrated middleware. Other features include analyzing insertions/deletions, calculating diversity statistics, integration of phenotypic and genotypic data, imputing missing data and calculating principal components. Availability: The TASSEL executable, user manual, example data sets and tutorial document are freely available at http://www.maizegenetics.net/tassel. The source code for TASSEL can be found at http://sourceforge.net/projects/tassel.

7,270 citations

Journal Article•10.1093/BIOINFORMATICS/BTR509•

A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data

Heng Li¹

Broad Institute¹

01 Nov 2011-Bioinformatics

TL;DR: This work presents a statistical framework for calling SNPs, discovering somatic mutations, inferring population genetical parameters and performing association tests directly based on sequencing data without explicit genotyping or linkage-based imputation and demonstrates that this method achieves comparable accuracy to alternative methods for estimating site allele count, for inferring allele frequency spectrum and for association mapping.

Abstract: Motivation: Most existing methods for DNA sequence analysis rely on accurate sequences or genotypes. However, in applications of the next-generation sequencing (NGS), accurate genotypes may not be easily obtained (e.g. multi-sample low-coverage sequencing or somatic mutation discovery). These applications press for the development of new methods for analyzing sequence data with uncertainty. Results: We present a statistical framework for calling SNPs, discovering somatic mutations, inferring population genetical parameters and performing association tests directly based on sequencing data without explicit genotyping or linkage-based imputation. On real data, we demonstrate that our method achieves comparable accuracy to alternative methods for estimating site allele count, for inferring allele frequency spectrum and for association mapping. We also highlight the necessity of using symmetric datasets for finding somatic mutations and confirm that for discovering rare events, mismapping is frequently the leading source of errors. Availability: http://samtools.sourceforge.net Contact: hengli@broadinstitute.org

6,706 citations

Journal Article•10.1038/NG1702•

A unified mixed-model method for association mapping that accounts for multiple levels of relatedness

Jianming Yu¹, Gaël Pressoir¹, William H. Briggs², Irie Vroh Bi¹, Masanori Yamasaki³, John Doebley², Michael D. McMullen³,⁴, Brandon S. Gaut⁵, Dahlia M. Nielsen⁶, James B. Holland⁴,⁶, Stephen Kresovich¹, Edward S. Buckler¹,⁴

Cornell University¹, University of Wisconsin-Madison², University of Missouri³, United States Department of Agriculture⁴, University of California, Irvine⁵, North Carolina State University⁶

30 Jan 2006-Nature Genetics

TL;DR: A unified mixed-model approach to account for multiple levels of relatedness simultaneously as detected by random genetic markers is developed and provides a powerful complement to currently available methods for association mapping.

Abstract: As population structure can result in spurious associations, it has constrained the use of association studies in human and plant genetics. Association mapping, however, holds great promise if true signals of functional association can be separated from the vast number of false signals generated by population structure. We have developed a unified mixed-model approach to account for multiple levels of relatedness simultaneously as detected by random genetic markers. We applied this new approach to two samples: a family-based sample of 14 human families, for quantitative gene expression dissection, and a sample of 277 diverse maize inbred lines with complex familial relationships and population structure, for quantitative trait dissection. Our method demonstrates improved control of both type I and type II error rates over other methods. As this new method crosses the boundary between family-based and structured association samples, it provides a powerful complement to currently available methods for association mapping.
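
For orientation, the unified mixed model described in this abstract is usually written in the following form. This is a sketch of the commonly cited notation, not text taken from the abstract; the symbol names are assumptions of this note.

```latex
% Commonly cited form of the unified ("Q + K") mixed model; symbols are assumed here.
% y: phenotype vector; beta: other fixed effects; alpha: marker (SNP) effect;
% v: population-structure effects (Q matrix); u: polygenic background effects;
% e: residuals; K: relative kinship matrix; X, S, Q, Z: incidence/design matrices.
\begin{align}
  \mathbf{y} &= \mathbf{X}\boldsymbol{\beta} + \mathbf{S}\boldsymbol{\alpha}
              + \mathbf{Q}\mathbf{v} + \mathbf{Z}\mathbf{u} + \mathbf{e}, \\
  \operatorname{Var}(\mathbf{u}) &= 2\mathbf{K}V_g, \qquad
  \operatorname{Var}(\mathbf{e}) = \mathbf{R}V_R .
\end{align}
```

The fixed term in Q captures population structure, while the random term with covariance proportional to K captures familial relatedness; this is how the approach handles "multiple levels of relatedness" at once.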

4,143 citations

Journal Article•

Transmission test for linkage disequilibrium: the insulin gene region and insulin-dependent diabetes mellitus (IDDM).

Richard S. Spielman¹, Ralph McGinnis, Warren J. Ewens

University of Pennsylvania¹

01 Mar 1993-American Journal of Human Genetics

TL;DR: The statistical basis for this "transmission test for linkage disequilibrium" (transmission/disequilibrium test [TDT]) is described and the relationship of this test to tests of cosegregation that are based on the proportion of haplotypes or genes identical by descent in affected sibs is shown.

Abstract: A population association has consistently been observed between insulin-dependent diabetes mellitus (IDDM) and the "class 1" alleles of the region of tandem-repeat DNA (5' flanking polymorphism [5'FP]) adjacent to the insulin gene on chromosome 11p. This finding suggests that the insulin gene region contains a gene or genes contributing to IDDM susceptibility. However, several studies that have sought to show linkage with IDDM by testing for cosegregation in affected sib pairs have failed to find evidence for linkage. As means for identifying genes for complex diseases, both the association and the affected-sib-pairs approaches have limitations. It is well known that population association between a disease and a genetic marker can arise as an artifact of population structure, even in the absence of linkage. On the other hand, linkage studies with modest numbers of affected sib pairs may fail to detect linkage, especially if there is linkage heterogeneity. We consider an alternative method to test for linkage with a genetic marker when population association has been found. Using data from families with at least one affected child, we evaluate the transmission of the associated marker allele from a heterozygous parent to an affected offspring. This approach has been used by several investigators, but the statistical properties of the method as a test for linkage have not been investigated. In the present paper we describe the statistical basis for this "transmission test for linkage disequilibrium" (transmission/disequilibrium test [TDT]). We then show the relationship of this test to tests of cosegregation that are based on the proportion of haplotypes or genes identical by descent in affected sibs. The TDT provides strong evidence for linkage between the 5'FP and susceptibility to IDDM. The conclusions from this analysis apply in general to the study of disease associations, where genetic markers are usually closely linked to candidate genes. When a disease is found to be associated with such a marker, the TDT may detect linkage even when haplotype-sharing tests do not.
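
The transmission counting described in this abstract can be illustrated with a minimal sketch. It assumes the commonly cited McNemar-type form of the TDT statistic, (b - c)^2 / (b + c), where b and c count how often heterozygous parents did and did not transmit the associated allele to an affected child. The function name `tdt` and the counts used below are hypothetical and are not data from the paper.

```python
from math import erfc, sqrt


def tdt(transmitted: int, untransmitted: int) -> tuple[float, float]:
    """Return the TDT chi-square statistic (1 df) and its p-value.

    transmitted   -- times heterozygous parents passed the associated allele
                     to an affected offspring (b)
    untransmitted -- times they passed the other allele instead (c)
    """
    total = transmitted + untransmitted
    if total == 0:
        raise ValueError("no transmissions from heterozygous parents")
    chi2 = (transmitted - untransmitted) ** 2 / total
    # Survival function of a chi-square variable with 1 degree of freedom:
    # P(X > x) = erfc(sqrt(x / 2)).
    p_value = erfc(sqrt(chi2 / 2))
    return chi2, p_value


# Hypothetical counts, for illustration only.
chi2, p = tdt(transmitted=60, untransmitted=40)
print(f"chi2 = {chi2:.2f}, p = {p:.4f}")
```

Because only transmissions from heterozygous parents are counted, each family serves as its own control, which is why the test is not misled by the population structure that can produce spurious population-level associations.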

3,982 citations

Journal Article•10.1038/NG2088•

A new multipoint method for genome-wide association studies by imputation of genotypes

Jonathan Marchini¹, Bryan Howie¹, Simon Myers¹, Gil McVean¹, Peter Donnelly¹

University of Oxford¹

01 Jul 2007-Nature Genetics

TL;DR: This work proposes a coherent analysis framework that treats the genome-wide association problem as one involving missing or uncertain genotypes, and proposes a model-based imputation method for inferring genotypes at observed or unobserved SNPs, leading to improved power over existing methods for multipoint association mapping.

Abstract: Genome-wide association studies are set to become the method of choice for uncovering the genetic basis of human diseases. A central challenge in this area is the development of powerful multipoint methods that can detect causal variants that have not been directly genotyped. We propose a coherent analysis framework that treats the problem as one involving missing or uncertain genotypes. Central to our approach is a model-based imputation method for inferring genotypes at observed or unobserved SNPs, leading to improved power over existing methods for multipoint association mapping. Using real genome-wide association study data, we show that our approach (i) is accurate and well calibrated, (ii) provides detailed views of associated regions that facilitate follow-up studies and (iii) can be used to validate and correct data at genotyped markers. A notable future use of our method will be to boost power by combining data from genome-wide scans that use different SNP sets.

2,977 citations

Year	Papers
2025	22
2024	78
2023	63
2022	117
2021	124
2020	134
