Segmental duplication

Topic Tools

Papers published on a yearly basis

Papers

Finishing the euchromatic sequence of the human genome

[...]

21 Oct 2004-Nature

TL;DR: The current human genome sequence (Build 35) as discussed by the authors contains 2.85 billion nucleotides interrupted by only 341 gaps and is accurate to an error rate of approximately 1 event per 100,000 bases.

...read moreread less

Abstract: The sequence of the human genome encodes the genetic instructions for human physiology, as well as rich information about human evolution. In 2001, the International Human Genome Sequencing Consortium reported a draft sequence of the euchromatic portion of the human genome. Since then, the international collaboration has worked to convert this draft into a genome sequence with high accuracy and nearly complete coverage. Here, we report the result of this finishing process. The current genome sequence (Build 35) contains 2.85 billion nucleotides interrupted by only 341 gaps. It covers approximately 99% of the euchromatic genome and is accurate to an error rate of approximately 1 event per 100,000 bases. Many of the remaining euchromatic gaps are associated with segmental duplications and will require focused work with new methods. The near-complete sequence, the first for a vertebrate, greatly improves the precision of biological analyses of the human genome including studies of gene number, birth and death. Notably, the human genome seems to encode only 20,000-25,000 protein-coding genes. The genome sequence reported here should serve as a firm foundation for biomedical research in the decades ahead.

...read moreread less

4,737 citations

Journal Article•10.1038/NATURE05329•

Global variation in copy number in the human genome

[...]

Richard Redon¹, Shumpei Ishikawa², Karen R. Fitch³, Lars Feuk⁴, George H. Perry⁵, T. Daniel Andrews¹, Heike Fiegler¹, Michael H. Shapero³, Andrew R. Carson⁴, Wenwei Chen³, Eun Kyung Cho⁶, Stephanie Dallaire⁶, Jennifer L. Freeman⁶, Juan R. González⁷, Mònica Gratacòs⁷, Jing Huang³, Dimitrios Kalaitzopoulos¹, Daisuke Komura², Jeffrey R. MacDonald⁴, Christian R. Marshall⁴, Rui Mei³, Lyndal Montgomery¹, Keunihiro Nishimura², Kohji Okamura⁴, Fan Shen³, Martin J. Somerville⁸, Joelle Tchinda⁶, Armand Valsesia¹, Cara Woodwark¹, Fengtang Yang¹, Junjun Zhang⁴, Tatiana Zerjal¹, Jane Zhang³, Lluís Armengol⁷, Donald F. Conrad⁹, Xavier Estivill⁷, Chris Tyler-Smith¹, Nigel P. Carter¹, Hiroyuki Aburatani², Charles Lee⁶, Keith W. Jones³, Stephen W. Scherer⁴, Matthew E. Hurles¹ - Show less +39 more•Institutions (9)

Wellcome Trust Sanger Institute¹, University of Tokyo², Thermo Fisher Scientific³, University of Toronto⁴, Brigham and Women's Hospital⁵, Harvard University⁶, Pompeu Fabra University⁷, University of Alberta⁸, University of Chicago⁹

23 Nov 2006-Nature

TL;DR: A first-generation CNV map of the human genome is constructed through the study of 270 individuals from four populations with ancestry in Europe, Africa or Asia, underscoring the importance of CNV in genetic diversity and evolution and the utility of this resource for genetic disease studies.

...read moreread less

Abstract: Copy number variation (CNV) of DNA sequences is functionally significant but has yet to be fully ascertained. We have constructed a first-generation CNV map of the human genome through the study of 270 individuals from four populations with ancestry in Europe, Africa or Asia (the HapMap collection). DNA from these individuals was screened for CNV using two complementary technologies: single-nucleotide polymorphism (SNP) genotyping arrays, and clone-based comparative genomic hybridization. A total of 1,447 copy number variable regions (CNVRs), which can encompass overlapping or adjacent gains or losses, covering 360 megabases (12% of the genome) were identified in these populations. These CNVRs contained hundreds of genes, disease loci, functional elements and segmental duplications. Notably, the CNVRs encompassed more nucleotide content per genome than SNPs, underscoring the importance of CNV in genetic diversity and evolution. The data obtained delineate linkage disequilibrium patterns for many CNVs, and reveal marked variation in copy number among populations. We also demonstrate the utility of this resource for genetic disease studies.

...read moreread less

4,692 citations

Journal Article•10.1371/JOURNAL.PONE.0011147•

progressiveMauve: Multiple Genome Alignment with Gene Gain, Loss and Rearrangement

[...]

Aaron E. Darling¹, Bob Mau¹, Nicole T. Perna¹•Institutions (1)

University of Wisconsin-Madison¹

25 Jun 2010-PLOS ONE

TL;DR: A new method to align two or more genomes that have undergone rearrangements due to recombination and substantial amounts of segmental gain and loss is described, demonstrating high accuracy in situations where genomes have undergone biologically feasible amounts of genome rearrangement, segmental loss and loss.

...read moreread less

Abstract: Background Multiple genome alignment remains a challenging problem. Effects of recombination including rearrangement, segmental duplication, gain, and loss can create a mosaic pattern of homology even among closely related organisms.

...read moreread less

3,833 citations

Journal Article•10.1038/NG1416•

Detection of large-scale variation in the human genome.

[...]

A. John Iafrate¹, Lars Feuk², Miguel Rivera¹, Miguel Rivera³, Marc L. Listewnik¹, Patricia K. Donahoe³, Ying Qi², Stephen W. Scherer², Charles Lee³, Charles Lee¹ - Show less +6 more•Institutions (3)

Brigham and Women's Hospital¹, University of Toronto², Harvard University³

01 Sep 2004-Nature Genetics

TL;DR: This article identified 255 loci across the human genome that contain genomic imbalances among unrelated individuals, and revealed that half of these regions overlap with genes, and many coincide with segmental duplications or gaps in human genome assembly.

...read moreread less

Abstract: We identified 255 loci across the human genome that contain genomic imbalances among unrelated individuals. Twenty-four variants are present in > 10% of the individuals that we examined. Half of these regions overlap with genes, and many coincide with segmental duplications or gaps in the human genome assembly. This previously unappreciated heterogeneity may underlie certain human phenotypic variation and susceptibility to disease and argues for a more dynamic human genome structure.

...read moreread less

3,154 citations

Journal Article•10.1186/1471-2229-4-10•

The roles of segmental and tandem gene duplication in the evolution of large gene families in Arabidopsis thaliana

[...]

Steven B. Cannon¹, Arvind Mitra², Andrew Baumgarten¹, Nevin D. Young¹, Georgiana May¹ - Show less +1 more•Institutions (2)

University of Minnesota¹, Ithaca College²

01 Jun 2004-BMC Plant Biology

TL;DR: Combining information about genomic segmental duplications, gene family phylogenies, and gene positions provides a method to evaluate contributions of tandem duplication and segmental genome duplication in the generation and maintenance of gene families.

...read moreread less

Abstract: Background Most genes in Arabidopsis thaliana are members of gene families. How do the members of gene families arise, and how are gene family copy numbers maintained? Some gene families may evolve primarily through tandem duplication and high rates of birth and death in clusters, and others through infrequent polyploidy or large-scale segmental duplications and subsequent losses.

...read moreread less

2,108 citations

...

Expand

Year	Papers
2026	1
2025	71
2024	45
2023	46
2022	89
2021	75

Topic Tools

Papers published on a yearly basis

Papers

Finishing the euchromatic sequence of the human genome

Global variation in copy number in the human genome

progressiveMauve: Multiple Genome Alignment with Gene Gain, Loss and Rearrangement

Detection of large-scale variation in the human genome.

The roles of segmental and tandem gene duplication in the evolution of large gene families in Arabidopsis thaliana

Related Topics (5)

Performance Metrics