Contig Mapping

Topic Tools

Papers published on a yearly basis

Papers

Journal Article•10.1038/NMETH.3444•

A complete bacterial genome assembled de novo using only nanopore sequencing data

[...]

Nicholas J. Loman¹, Joshua Quick¹, Jared T. Simpson²•Institutions (2)

University of Birmingham¹, Ontario Institute for Cancer Research²

01 Aug 2015-Nature Methods

TL;DR: This work has assembled de novo the Escherichia coli K-12 MG1655 chromosome in a single 4.6-Mb contig using only nanopore data and reconstructs gene order and has 99.5% nucleotide identity.

...read moreread less

Abstract: We have assembled de novo the Escherichia coli K-12 MG1655 chromosome in a single 4.6-Mb contig using only nanopore data. Our method has three stages: (i) overlaps are detected between reads and then corrected by a multiple-alignment process; (ii) corrected reads are assembled using the Celera Assembler; and (iii) the assembly is polished using a probabilistic model of the signal-level data. The assembly reconstructs gene order and has 99.5% nucleotide identity.

...read moreread less

1,410 citations

Journal Article•10.1038/NATURE22971•

Improved maize reference genome with single-molecule technologies

[...]

Yinping Jiao¹, Paul Peluso², Jinghua Shi, Tiffany Y. Liang, Michelle C. Stitzer³, Bo Wang¹, Michael S. Campbell¹, Joshua C. Stein¹, Xuehong Wei¹, Chen-Shan Chin², Katherine E. Guill⁴, Michael Regulski¹, Sunita Kumari¹, Andrew Olson¹, Jonathan I. Gent⁵, Kevin L. Schneider⁶, Thomas K. Wolfgruber⁶, Michael R. May³, Nathan M. Springer⁷, Eric Antoniou¹, W. Richard McCombie¹, Gernot G. Presting⁶, Michael D. McMullen⁴, Jeffrey Ross-Ibarra³, R. Kelly Dawe⁵, Alex Hastie, David R. Rank², Doreen Ware⁸, Doreen Ware¹ - Show less +25 more•Institutions (8)

Cold Spring Harbor Laboratory¹, Pacific Biosciences², University of California, Davis³, United States Department of Agriculture⁴, University of Georgia⁵, University of Hawaii at Manoa⁶, University of Minnesota⁷, Cornell University⁸

12 Jun 2017-Nature

TL;DR: The assembly and annotation of a reference genome of maize is reported, using single-molecule real-time sequencing and high-resolution optical mapping to identify transposable element lineage expansions that are unique to maize.

...read moreread less

Abstract: Complete and accurate reference genomes and annotations provide fundamental tools for characterization of genetic and functional variation. These resources facilitate the determination of biological processes and support translation of research findings into improved and sustainable agricultural technologies. Many reference genomes for crop plants have been generated over the past decade, but these genomes are often fragmented and missing complex repeat regions. Here we report the assembly and annotation of a reference genome of maize, a genetic and agricultural model species, using single-molecule real-time sequencing and high-resolution optical mapping. Relative to the previous reference genome, our assembly features a 52-fold increase in contig length and notable improvements in the assembly of intergenic spaces and centromeres. Characterization of the repetitive portion of the genome revealed more than 130,000 intact transposable elements, allowing us to identify transposable element lineage expansions that are unique to maize. Gene annotations were updated using 111,000 full-length transcripts obtained by single-molecule real-time sequencing. In addition, comparative optical mapping of two other inbred maize lines revealed a prevalence of deletions in regions of low gene density and maize lineage-specific genes.

...read moreread less

1,196 citations

Journal Article•10.1101/GR.185579.114•

The Release 6 reference sequence of the Drosophila melanogaster genome

[...]

Roger A. Hoskins¹, Joseph W. Carlson¹, Kenneth H. Wan¹, Soo Park¹, Ivonne Mendez¹, Samuel E. Galle¹, Benjamin W. Booth¹, Barret D. Pfeiffer², Reed A. George², Robert Svirskas², Martin Krzywinski³, Jacqueline E. Schein³, Maria Carmela Accardo⁴, Elisabetta Damia⁴, Giovanni Messina⁴, Maria Mendez-Lago⁵, Beatriz de Pablos⁵, Olga V. Demakova⁶, Evgeniya N. Andreyeva⁶, Lidiya V. Boldyreva⁶, Marco A. Marra³, A. Bernardo Carvalho⁷, Patrizio Dimitri⁴, Alfredo Villasante⁵, Igor F. Zhimulev⁶, Igor F. Zhimulev⁸, Gerald M. Rubin², Gary H. Karpen¹, Gary H. Karpen⁹, Susan E. Celniker¹ - Show less +26 more•Institutions (9)

Lawrence Berkeley National Laboratory¹, Howard Hughes Medical Institute², BC Cancer Agency³, Sapienza University of Rome⁴, Spanish National Research Council⁵, Russian Academy of Sciences⁶, Federal University of Rio de Janeiro⁷, Novosibirsk State University⁸, University of California, Berkeley⁹

14 Jan 2015-Genome Research

TL;DR: An improved reference sequence of the single-copy and middle-repetitive regions of the genome is reported, produced using cytogenetic mapping to mitotic and polytene chromosomes, clone-based finishing and BAC fingerprint verification, ordering of scaffolds by alignment to cDNA sequences, incorporation of other map and sequence data, and validation by whole-genome optical restriction mapping.

...read moreread less

Abstract: Drosophila melanogaster plays an important role in molecular, genetic, and genomic studies of heredity, development, metabolism, behavior, and human disease. The initial reference genome sequence reported more than a decade ago had a profound impact on progress in Drosophila research, and improving the accuracy and completeness of this sequence continues to be important to further progress. We previously described improvement of the 117-Mb sequence in the euchromatic portion of the genome and 21 Mb in the heterochromatic portion, using a whole-genome shotgun assembly, BAC physical mapping, and clone-based finishing. Here, we report an improved reference sequence of the single-copy and middle-repetitive regions of the genome, produced using cytogenetic mapping to mitotic and polytene chromosomes, clone-based finishing and BAC fingerprint verification, ordering of scaffolds by alignment to cDNA sequences, incorporation of other map and sequence data, and validation by whole-genome optical restriction mapping. These data substantially improve the accuracy and completeness of the reference sequence and the order and orientation of sequence scaffolds into chromosome arm assemblies. Representation of the Y chromosome and other heterochromatic regions is particularly improved. The new 143.9-Mb reference sequence, designated Release 6, effectively exhausts clone-based technologies for mapping and sequencing. Highly repeat-rich regions, including large satellite blocks and functional elements such as the ribosomal RNA genes and the centromeres, are largely inaccessible to current sequencing and assembly methods and remain poorly represented. Further significant improvements will require sequencing technologies that do not depend on molecular cloning and that produce very long reads.

...read moreread less

408 citations

Journal Article•10.1101/GR.828403•

Whole-Genome Sequence Assembly for Mammalian Genomes: Arachne 2

[...]

David B. Jaffe¹, Jonathan Butler, Sante Gnerre, Evan Mauceli, Kerstin Lindblad-Toh, Jill P. Mesirov, Michael C. Zody, Eric S. Lander - Show less +4 more•Institutions (1)

Massachusetts Institute of Technology¹

01 Jan 2003-Genome Research

TL;DR: Algorithmic adaptations to the whole-genome assembly program Arachne are described, allowing for assembly of mammalian-size genomes, and also improving the assembly of smaller genomes.

...read moreread less

Abstract: We previously described the whole-genome assembly program Arachne, presenting assemblies of simulated data for small to mid-sized genomes. Here we describe algorithmic adaptations to the program, allowing for assembly of mammalian-size genomes, and also improving the assembly of smaller genomes. Three principal changes were simultaneously made and applied to the assembly of the mouse genome, during a six-month period of development: (1) Supercontigs (scaffolds) were iteratively broken and rejoined using several criteria, yielding a 64-fold increase in length (N50), and apparent elimination of all global misjoins; (2) gaps between contigs in supercontigs were filled (partially or completely) by insertion of reads, as suggested by pairing within the supercontig, increasing the N50 contig length by 50%; (3) memory usage was reduced fourfold. The outcome of this mouse assembly and its analysis are described in (Mouse Genome Sequencing Consortium 2002).

...read moreread less

369 citations

Journal Article•10.1038/NCOMMS15324•

Sequencing and de novo assembly of a near complete indica rice genome.

[...]

Huilong Du¹, Ying Yu¹, Yanfei Ma¹, Qiang Gao¹, Yinghao Cao¹, Zhuo Chen¹, Bin Ma¹, Ming Qi¹, Yan Li¹, Xianfeng Zhao¹, Jing Wang¹, Kunfan Liu¹, Peng Qin², Xin Yang¹, Lihuang Zhu¹, Shigui Li², Chengzhi Liang¹ - Show less +13 more•Institutions (2)

Chinese Academy of Sciences¹, Sichuan Agricultural University²

04 May 2017-Nature Communications

TL;DR: The de novo assembly of an indica rice genome Shuhui498 (R498) is reported through the integration of single-molecule sequencing and mapping data, genetic map and fosmid sequence tags, demonstrating how to de noovo assemble a highly contiguous and near-complete plant genome through an integrative strategy.

...read moreread less

Abstract: A high-quality reference genome is critical for understanding genome structure, genetic variation and evolution of an organism. Here we report the de novo assembly of an indica rice genome Shuhui498 (R498) through the integration of single-molecule sequencing and mapping data, genetic map and fosmid sequence tags. The 390.3 Mb assembly is estimated to cover more than 99% of the R498 genome and is more continuous than the current reference genomes of japonica rice Nipponbare (MSU7) and Arabidopsis thaliana (TAIR10). We annotate high-quality protein-coding genes in R498 and identify genetic variations between R498 and Nipponbare and presence/absence variations by comparing them to 17 draft genomes in cultivated rice and its closest wild relatives. Our results demonstrate how to de novo assemble a highly contiguous and near-complete plant genome through an integrative strategy. The R498 genome will serve as a reference for the discovery of genes and structural variations in rice. High-quality reference genomes facilitate analysis of genome structure and variation. Here Duet al. create a near-complete assembly of the indicarice genome by combining single molecule sequencing with mapping data and fosmid sequences and identify genetic variants by comparison with other rice genomes.

...read moreread less

320 citations

...

Expand

Year	Papers
2020	2
2019	4
2018	3
2017	5
2016	7
2015	9

Topic Tools

Papers published on a yearly basis

Papers

A complete bacterial genome assembled de novo using only nanopore sequencing data

Improved maize reference genome with single-molecule technologies

The Release 6 reference sequence of the Drosophila melanogaster genome

Whole-Genome Sequence Assembly for Mammalian Genomes: Arachne 2

Sequencing and de novo assembly of a near complete indica rice genome.

Related Topics (5)

Performance Metrics