Assemblers

Topic Tools

Papers published on a yearly basis

Papers

Journal Article•10.1371/JOURNAL.PONE.0017915•

A practical comparison of de novo genome assembly software tools for next-generation sequencing technologies.

[...]

Wenyu Zhang¹, Jiajia Chen¹, Yang Yang¹, Yifei Tang¹, Jing Shang¹, Bairong Shen¹ - Show less +2 more•Institutions (1)

Soochow University (Suzhou)¹

14 Mar 2011-PLOS ONE

TL;DR: This study indicates that string-based assemblers, overlap-layout-consensus (OLC) assemblers are well-suited for very short reads and longer reads of small genomes respectively, and graph-basedassemblers would be more appropriate for large datasets of more than hundred millions of short reads.

...read moreread less

Abstract: The advent of next-generation sequencing technologies is accompanied with the development of many whole-genome sequence assembly methods and software, especially for de novo fragment assembly. Due to the poor knowledge about the applicability and performance of these software tools, choosing a befitting assembler becomes a tough task. Here, we provide the information of adaptivity for each program, then above all, compare the performance of eight distinct tools against eight groups of simulated datasets from Solexa sequencing platform. Considering the computational time, maximum random access memory (RAM) occupancy, assembly accuracy and integrity, our study indicate that string-based assemblers, overlap-layout-consensus (OLC) assemblers are well-suited for very short reads and longer reads of small genomes respectively. For large datasets of more than hundred millions of short reads, De Bruijn graph-based assemblers would be more appropriate. In terms of software implementation, string-based assemblers are superior to graph-based ones, of which SOAPdenovo is complex for the creation of configuration file. Our comparison study will assist researchers in selecting a well-suited assembler and offer essential information for the improvement of existing assemblers or the developing of novel assemblers.

...read moreread less

300 citations

Journal Article•10.1371/JOURNAL.PONE.0019175•

Comparing de novo genome assembly: the long and short of it.

[...]

Giuseppe Narzisi¹, Bud Mishra², Bud Mishra¹•Institutions (2)

Courant Institute of Mathematical Sciences¹, New York University²

29 Apr 2011-PLOS ONE

TL;DR: This paper highlights common anomalies in assembly accuracy through a rigorous study of several assemblers, compared under both standard metrics as well as a more comprehensive metric (Feature-Response Curves, FRC) that is introduced here; FRC transparently captures the trade-offs between contigs' quality against their sizes.

...read moreread less

Abstract: Recent advances in DNA sequencing technology and their focal role in Genome Wide Association Studies (GWAS) have rekindled a growing interest in the whole-genome sequence assembly (WGSA) problem, thereby, inundating the field with a plethora of new formalizations, algorithms, heuristics and implementations. And yet, scant attention has been paid to comparative assessments of these assemblers' quality and accuracy. No commonly accepted and standardized method for comparison exists yet. Even worse, widely used metrics to compare the assembled sequences emphasize only size, poorly capturing the contig quality and accuracy. This paper addresses these concerns: it highlights common anomalies in assembly accuracy through a rigorous study of several assemblers, compared under both standard metrics (N50, coverage, contig sizes, etc.) as well as a more comprehensive metric (Feature-Response Curves, FRC) that is introduced here; FRC transparently captures the trade-offs between contigs' quality against their sizes. For this purpose, most of the publicly available major sequence assemblers – both for low-coverage long (Sanger) and high-coverage short (Illumina) reads technologies – are compared. These assemblers are applied to microbial (Escherichia coli, Brucella, Wolbachia, Staphylococcus, Helicobacter) and partial human genome sequences (Chr. Y), using sequence reads of various read-lengths, coverages, accuracies, and with and without mate-pairs. It is hoped that, based on these evaluations, computational biologists will identify innovative sequence assembly paradigms, bioinformaticists will determine promising approaches for developing “next-generation” assemblers, and biotechnologists will formulate more meaningful design desiderata for sequencing technology platforms. A new software tool for computing the FRC metric has been developed and is available through the AMOS open-source consortium.

...read moreread less

169 citations

Journal Article•10.2307/255863•

Effects of Gender on Self- and Supervisory Ratings

[...]

Lynn M. Shore¹, George C. Thornton¹•Institutions (1)

Colorado State University¹

01 Mar 1986-Academy of Management Journal

TL;DR: This paper investigated the effects of supervisors' and subordinates' genders on self-and supervisory ratings in an organizational setting and found that both genders had an effect on self and supervisory rating.

...read moreread less

Abstract: This research investigated the effects of supervisors' and subordinates' genders on self- and supervisory ratings in an organizational setting. Participants were assemblers, 35 men and 35 women, an...

...read moreread less

111 citations

Journal Article•10.1111/J.1540-5885.2008.00312.X•

Interfirm Innovation under Uncertainty: Empirical Evidence for Strategic Knowledge Partitioning*

[...]

Jaegul Lee¹, Francisco Veloso²•Institutions (2)

Wayne State University¹, Carnegie Mellon University²

01 Sep 2008-Journal of Product Innovation Management

TL;DR: In this paper, the authors analyze how uncertainty and life-cycle effects condition the knowledge boundary between assemblers and suppliers in interfirm product development, showing that assemblers' greater emphasis on component innovation in periods of greater uncertainty is only true as a relative deviation from an overall trend toward increasing component innovation over time.

...read moreread less

99 citations

Posted Content•10.1101/637637•

metaFlye: scalable long-read metagenome assembly using repeat graphs

[...]

Mikhail Kolmogorov¹, Mikhail Rayko², Jeffrey Yuan¹, Evgeny Polevikov², Pavel A. Pevzner¹ - Show less +1 more•Institutions (2)

University of California, Berkeley¹, Saint Petersburg State University²

15 May 2019-bioRxiv

TL;DR: The metaFlye assembler is presented and it is demonstrated that it generates highly contiguous and accurate metagenome assemblies and captures many 16S RNA genes within long contigs, thus providing new opportunities for analyzing the microbial “dark matter of life”.

...read moreread less

Abstract: Long-read sequencing technologies substantially improved assemblies of many isolate bacterial genomes as compared to fragmented assemblies produced with short-read technologies. However, assembling complex metagenomic datasets remains a challenge even for the state-of-the-art long-read assemblers. To address this gap, we present the metaFlye assembler and demonstrate that it generates highly contiguous and accurate metagenome assemblies. In contrast to short-read metagenomics assemblers that typically fail to reconstruct full-length 16S RNA genes, metaFlye captures many 16S RNA genes within long contigs, thus providing new opportunities for analyzing the microbial "dark matter of life". We also demonstrate that long-read metagenome assemblers significantly improve full-length plasmid and virus reconstruction as compared to short-read assemblers and reveal many novel plasmids and viruses.

...read moreread less

56 citations

...

Expand

Year	Papers
2021	2
2020	5
2019	1
2018	5
2017	5
2016	3

Topic Tools

Papers published on a yearly basis

Papers

A practical comparison of de novo genome assembly software tools for next-generation sequencing technologies.

Comparing de novo genome assembly: the long and short of it.

Effects of Gender on Self- and Supervisory Ratings

Interfirm Innovation under Uncertainty: Empirical Evidence for Strategic Knowledge Partitioning*

metaFlye: scalable long-read metagenome assembly using repeat graphs

Related Topics (5)

Performance Metrics