Optimizing read mapping to reference genomes to determine composition and species prevalence in microbial communities
John Martin,Sean M. Sykes,Sarah Young,Karthik Kota,Ravi Sanka,Nihar U. Sheth,Joshua Orvis,Erica Sodergren,Zhengyuan Wang,George M. Weinstock,Makedonka Mitreva +10 more
TL;DR: It is found that constraining alignment length had more impact on sensitivity than does constraining similarity in all cases tested, and choosing the top hit randomly when multiple, equally strong mappings are available increases overall sensitivity at the expense of taxonomic resolution.
read more
Abstract: The Human Microbiome Project (HMP) aims to characterize the microbial communities of 18 body sites from healthy individuals. To accomplish this, the HMP generated two types of shotgun data: reference shotgun sequences isolated from different anatomical sites on the human body and shotgun metagenomic sequences from the microbial communities of each site. The alignment strategy for characterizing these metagenomic communities using available reference sequence is important to the success of HMP data analysis. Six next-generation aligners were used to align a community of known composition against a database comprising reference organisms known to be present in that community. All aligners report nearly complete genome coverage (>97%) for strains with over 6X depth of coverage, however they differ in speed, memory requirement and ease of use issues such as database size limitations and supported mapping strategies. The selected aligner was tested across a range of parameters to maximize sensitivity while maintaining a low false positive rate. We found that constraining alignment length had more impact on sensitivity than does constraining similarity in all cases tested. However, when reference species were replaced with phylogenetic neighbors, similarity begins to play a larger role in detection. We also show that choosing the top hit randomly when multiple, equally strong mappings are available increases overall sensitivity at the expense of taxonomic resolution. The results of this study identified a strategy that was used to map over 3 tera-bases of microbial sequence against a database of more than 5,000 reference genomes in just over a month.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Diet rapidly and reproducibly alters the human gut microbiome
Lawrence A. David,Corinne F. Maurice,Rachel N. Carmody,David B. Gootenberg,Julie E. Button,Benjamin E. Wolfe,Alisha V. Ling,A. Sloan Devlin,Yug Varma,Michael A. Fischbach,Sudha B. Biddinger,Rachel J. Dutton,Peter J. Turnbaugh +12 more
TL;DR: Increases in the abundance and activity of Bilophila wadsworthia on the animal-based diet support a link between dietary fat, bile acids and the outgrowth of microorganisms capable of triggering inflammatory bowel disease.
A framework for human microbiome research
Barbara A. Methé,Karen E. Nelson,Mihai Pop,Heather Huot Creasy,Michelle G. Giglio,Curtis Huttenhower,Curtis Huttenhower,Dirk Gevers,Joseph F. Petrosino,Sahar Abubucker,Jonathan H. Badger,Asif T. Chinwalla,Ashlee M. Earl,Michael Fitzgerald,Robert S. Fulton,Kymberlie Hallsworth-Pepin,Elizabeth A. Lobos,Ramana Madupu,Vincent Magrini,John Martin,Makedonka Mitreva,Donna M. Muzny,Erica Sodergren,James Versalovic,Aye Wollam,Kim C. Worley,Jennifer R. Wortman,Sarah Young,Qiandong Zeng,Kjersti Aagaard,Olukemi O. Abolude,Emma Allen-Vercoe,Eric J. Alm,Eric J. Alm,Lucia Alvarado,Gary L. Andersen,Scott Anderson,Elizabeth L. Appelbaum,Harindra Arachchi,Gary C. Armitage,Cesar Arze,Tulin Ayvaz,Carl C. Baker,Lisa Begg,Tsegahiwot Belachew,Veena Bhonagiri,Monika Bihan,Martin J. Blaser,Toby Bloom,Vivien Bonazzi,Paul Brooks,Gregory A. Buck,Christian J. Buhay,Dana A. Busam,Joseph L. Campbell,Shane Canon,Brandi L. Cantarel,Patrick S. G. Chain,Patrick S. G. Chain,I. Min A. Chen,Lei Chen,Shaila Chhibba,Ken Chu,Dawn Ciulla,Jose C. Clemente,Sandra W. Clifton,Sean Conlan,Jonathan Crabtree,Mary A. Cutting,Noam J. Davidovics,Catherine C. Davis,Todd Z. DeSantis,Carolyn Deal,Kimberley D. Delehaunty,Floyd E. Dewhirst,Elena Deych,Yan Ding,David J. Dooling,Shannon Dugan,W. Michael Dunne,W. Michael Dunne,A. Scott Durkin,Robert C. Edgar,Rachel L. Erlich,Candace N. Farmer,Ruth M. Farrell,Karoline Faust,Michael Feldgarden,Victor Felix,Sheila Fisher,Anthony A. Fodor,Larry J. Forney,Leslie Foster,Valentina Di Francesco,Jonathan Friedman,Dennis C. Friedrich,Catrina Fronick,Lucinda Fulton,Hongyu Gao,Nathalia Garcia,Georgia Giannoukos,Christina Giblin,Maria Y. Giovanni,Jonathan M. Goldberg,Johannes B. Goll,Antonio Gonzalez,Allison D. Griggs,Sharvari Gujja,Brian J. Haas,Holli A. Hamilton,Emily L. Harris,Theresa A. Hepburn,Brandi Herter,Diane E. Hoffmann,Michael Holder,Clinton Howarth,Katherine H. Huang,Susan M. Huse,Jacques Izard,Janet K. Jansson,Huaiyang Jiang,Catherine Jordan,Vandita Joshi,James A. Katancik,Wendy A. Keitel,Scott T. Kelley,Cristyn Kells,Susan Kinder-Haake,Nicholas B. King,Rob Knight,Rob Knight,Dan Knights,Heidi H. Kong,Omry Koren,Sergey Koren,Karthik Kota,Christie Kovar,Nikos C. Kyrpides,Patricio S. La Rosa,Sandra L. Lee,Katherine P. Lemon,Niall Lennon,Cecil M. Lewis,Lora Lewis,Ruth E. Ley,Kelvin Li,Konstantinos Liolios,Bo Liu,Yue Liu,Chien Chi Lo,Catherine A. Lozupone,R. Dwayne Lunsford,Tessa Madden,Anup Mahurkar,Peter J. Mannon,Elaine R. Mardis,Victor M. Markowitz,Victor M. Markowitz,Konstantinos Mavrommatis,Jamison McCorrison,Daniel McDonald,Jean E. McEwen,Amy L. McGuire,Pamela McInnes,Teena Mehta,Kathie A. Mihindukulasuriya,Jason R. Miller,Patrick Minx,Irene Newsham,Chad Nusbaum,Michelle O'Laughlin,Joshua Orvis,Ioanna Pagani,Krishna Palaniappan,Shital M. Patel,Matthew D. Pearson,Jane Peterson,Mircea Podar,Craig Pohl,Katherine S. Pollard,Margaret Priest,Lita M. Proctor,Xiang Qin,Jeroen Raes,Jacques Ravel,Jeffrey G. Reid,Mina Rho,Rosamond Rhodes,Kevin Riehle,Maria C. Rivera,Beltran Rodriguez-Mueller,Yu-Hui Rogers,Matthew C. Ross,Carsten Russ,Ravi Sanka,Pamela Sankar,J. Fah Sathirapongsasuti,Jeffery A. Schloss,Patrick D. Schloss,Thomas M. Schmidt,Matthew B. Scholz,Lynn M. Schriml,Alyxandria M. Schubert,Nicola Segata,Julia A. Segre,William D. Shannon,Richard R. Sharp,Thomas J. Sharpton,Narmada Shenoy,Nihar U. Sheth,Gina A. Simone,Indresh Singh,Christopher Smillie,Jack D. Sobel,Daniel D. Sommer,Paul Spicer,Granger G. Sutton,Sean M. Sykes,Diana Tabbaa,Mathangi Thiagarajan,Chad Tomlinson,Manolito Torralba,Todd J. Treangen,Rebecca Truty,Tatiana A. Vishnivetskaya,Jason Walker,Lu Wang,Zhengyuan Wang,Doyle V. Ward,Wesley C. Warren,Mark A. Watson,Christopher Wellington,Kris A. Wetterstrand,James R. White,Katarzyna Wilczek-Boney,Yuan Qing Wu,Kristine M. Wylie,Todd Wylie,Chandri Yandava,Liang Ye,Yuzhen Ye,Shibu Yooseph,Bonnie P. Youmans,Lan Zhang,Yanjiao Zhou,Yiming Zhu,Laurie Zoloth,Jeremy Zucker,Bruce W. Birren,Richard A. Gibbs,Sarah K. Highlander,George M. Weinstock,Richard K. Wilson,Owen White +253 more
TL;DR: The Human Microbiome Project (HMP) Consortium has established a population-scale framework which catalyzed significant development of metagenomic protocols resulting in a broad range of quality-controlled resources and data including standardized methods for creating, processing and interpreting distinct types of high-throughput metagenomics data available to the scientific community as mentioned in this paper.
Metabolic Reconstruction for Metagenomic Data and Its Application to the Human Microbiome
Sahar Abubucker,Nicola Segata,Johannes B. Goll,Alyxandria M. Schubert,Jacques Izard,Brandi L. Cantarel,Beltran Rodriguez-Mueller,Jeremy Zucker,Mathangi Thiagarajan,Bernard Henrissat,Owen White,Scott T. Kelley,Barbara A. Methé,Patrick D. Schloss,Dirk Gevers,Makedonka Mitreva,Curtis Huttenhower,Curtis Huttenhower +17 more
TL;DR: We derive the functional and metabolic potential of a microbial community metagenome directly from short sequence reads.
Metabolic Reconstruction for Metagenomic Data and Its Application to the Human Microbiome
Sahar Abubucker,Nicola Segata,Johannes B. Goll,Alyxandria M. Schubert,Jacques Izard,Brandi L. Cantarel,Beltran Rodriguez-Mueller,Jeremy Zucker,Mathangi Thiagarajan,Bernard Henrissat,Owen White,Scott T. Kelley,Barbara A. Methé,Patrick D. Schloss,Dirk Gevers,Makedonka Mitreva,Curtis Huttenhower,Curtis Huttenhower +17 more
- 01 Jun 2012
TL;DR: This work determined the gene families and pathways present or absent within a community, as well as their relative abundances, directly from short sequence reads, enabling the determination of community roles in the HMP cohort and in future metagenomic studies.
770
An introduction to the analysis of shotgun metagenomic data.
TL;DR: This review describes the analytical strategies and specific tools that can be applied to metagenomic data and the considerations and caveats associated with their use and documents how metagenomes can be analyzed to quantify community structure and diversity.
577
References
The Sequence Alignment/Map format and SAMtools
Heng Li,Bob Handsaker,Alec Wysoker,T. J. Fennell,Jue Ruan,Nils Homer,Gabor T. Marth,Gonçalo R. Abecasis,Richard Durbin +8 more
TL;DR: SAMtools as discussed by the authors implements various utilities for post-processing alignments in the SAM format, such as indexing, variant caller and alignment viewer, and thus provides universal tools for processing read alignments.
Fast and accurate short read alignment with Burrows–Wheeler transform
Heng Li,Richard Durbin +1 more
TL;DR: Burrows-Wheeler Alignment tool (BWA) is implemented, a new read alignment package that is based on backward search with Burrows–Wheeler Transform (BWT), to efficiently align short sequencing reads against a large reference sequence such as the human genome, allowing mismatches and gaps.
Naïve Bayesian Classifier for Rapid Assignment of rRNA Sequences into the New Bacterial Taxonomy
TL;DR: The RDP Classifier can rapidly and accurately classify bacterial 16S rRNA sequences into the new higher-order taxonomy proposed in Bergey's Taxonomic Outline of the Prokaryotes, and the majority of the classification errors appear to be due to anomalies in the current taxonomies.
Diversity of the human intestinal microbial flora.
Paul B. Eckburg,Elisabeth M. Bik,Charles N. Bernstein,Elizabeth Purdom,Les Dethlefsen,Michael Sargent,Steven R. Gill,Karen E. Nelson,David A. Relman,David A. Relman,David A. Relman +10 more
TL;DR: A majority of the bacterial sequences corresponded to uncultivated species and novel microorganisms, and significant intersubject variability and differences between stool and mucosa community composition were discovered.
Mauve: multiple alignment of conserved genomic sequence with rearrangements.
TL;DR: This work presents methods for identification and alignment of conserved genomic DNA in the presence of rearrangements and horizontal transfer and evaluated the quality of Mauve alignments and drawn comparison to other methods through extensive simulations of genome evolution.
Related Papers (5)
Junjie Qin,Ruiqiang Li,Jeroen Raes,Manimozhiyan Arumugam,Kristoffer Sølvsten Burgdorf,Chaysavanh Manichanh,Trine Nielsen,Nicolas Pons,Florence Levenez,Takuji Yamada,Daniel R. Mende,Junhua Li,Junming Xu,Shaochuan Li,Dongfang Li,Jianjun Cao,Bo Wang,Huiqing Liang,Huisong Zheng,Yinlong Xie,Julien Tap,Patricia Lepage,Marcelo Bertalan,Jean-Michel Batto,Torben Hansen,Denis Le Paslier,Allan Linneberg,H. Bjørn Nielsen,Eric Pelletier,Pierre Renault,Thomas Sicheritz-Pontén,Keith Turner,Hongmei Zhu,Chang Yu,Shengting Li,Min Jian,Yan Zhou,Yingrui Li,Xiuqing Zhang,Songgang Li,Nan Qin,Huanming Yang,Jian Wang,Søren Brunak,Joël Doré,Francisco Guarner,Karsten Kristiansen,Oluf Pedersen,Julian Parkhill,Jean Weissenbach,Peer Bork,S. Dusko Ehrlich,Jun Wang +52 more
Curtis Huttenhower,Curtis Huttenhower,Dirk Gevers,Rob Knight,Rob Knight,Sahar Abubucker,Jonathan H. Badger,Asif T. Chinwalla,Heather Huot Creasy,Ashlee M. Earl,Michael Fitzgerald,Robert S. Fulton,Michelle G. Giglio,Kymberlie Hallsworth-Pepin,Elizabeth A. Lobos,Ramana Madupu,Vincent Magrini,John Martin,Makedonka Mitreva,Donna M. Muzny,Erica Sodergren,James Versalovic,Aye Wollam,Kim C. Worley,Jennifer R. Wortman,Sarah Young,Qiandong Zeng,Kjersti Aagaard,Olukemi O. Abolude,Emma Allen-Vercoe,Eric J. Alm,Eric J. Alm,Lucia Alvarado,Gary L. Andersen,Scott Anderson,Elizabeth L. Appelbaum,Harindra Arachchi,Gary C. Armitage,Cesar Arze,Tulin Ayvaz,Carl C. Baker,Lisa Begg,Tsegahiwot Belachew,Veena Bhonagiri,Monika Bihan,Martin J. Blaser,Toby Bloom,Vivien Bonazzi,J. Paul Brooks,Gregory A. Buck,Christian J. Buhay,Dana A. Busam,Joseph L. Campbell,Shane Canon,Brandi L. Cantarel,Patrick S. G. Chain,Patrick S. G. Chain,I. Min A. Chen,Lei Chen,Shaila Chhibba,Ken Chu,Dawn Ciulla,Jose C. Clemente,Sandra W. Clifton,Sean Conlan,Jonathan Crabtree,Mary A. Cutting,Noam J. Davidovics,Catherine C. Davis,Todd Z. DeSantis,Carolyn Deal,Kimberley D. Delehaunty,Floyd E. Dewhirst,Elena Deych,Yan Ding,David J. Dooling,Shannon Dugan,Wm. Michael Dunne,Wm. Michael Dunne,A. Scott Durkin,Robert C. Edgar,Rachel L. Erlich,Candace N. Farmer,Ruth M. Farrell,Karoline Faust,Michael Feldgarden,Victor Felix,Sheila Fisher,Anthony A. Fodor,Larry J. Forney,Leslie Foster,Valentina Di Francesco,Jonathan Friedman,Dennis C. Friedrich,Catrina Fronick,Lucinda Fulton,Hongyu Gao,Nathalia Garcia,Georgia Giannoukos,Christina Giblin,Maria Y. Giovanni,Jonathan M. Goldberg,Johannes B. Goll,Antonio Gonzalez,Allison D. Griggs,Sharvari Gujja,Susan Kinder Haake,Brian J. Haas,Holli A. Hamilton,Emily L. Harris,Theresa A. Hepburn,Brandi Herter,Diane E. Hoffmann,Michael Holder,Clinton Howarth,Katherine H. Huang,Susan M. Huse,Jacques Izard,Janet K. Jansson,Huaiyang Jiang,Catherine Jordan,Vandita Joshi,James A. Katancik,Wendy A. Keitel,Scott T. Kelley,Cristyn Kells,Nicholas B. King,Dan Knights,Heidi H. Kong,Omry Koren,Sergey Koren,Karthik Kota,Christie Kovar,Nikos C. Kyrpides,Patricio S. La Rosa,Sandra L. Lee,Katherine P. Lemon,Niall J. Lennon,Cecil M. Lewis,Lora Lewis,Ruth E. Ley,Kelvin Li,Konstantinos Liolios,Bo Liu,Yue Liu,Chien Chi Lo,Catherine A. Lozupone,R. Dwayne Lunsford,Tessa Madden,Anup Mahurkar,Peter J. Mannon,Elaine R. Mardis,Victor M. Markowitz,Victor M. Markowitz,Konstantinos Mavromatis,Jamison McCorrison,Daniel McDonald,Jean E. McEwen,Amy L. McGuire,Pamela McInnes,Teena Mehta,Kathie A. Mihindukulasuriya,Jason R. Miller,Patrick Minx,Irene Newsham,Chad Nusbaum,Michelle Oglaughlin,Joshua Orvis,Ioanna Pagani,Krishna Palaniappan,Shital M. Patel,Matthew D. Pearson,Jane Peterson,Mircea Podar,Craig Pohl,Katherine S. Pollard,Mihai Pop,Margaret Priest,Lita M. Proctor,Xiang Qin,Jeroen Raes,Jacques Ravel,Jeffrey G. Reid,Mina Rho,Rosamond Rhodes,Kevin Riehle,Maria C. Rivera,Beltran Rodriguez-Mueller,Yu-Hui Rogers,Matthew C. Ross,Carsten Russ,Ravi Sanka,Pamela Sankar,J. Fah Sathirapongsasuti,Jeffery A. Schloss,Patrick D. Schloss,Thomas M. Schmidt,Matthew B. Scholz,Lynn M. Schriml,Alyxandria M. Schubert,Nicola Segata,Julia A. Segre,William D. Shannon,Richard R. Sharp,Thomas J. Sharpton,Narmada Shenoy,Nihar U. Sheth,Gina A. Simone,Indresh Singh,Christopher Smillie,Jack D. Sobel,Daniel D. Sommer,Paul Spicer,Granger G. Sutton,Sean M. Sykes,Diana Tabbaa,Mathangi Thiagarajan,Chad Tomlinson,Manolito Torralba,Todd J. Treangen,Rebecca Truty,Tatiana A. Vishnivetskaya,Jason Walker,Lu Wang,Zhengyuan Wang,Doyle V. Ward,Wesley C. Warren,Mark A. Watson,Christopher Wellington,Kris A. Wetterstrand,James R. White,Katarzyna Wilczek-Boney,Yuanqing Wu,Kristine M. Wylie,Todd Wylie,Chandri Yandava,Liang Ye,Yuzhen Ye,Shibu Yooseph,Bonnie P. Youmans,Lan Zhang,Yanjiao Zhou,Yiming Zhu,Laurie Zoloth,Jeremy Zucker,Bruce W. Birren,Richard A. Gibbs,Sarah K. Highlander,Barbara A. Methé,Karen E. Nelson,Joseph F. Petrosino,George M. Weinstock,Richard K. Wilson,Owen White +253 more
Manimozhiyan Arumugam,Jeroen Raes,Eric Pelletier,Denis Le Paslier,Takuji Yamada,Daniel R. Mende,Gabriel Fernandes,Julien Tap,Thomas Brüls,Jean-Michel Batto,Marcelo Bertalan,Natalia Borruel,Francesc Casellas,Leyden Fernández,Laurent Gautier,Torben Hansen,Masahira Hattori,Tetsuya Hayashi,Michiel Kleerebezem,Ken Kurokawa,Marion Leclerc,Florence Levenez,Chaysavanh Manichanh,H. Bjørn Nielsen,Trine Nielsen,Nicolas Pons,Julie Poulain,Junjie Qin,Thomas Sicheritz-Pontén,Sebastian Tims,David Torrents,Edgardo Ugarte,Erwin G. Zoetendal,Jun Wang,Francisco Guarner,Oluf Pedersen,Willem M. de Vos,Søren Brunak,Joël Doré,Jean Weissenbach,S. Dusko Ehrlich,Peer Bork +41 more
Tanya Yatsunenko,Federico E. Rey,Mark J. Manary,Mark J. Manary,Indi Trehan,Indi Trehan,Maria Gloria Dominguez-Bello,Monica Contreras,Magda Magris,Glida Hidalgo,Robert N. Baldassano,Andrey P. Anokhin,Andrew C. Heath,Barbara B. Warner,Jens Reeder,Justin Kuczynski,J. Gregory Caporaso,Catherine A. Lozupone,Christian L. Lauber,Jose C. Clemente,Dan Knights,Rob Knight,Jeffrey I. Gordon +22 more
J. Gregory Caporaso,Justin Kuczynski,Jesse Stombaugh,Kyle Bittinger,Frederic D. Bushman,Elizabeth K. Costello,Noah Fierer,Antonio Gonzalez Peña,Julia K. Goodrich,Jeffrey I. Gordon,Gavin A. Huttley,Scott T. Kelley,Dan Knights,Jeremy E. Koenig,Ruth E. Ley,Catherine A. Lozupone,Daniel McDonald,Brian D. Muegge,Meg Pirrung,Jens Reeder,Joel Sevinsky,Peter J. Turnbaugh,William A. Walters,Jeremy Widmann,Tanya Yatsunenko,Jesse R. Zaneveld,Rob Knight,Rob Knight +27 more