Unlocking Short Read Sequencing for Metagenomics
Abstract
We describe an experimental and computational pipeline that yields millions of reads, which can exceed 200 bp, with quality scores approaching those of traditional Sanger sequencing. The method combines an automatable gel-less library construction step with paired-end sequencing on a short-read instrument. With appropriately sized library inserts, mate-pair sequences can overlap, and we describe the SHERA software package, which joins them to form a longer composite read.
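The idea of joining overlapping mate pairs into a composite read can be illustrated with a minimal sketch. This is not SHERA's algorithm: the function names, the `min_overlap` and `max_mismatch_frac` parameters, and the simple mismatch-counting rule are all assumptions made for illustration, and base quality scores are ignored entirely.

```python
def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T, N only)."""
    comp = {"A": "T", "C": "G", "G": "C", "T": "A", "N": "N"}
    return "".join(comp[base] for base in reversed(seq))


def merge_pair(fwd, rev, min_overlap=10, max_mismatch_frac=0.1):
    """Join a forward read and its mate into one composite read.

    Hypothetical helper: the mate is reverse-complemented so both reads are
    on the same strand, then the longest suffix of `fwd` that matches a
    prefix of the mate (within the allowed mismatch fraction) is taken as
    the overlap. Returns None if no acceptable overlap is found.
    """
    rev_rc = reverse_complement(rev)
    longest = min(len(fwd), len(rev_rc))
    # Try the longest candidate overlap first, then shrink.
    for olen in range(longest, min_overlap - 1, -1):
        tail = fwd[-olen:]
        head = rev_rc[:olen]
        mismatches = sum(1 for x, y in zip(tail, head) if x != y)
        if mismatches <= max_mismatch_frac * olen:
            # Keep the forward read whole and append the non-overlapping
            # remainder of the mate.
            return fwd + rev_rc[olen:]
    return None


# Usage: two 20 bp reads from the ends of a 30 bp insert overlap by 10 bp.
fragment = "ACGTTGCAAGGCTTAACCGGATGCATTCGA"
fwd = fragment[:20]
rev = reverse_complement(fragment[10:])
merged = merge_pair(fwd, rev)
print(merged == fragment)  # → True
```

A practical read merger would also weigh the quality scores of disagreeing bases in the overlap and guard against spurious overlaps in repetitive sequence. The sketch omits both.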
- Authors:
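- Rodrigue, Sébastien; Materna, Arne C.; Timberlake, Sonia C.; Blackburn, Matthew C.; Malmstrom, Rex R.; Alm, Eric J.; Chisholm, Sallie W.; Gilbert, Jack Anthony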
- Massachusetts Inst. of Technology (MIT), Cambridge, MA (United States). Dept. of Civil and Environmental Engineering
- Publication Date:
- July 2010
- Research Org.:
- Massachusetts Inst. of Technology (MIT), Cambridge, MA (United States)
- Sponsoring Org.:
- USDOE
- OSTI Identifier:
- 1026647
- Report Number(s):
- DoE/ER/-64506-4; Journal ID: ISSN 1932-6203
- Grant/Contract Number:
- FG02-07ER64506
- Resource Type:
- Accepted Manuscript
- Journal Name:
- PLoS ONE
- Additional Journal Information:
- Journal Volume: 5; Journal Issue: 7; Journal ID: ISSN 1932-6203
- Publisher:
- Public Library of Science
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 59 BASIC BIOLOGICAL SCIENCES
Citation Formats
Rodrigue, Sébastien, Materna, Arne C., Timberlake, Sonia C., Blackburn, Matthew C., Malmstrom, Rex R., Alm, Eric J., Chisholm, Sallie W., and Gilbert, Jack Anthony. Unlocking Short Read Sequencing for Metagenomics. United States: N. p., 2010. Web. doi:10.1371/journal.pone.0011840.
Rodrigue, Sébastien, Materna, Arne C., Timberlake, Sonia C., Blackburn, Matthew C., Malmstrom, Rex R., Alm, Eric J., Chisholm, Sallie W., & Gilbert, Jack Anthony. Unlocking Short Read Sequencing for Metagenomics. United States. https://doi.org/10.1371/journal.pone.0011840
Rodrigue, Sébastien, Materna, Arne C., Timberlake, Sonia C., Blackburn, Matthew C., Malmstrom, Rex R., Alm, Eric J., Chisholm, Sallie W., and Gilbert, Jack Anthony. Wed . "Unlocking Short Read Sequencing for Metagenomics". United States. https://doi.org/10.1371/journal.pone.0011840. https://www.osti.gov/servlets/purl/1026647.
@article{osti_1026647,
title = {Unlocking Short Read Sequencing for Metagenomics},
author = {Rodrigue, Sébastien and Materna, Arne C. and Timberlake, Sonia C. and Blackburn, Matthew C. and Malmstrom, Rex R. and Alm, Eric J. and Chisholm, Sallie W. and Gilbert, Jack Anthony},
abstractNote = {We describe an experimental and computational pipeline yielding millions of reads that can exceed 200 bp with quality scores approaching that of traditional Sanger sequencing. The method combines an automatable gel-less library construction step with paired-end sequencing on a short-read instrument. With appropriately sized library inserts, mate-pair sequences can overlap, and we describe the SHERA software package that joins them to form a longer composite read.},
doi = {10.1371/journal.pone.0011840},
journal = {PLoS ONE},
number = 7,
volume = 5,
place = {United States},
year = {2010},
month = {7}
}