skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: The Fast Changing Landscape of Sequencing Technologies and Their Impact on Microbial Genome Assemblies and Annotation

Journal Article · · PLoS ONE
 [1];  [2];  [2];  [2];  [1];  [1];  [3];  [1];  [1];  [4];  [2];  [1]
  1. U.S. Department of Energy, Joint Genome Institute
  2. ORNL
  3. Los Alamos National Laboratory (LANL)
  4. DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany

Background: The emergence of next generation sequencing (NGS) has provided the means for rapid and high throughput sequencing and data generation at low cost, while concomitantly creating a new set of challenges. The number of available assembled microbial genomes continues to grow rapidly and their quality reflects the quality of the sequencing technology used, but also of the analysis software employed for assembly and annotation. Methodology/Principal Findings: In this work, we have explored the quality of the microbial draft genomes across various sequencing technologies. We have compared the draft and finished assemblies of 133 microbial genomes sequenced at the Department of Energy-Joint Genome Institute and finished at the Los Alamos National Laboratory using a variety of combinations of sequencing technologies, reflecting the transition of the institute from Sanger-based sequencing platforms to NGS platforms. The quality of the public assemblies and of the associated gene annotations was evaluated using various metrics. Results obtained with the different sequencing technologies, as well as their effects on downstream processes, were analyzed. Our results demonstrate that the Illumina HiSeq 2000 sequencing system, the primary sequencing technology currently used for de novo genome sequencing and assembly at JGI, has various advantages in terms of total sequence throughput and cost, but it also introduces challenges for the downstream analyses. In all cases assembly results although on average are of high quality, need to be viewed critically and consider sources of errors in them prior to analysis. Conclusion: These data follow the evolution of microbial sequencing and downstream processing at the JGI from draft genome sequences with large gaps corresponding to missing genes of significant biological role to assemblies with multiple small gaps (Illumina) and finally to assemblies that generate almost complete genomes (Illumina+PacBio).

Research Organization:
Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
Work for Others (WFO)
DOE Contract Number:
DE-AC05-00OR22725
OSTI ID:
1063155
Journal Information:
PLoS ONE, Vol. 7, Issue 12; ISSN 1932--6203
Country of Publication:
United States
Language:
English

Similar Records

Gap Resolution
Software · Tue Jun 16 00:00:00 EDT 2009 · OSTI ID:1063155

Challenges in Whole-Genome Annotation of Pyrosequenced Eukaryotic Genomes
Conference · Fri Apr 17 00:00:00 EDT 2009 · OSTI ID:1063155

A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies
Journal Article · Tue Jul 18 00:00:00 EDT 2017 · Frontiers in Microbiology · OSTI ID:1063155

Related Subjects