OSTI.GOV, U.S. Department of Energy
Office of Scientific and Technical Information

Title: Sequence Data for Clostridium autoethanogenum using Three Generations of Sequencing Technologies

Journal Article · Scientific Data
 [1];  [1];  [2];  [1];  [2];  [3];  [1]
  1. Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)
  2. North Carolina State Univ., Raleigh, NC (United States)
  3. LanzaTech, Skokie, IL (United States)

During the past decade, DNA sequencing output has been dominated by second-generation sequencing platforms such as Illumina, which are characterized by low cost, high throughput, and shorter read lengths. The emergence and development of so-called third-generation sequencing platforms such as PacBio has permitted exceptionally long reads (over 20 kb) to be generated. Owing to read length increases, algorithm improvements, and hybrid assembly approaches, the concept of "one chromosome, one contig" and automated finishing of microbial genomes is now a realistic and achievable goal for many microbial laboratories. In this paper, we describe high-quality sequence datasets that span three generations of sequencing technologies, contain six types of data from four NGS platforms, and originate from a single microorganism, Clostridium autoethanogenum. The dataset reported here will help the scientific community evaluate upcoming NGS platforms, will enable comparison of existing and novel bioinformatics approaches, and will encourage interest in the development of innovative experimental and computational methods for NGS data.

Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States). BioEnergy Science Center (BESC)
Sponsoring Organization:
USDOE
Grant/Contract Number:
AC05-00OR22725
OSTI ID:
1185931
Journal Information:
Scientific Data, Vol. 2; ISSN 2052-4463
Publisher:
Nature Publishing Group
Country of Publication:
United States
Language:
English
Citation Metrics:
Cited by: 33 works
Citation information provided by
Web of Science

Cited By (12)

Data from: Sequence data for Clostridium autoethanogenum using three generations of sequencing technologies
  • Utturkar, Sagar M.; Klingeman, Dawn M.; Bruno-Barcena, José M.
  • Dryad Digital Repository-Supplementary information for journal article at DOI: 10.1038/sdata.2015.14, 3 files https://doi.org/10.5061/dryad.6fm1p
dataset April 2015
Whole genome sequence and manual annotation of Clostridium autoethanogenum, an industrially relevant bacterium journal December 2015
Gas Fermentation—A Flexible Platform for Commercial Scale Production of Low-Carbon-Fuels and Chemicals from Waste and Renewable Feedstocks journal May 2016
Partial replacement of fishmeal with Clostridium autoethanogenum single‐cell protein in the diet for juvenile black sea bream ( Acanthopagrus schlegelii ) journal December 2019
Development of a metabolic pathway transfer and genomic integration system for the syngas-fermenting bacterium Clostridium ljungdahlii journal May 2019
A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies journal July 2017
Draft Genome Sequence of Pyrodictium occultum PL19 T , a Marine Hyperthermophilic Species of Archaea That Grows Optimally at 105°C journal February 2016
Near-Complete Genome Sequence of Clostridium paradoxum Strain JW-YL-7 journal May 2016
Near-Complete Genome Sequence of Thalassospira sp. Strain KO164 Isolated from a Lignin-Enriched Marine Sediment Microcosm journal December 2016
Gas fermentation: cellular engineering possibilities and scale up journal April 2017
Inferring Heterozygosity from Ancient and Low Coverage Genomes journal January 2017
Synthetic Biology on Acetogenic Bacteria for Highly Efficient Conversion of C1 Gases to Biochemicals journal October 2020

