DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Terabase-Scale Coassembly of a Tropical Soil Microbiome

Journal Article · · Microbiology Spectrum
ORCiD logo [1];  [1];  [1];  [2];  [1]; ORCiD logo [1];  [1];  [3];  [1];  [1];  [2];  [3];  [4]; ORCiD logo [5];  [1];  [1]; ORCiD logo [1];
  1. Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley California, USA
  2. Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore, California, USA
  3. Applied Math and Computational Research Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
  4. Applied Math and Computational Research Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA, Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, California, USA
  5. Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore, California, USA, Life &, Environmental Sciences Department, University of California Merced, Merced, California, USA

Petabases of reads are being produced by environmental metagenome sequencing. An essential step in analyzing these data is metagenome assembly, the computational reconstruction of genome sequences from microbial communities. “Coassembly” of metagenomic sequence data, in which multiple samples are assembled together, enables more complete detection of microbial genomes in an environment than “multiassembly,” in which samples are assembled individually.

Sponsoring Organization:
USDOE
Grant/Contract Number:
AC02-05CH11231; AC05-00OR22725
OSTI ID:
1984894
Journal Information:
Microbiology Spectrum, Journal Name: Microbiology Spectrum Journal Issue: 4 Vol. 11; ISSN 2165-0497
Publisher:
American Society for MicrobiologyCopyright Statement
Country of Publication:
United States
Language:
English

References (73)

Using ggtree to Visualize Data on Tree‐Like Structures journal March 2020
Assignment of homology to genome sequences using a library of hidden Markov models that represent all proteins of known structure journal November 2001
tRNAscan-SE: Searching for tRNA Genes in Genomic Sequences book January 2019
Microbial community and antibiotic resistance genes of biofilm on pipes and their interactions in domestic hot water system journal May 2021
Redox Fluctuations Control the Coupled Cycling of Iron and Carbon in Tropical Forest Soils journal November 2018
How to apply de Bruijn graphs to genome assembly journal November 2011
Minimum information about a single amplified genome (MISAG) and a metagenome-assembled genome (MIMAG) of bacteria and archaea journal August 2017
Minimum Information about an Uncultivated Virus Genome (MIUViG) journal December 2018
Ecology and exploration of the rare biosphere journal March 2015
Function-driven single-cell genomics uncovers cellulose-degrading bacteria from the rare biosphere journal November 2019
Candidatus Eremiobacterota, a metabolically and phylogenetically diverse terrestrial phylum with acid-tolerant adaptations journal March 2021
Plant microbiomes harbor potential to promote nutrient turnover in impoverished substrates of a Brazilian biodiversity hotspot journal December 2022
Metagenomic compendium of 189,680 DNA viruses from the human gut microbiome journal June 2021
Tara Oceans: towards global ocean ecosystems biology journal May 2020
The Integrative Human Microbiome Project journal May 2019
CheckV assesses the quality and completeness of metagenome-assembled viral genomes journal December 2020
Critical Assessment of Metagenome Interpretation: the second round of challenges journal April 2022
From Louvain to Leiden: guaranteeing well-connected communities journal March 2019
Terabase-scale metagenome coassembly with MetaHipMer journal July 2020
Microbial diversity in the deep sea and the underexplored "rare biosphere" journal July 2006
Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences journal May 2006
trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses journal June 2009
Picante: R tools for integrating phylogenies and ecology journal April 2010
Infernal 1.1: 100-fold faster RNA homology searches journal September 2013
MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph journal January 2015
BFC: correcting Illumina sequencing errors journal May 2015
GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database journal November 2019
The microbial rare biosphere: current concepts, methods and ecological principles journal November 2020
BUSCO Update: Novel and Streamlined Workflows along with Broader and Deeper Phylogenetic Coverage for Scoring of Eukaryotic, Prokaryotic, and Viral Genomes journal July 2021
MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability journal January 2013
IQ-TREE: A Fast and Effective Stochastic Algorithm for Estimating Maximum-Likelihood Phylogenies journal November 2014
The IMG/M data management and analysis system v.6.0: new tools and advanced capabilities journal October 2020
Database resources of the national center for biotechnology information journal December 2021
TIGRFAMs and Genome Properties in 2013 journal November 2012
MycoCosm portal: gearing up for 1000 fungal genomes journal December 2013
Challenges in homology search: HMMER3 and convergent evolution of coiled-coil regions journal April 2013
Expanded microbial genome coverage and improved protein family annotation in the COG database journal November 2014
The Pfam protein families database: towards a more sustainable future journal December 2015
High speed BLASTN: an accelerated MegaBLAST search tool journal January 2015
Rfam 13.0: shifting to a genome-centric resource for non-coding RNA families journal November 2017
20 years of the SMART protein domain annotation resource journal October 2017
CATH: expanding the horizons of structure-based functional annotations for genome sequences journal November 2018
Vulgatibacter incomptus gen. nov., sp. nov. and Labilithrix luteola gen. nov., sp. nov., two myxobacteria isolated from soil in Yakushima Island, and the description of Vulgatibacteraceae fam. nov., Labilitrichaceae fam. nov. and Anaeromyxobacteraceae fam. nov. journal October 2014
You can move, but you can’t hide: identification of mobile genetic elements with geNomad posted_content March 2023
Adaptive seeds tame genomic sequence comparison journal January 2011
CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes journal May 2015
metaSPAdes: a new versatile metagenomic assembler journal March 2017
Modeling leaderless transcription and atypical genes results in more accurate gene prediction in prokaryotes journal May 2018
merAligner: A Fully Parallel Sequence Aligner conference May 2015
Modifying HMMER3 to Run Efficiently on the Cori Supercomputer Using OpenMP Tasking conference May 2018
Ecological generalism drives hyperdiversity of secondary metabolite gene clusters in xylarialean endophytes journal December 2021
The Fusarium graminearum Genome Reveals a Link Between Localized Polymorphism and Pathogen Specialization journal September 2007
Metagenomic Discovery of Biomass-Degrading Genes and Genomes from Cow Rumen journal January 2011
Novel High-Rank Phylogenetic Lineages within a Sulfur Spring (Zodletone Spring, Oklahoma), Revealed Using a Combined Pyrosequencing-Sanger Approach journal February 2012
Rapid Method for Coextraction of DNA and RNA from Natural Environments for Analysis of Ribosomal DNA- and rRNA-Based Microbial Community Composition journal December 2000
Draft Genome Sequence of Neurospora crassa Strain FGSC 73 journal March 2015
Draft Genome Sequence of Coniochaeta ligniaria NRRL 30616, a Lignocellulolytic Fungus for Bioabatement of Inhibitors in Plant Biomass Hydrolysates journal January 2017
Patterns in Wetland Microbial Community Composition and Functional Gene Repertoire Associated with Methane Emissions journal May 2015
Genomic Analysis of the Yet-Uncultured Binatota Reveals Broad Methylotrophic, Alkane-Degradation, and Pigment Production Capacities journal June 2021
Nonpareil 3: Fast Estimation of Metagenomic Coverage and Sequence Diversity journal April 2018
DOE JGI Metagenome Workflow journal June 2021
Graph Clustering Via a Discrete Uncoupling Process journal January 2008
Prodigal: prokaryotic gene recognition and translation initiation site identification journal March 2010
CRISPR Recognition Tool (CRT): a tool for automatic detection of clustered regularly interspaced palindromic repeats journal June 2007
Mash: fast genome and metagenome distance estimation using MinHash journal June 2016
Estimating the quality of eukaryotic genomes recovered from metagenomic analysis with EukCC journal September 2020
Members of the Candidate Phyla Radiation are functionally differentiated by carbon- and nitrogen-cycling capabilities journal September 2017
HT-SIP: a semi-automated stable isotope probing pipeline identifies cross-kingdom interactions in the hyphosphere of arbuscular mycorrhizal fungi journal November 2022
Identification of Optimum Sequencing Depth Especially for De Novo Genome Assembly of Small Genomes Using Next Generation Sequencing Data journal April 2013
Myxobacteria: Moving, Killing, Feeding, and Surviving Together journal May 2016
Metagenomic Data Assembly – The Way of Decoding Unknown Microorganisms journal March 2021
Six Heterocyclic Metabolites from the Myxobacterium Labilithrix luteola journal February 2018
MetaBAT 2: an adaptive binning algorithm for robust and efficient genome reconstruction from metagenome assemblies journal January 2019