OSTI.GOV: U.S. Department of Energy
Office of Scientific and Technical Information

Title: Genome-reconstruction for eukaryotes from complex natural microbial communities

Abstract

Microbial eukaryotes are integral components of natural microbial communities, and their inclusion is critical for many ecosystem studies, yet the majority of published metagenome analyses ignore eukaryotes. In order to include eukaryotes in environmental studies, we propose a method to recover eukaryotic genomes from complex metagenomic samples. A key step for genome recovery is separation of eukaryotic and prokaryotic fragments. We developed a k-mer-based strategy, EukRep, for eukaryotic sequence identification and applied it to environmental samples to show that it enables genome recovery, genome completeness evaluation, and prediction of metabolic potential. We used this approach to test the effect of addition of organic carbon on a geyser-associated microbial community and detected a substantial change of the community metabolism, with selection against almost all candidate phyla bacteria and archaea and for eukaryotes. Near complete genomes were reconstructed for three fungi placed within the Eurotiomycetes and an arthropod. While carbon fixation and sulfur oxidation were important functions in the geyser community prior to carbon addition, the organic carbon-impacted community showed enrichment for secreted proteases, secreted lipases, cellulose targeting CAZymes, and methanol oxidation. We demonstrate the broader utility of EukRep by reconstructing and evaluating relatively high-quality fungal, protist, and rotifer genomes from complex environmental samples. This approach opens the way for cultivation-independent analyses of whole microbial communities.
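
The abstract describes EukRep only as a "k-mer-based strategy" for telling eukaryotic from prokaryotic fragments. As a rough illustration of what k-mer-based features can look like, the sketch below computes a normalized k-mer frequency profile for a single DNA fragment. The function name `kmer_profile`, the default `k=5`, and the handling of ambiguous bases are assumptions made for this example. They are not taken from this record, and the sketch is not EukRep's implementation. In a full pipeline, profiles like this would typically be passed to a trained classifier, which is not shown here.

```python
from collections import Counter
from itertools import product

DNA_ALPHABET = set("ACGT")


def kmer_profile(seq, k=5):
    """Return normalized k-mer frequencies for a DNA sequence.

    Hypothetical helper for illustration only. The result maps every
    possible k-mer over A/C/G/T to its relative frequency in `seq`.
    Windows containing other characters (for example N) are skipped.
    """
    seq = seq.upper()
    counts = Counter()
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        if set(kmer) <= DNA_ALPHABET:
            counts[kmer] += 1
    total = sum(counts.values())
    profile = {}
    for letters in product("ACGT", repeat=k):
        kmer = "".join(letters)
        # Sequences with no valid k-mer get an all-zero profile.
        profile[kmer] = counts[kmer] / total if total else 0.0
    return profile


# Example: a dinucleotide profile always has 4**2 = 16 entries.
example = kmer_profile("ACGTACGT", k=2)
print(len(example))
```

A fixed-length vector per fragment is the main design point: every fragment, whatever its length, maps to the same 4^k features, so fragments can be compared or classified on a common footing.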

Authors:
 West, Patrick T. [1]; Probst, Alexander J. [2]; Grigoriev, Igor V. [3]; Thomas, Brian C. [2]; Banfield, Jillian F. [4]
  1. Univ. of California, Berkeley, CA (United States). Department of Plant and Microbial Biology
  2. Univ. of California, Berkeley, CA (United States). Department of Earth and Planetary Science
  3. Univ. of California, Berkeley, CA (United States). Department of Plant and Microbial Biology; USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States)
  4. Univ. of California, Berkeley, CA (United States). Department of Earth and Planetary Science and Department of Environmental Science, Policy, and Management; Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Earth Sciences Division
Publication Date:
2018-03-01
Research Org.:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Sponsoring Org.:
USDOE Office of Science (SC)
OSTI Identifier:
1477272
Grant/Contract Number:  
AC02-05CH11231
Resource Type:
Journal Article: Accepted Manuscript
Journal Name:
Genome Research
Additional Journal Information:
Journal Volume: 28; Journal Issue: 4; Journal ID: ISSN 1088-9051
Publisher:
Cold Spring Harbor Laboratory Press
Country of Publication:
United States
Language:
English
Subject:
59 BASIC BIOLOGICAL SCIENCES

Citation Formats

West, Patrick T., Probst, Alexander J., Grigoriev, Igor V., Thomas, Brian C., and Banfield, Jillian F. Genome-reconstruction for eukaryotes from complex natural microbial communities. United States: N. p., 2018. Web. doi:10.1101/gr.228429.117.
West, Patrick T., Probst, Alexander J., Grigoriev, Igor V., Thomas, Brian C., & Banfield, Jillian F. Genome-reconstruction for eukaryotes from complex natural microbial communities. United States. https://doi.org/10.1101/gr.228429.117
West, Patrick T., Probst, Alexander J., Grigoriev, Igor V., Thomas, Brian C., and Banfield, Jillian F. 2018. "Genome-reconstruction for eukaryotes from complex natural microbial communities". United States. https://doi.org/10.1101/gr.228429.117. https://www.osti.gov/servlets/purl/1477272.
@article{osti_1477272,
title = {Genome-reconstruction for eukaryotes from complex natural microbial communities},
author = {West, Patrick T. and Probst, Alexander J. and Grigoriev, Igor V. and Thomas, Brian C. and Banfield, Jillian F.},
abstractNote = {Microbial eukaryotes are integral components of natural microbial communities, and their inclusion is critical for many ecosystem studies, yet the majority of published metagenome analyses ignore eukaryotes. In order to include eukaryotes in environmental studies, we propose a method to recover eukaryotic genomes from complex metagenomic samples. A key step for genome recovery is separation of eukaryotic and prokaryotic fragments. We developed a k-mer-based strategy, EukRep, for eukaryotic sequence identification and applied it to environmental samples to show that it enables genome recovery, genome completeness evaluation, and prediction of metabolic potential. We used this approach to test the effect of addition of organic carbon on a geyser-associated microbial community and detected a substantial change of the community metabolism, with selection against almost all candidate phyla bacteria and archaea and for eukaryotes. Near complete genomes were reconstructed for three fungi placed within the Eurotiomycetes and an arthropod. While carbon fixation and sulfur oxidation were important functions in the geyser community prior to carbon addition, the organic carbon-impacted community showed enrichment for secreted proteases, secreted lipases, cellulose targeting CAZymes, and methanol oxidation. We demonstrate the broader utility of EukRep by reconstructing and evaluating relatively high-quality fungal, protist, and rotifer genomes from complex environmental samples. This approach opens the way for cultivation-independent analyses of whole microbial communities.},
doi = {10.1101/gr.228429.117},
url = {https://www.osti.gov/biblio/1477272},
journal = {Genome Research},
issn = {1088-9051},
number = 4,
volume = 28,
place = {United States},
year = {2018},
month = {3}
}

Journal Article:
Free Publicly Available Full Text
Publisher's Version of Record

Citation Metrics:
Cited by: 72 works
Citation information provided by
Web of Science

Works referencing / citing this record:

Fungi in aquatic ecosystems
journal, March 2019


Building de novo reference genome assemblies of complex eukaryotic microorganisms from single nuclei
journal, January 2020


Combining morphology, behaviour and genomics to understand the evolution and ecology of microbial eukaryotes
journal, October 2019


Metabolic Capability and Phylogenetic Diversity of Mono Lake during a Bloom of the Eukaryotic Phototroph Picocystis sp. Strain ML
journal, August 2018


Lipid analysis of CO2-rich subsurface aquifers suggests an autotrophy-based deep biosphere with lysolipids enriched in CPR bacteria
journal, March 2020


Accurate and complete genomes from metagenomes
journal, March 2020


MetaEuk—sensitive, high-throughput gene discovery, and annotation for large-scale eukaryotic metagenomics
journal, April 2020


Metagenome-Assembled Genome Sequences of Three Uncultured Planktomarina sp. Strains from the Northeast Atlantic Ocean
journal, March 2020