DOE PAGES, U.S. Department of Energy
Office of Scientific and Technical Information

Title: Genome-reconstruction for eukaryotes from complex natural microbial communities

Abstract

Microbial eukaryotes are integral components of natural microbial communities, and their inclusion is critical for many ecosystem studies, yet the majority of published metagenome analyses ignore eukaryotes. In order to include eukaryotes in environmental studies, we propose a method to recover eukaryotic genomes from complex metagenomic samples. A key step for genome recovery is separation of eukaryotic and prokaryotic fragments. We developed a k-mer-based strategy, EukRep, for eukaryotic sequence identification and applied it to environmental samples to show that it enables genome recovery, genome completeness evaluation, and prediction of metabolic potential. We used this approach to test the effect of addition of organic carbon on a geyser-associated microbial community and detected a substantial change of the community metabolism, with selection against almost all candidate phyla bacteria and archaea and for eukaryotes. Near complete genomes were reconstructed for three fungi placed within the Eurotiomycetes and an arthropod. While carbon fixation and sulfur oxidation were important functions in the geyser community prior to carbon addition, the organic carbon-impacted community showed enrichment for secreted proteases, secreted lipases, cellulose targeting CAZymes, and methanol oxidation. We demonstrate the broader utility of EukRep by reconstructing and evaluating relatively high-quality fungal, protist, and rotifer genomes from complex environmental samples. This approach opens the way for cultivation-independent analyses of whole microbial communities.
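
The abstract describes EukRep only as a "k-mer-based strategy" for separating eukaryotic from prokaryotic fragments. As a rough illustration of the kind of feature such a strategy might start from, the sketch below computes a normalized k-mer frequency profile for a single contig. It is not EukRep's implementation: the function name `kmer_profile`, the default `k=5`, and the choice to skip k-mers containing ambiguous bases are all assumptions made for this example, and any downstream classifier is left out.

```python
from collections import Counter
from itertools import product


def kmer_profile(sequence, k=5):
    """Return normalized k-mer frequencies for one DNA sequence.

    Hypothetical helper for illustration only. The output is a list with
    one entry per possible k-mer over A/C/G/T, in lexicographic order.
    """
    seq = sequence.upper()
    counts = Counter()
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        # Assumption: windows containing ambiguous bases (e.g. N) are skipped.
        if set(kmer) <= set("ACGT"):
            counts[kmer] += 1

    all_kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    total = sum(counts.values())
    if total == 0:
        # No valid k-mers: return an all-zero profile of the right length.
        return [0.0] * len(all_kmers)
    return [counts[kmer] / total for kmer in all_kmers]
```

A profile like this could be passed, one row per contig, to any supervised classifier trained on sequences of known origin. Which model and which value of k suit a given dataset are not stated in this record.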

Authors:
West, Patrick T. [1]; Probst, Alexander J. [2]; Grigoriev, Igor V. [3]; Thomas, Brian C. [2]; Banfield, Jillian F. [4]
  1. Univ. of California, Berkeley, CA (United States). Department of Plant and Microbial Biology
  2. Univ. of California, Berkeley, CA (United States). Department of Earth and Planetary Science
  3. Univ. of California, Berkeley, CA (United States). Department of Plant and Microbial Biology ; USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States)
  4. Univ. of California, Berkeley, CA (United States). Department of Earth and Planetary Science and Department of Environmental Science, Policy, and Management; Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Earth Sciences Division
Publication Date:
2018-03-01
Research Org.:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Sponsoring Org.:
USDOE Office of Science (SC)
OSTI Identifier:
1477272
Grant/Contract Number:  
AC02-05CH11231
Resource Type:
Accepted Manuscript
Journal Name:
Genome Research
Additional Journal Information:
Journal Volume: 28; Journal Issue: 4; Journal ID: ISSN 1088-9051
Publisher:
Cold Spring Harbor Laboratory Press
Country of Publication:
United States
Language:
English
Subject:
59 BASIC BIOLOGICAL SCIENCES

Citation Formats

West, Patrick T., Probst, Alexander J., Grigoriev, Igor V., Thomas, Brian C., and Banfield, Jillian F. Genome-reconstruction for eukaryotes from complex natural microbial communities. United States: N. p., 2018. Web. doi:10.1101/gr.228429.117.
West, Patrick T., Probst, Alexander J., Grigoriev, Igor V., Thomas, Brian C., & Banfield, Jillian F. Genome-reconstruction for eukaryotes from complex natural microbial communities. United States. https://doi.org/10.1101/gr.228429.117
West, Patrick T., Probst, Alexander J., Grigoriev, Igor V., Thomas, Brian C., and Banfield, Jillian F. 2018. "Genome-reconstruction for eukaryotes from complex natural microbial communities". United States. https://doi.org/10.1101/gr.228429.117. https://www.osti.gov/servlets/purl/1477272.
@article{osti_1477272,
title = {Genome-reconstruction for eukaryotes from complex natural microbial communities},
author = {West, Patrick T. and Probst, Alexander J. and Grigoriev, Igor V. and Thomas, Brian C. and Banfield, Jillian F.},
abstractNote = {Microbial eukaryotes are integral components of natural microbial communities, and their inclusion is critical for many ecosystem studies, yet the majority of published metagenome analyses ignore eukaryotes. In order to include eukaryotes in environmental studies, we propose a method to recover eukaryotic genomes from complex metagenomic samples. A key step for genome recovery is separation of eukaryotic and prokaryotic fragments. We developed a k-mer-based strategy, EukRep, for eukaryotic sequence identification and applied it to environmental samples to show that it enables genome recovery, genome completeness evaluation, and prediction of metabolic potential. We used this approach to test the effect of addition of organic carbon on a geyser-associated microbial community and detected a substantial change of the community metabolism, with selection against almost all candidate phyla bacteria and archaea and for eukaryotes. Near complete genomes were reconstructed for three fungi placed within the Eurotiomycetes and an arthropod. While carbon fixation and sulfur oxidation were important functions in the geyser community prior to carbon addition, the organic carbon-impacted community showed enrichment for secreted proteases, secreted lipases, cellulose targeting CAZymes, and methanol oxidation. We demonstrate the broader utility of EukRep by reconstructing and evaluating relatively high-quality fungal, protist, and rotifer genomes from complex environmental samples. This approach opens the way for cultivation-independent analyses of whole microbial communities.},
doi = {10.1101/gr.228429.117},
journal = {Genome Research},
number = 4,
volume = 28,
place = {United States},
year = {2018},
month = mar
}

Journal Article:
Free Publicly Available Full Text
Publisher's Version of Record

Citation Metrics:
Cited by: 87 works
Citation information provided by
Web of Science

Works referenced in this record:

One Bacterial Cell, One Complete Genome
journal, April 2010


Diverse Bacteria Inhabit Living Hyphae of Phylogenetically Diverse Fungal Endophytes
journal, April 2010

  • Hoffman, M. T.; Arnold, A. E.
  • Applied and Environmental Microbiology, Vol. 76, Issue 12
  • DOI: 10.1128/AEM.02928-09

UniProt: the universal protein knowledgebase
journal, November 2016


MAKER2: an annotation pipeline and genome-database management tool for second-generation genome projects
journal, December 2011


Gene and translation initiation site prediction in metagenomic sequences
journal, July 2012


Single cell genome analysis of an uncultured heterotrophic stramenopile
journal, April 2014

  • Roy, Rajat S.; Price, Dana C.; Schliep, Alexander
  • Scientific Reports, Vol. 4, Issue 1
  • DOI: 10.1038/srep04780

Time series community genomics analysis reveals rapid shifts in bacterial species, strains, and phage during infant gut colonization
journal, August 2012

  • Sharon, I.; Morowitz, M. J.; Thomas, B. C.
  • Genome Research, Vol. 23, Issue 1
  • DOI: 10.1101/gr.142315.112

UniRef: comprehensive and non-redundant UniProt reference clusters
journal, March 2007


CBOL Protist Working Group: Barcoding Eukaryotic Richness beyond the Animal, Plant, and Fungal Kingdoms
journal, November 2012


SignalP 4.0: discriminating signal peptides from transmembrane regions
journal, September 2011

  • Petersen, Thomas Nordahl; Brunak, Søren; von Heijne, Gunnar
  • Nature Methods, Vol. 8, Issue 10
  • DOI: 10.1038/nmeth.1701

The Genome of the Foraminiferan Reticulomyxa filosa
journal, January 2014

  • Glöckner, Gernot; Hülsmann, Norbert; Schleicher, Michael
  • Current Biology, Vol. 24, Issue 1
  • DOI: 10.1016/j.cub.2013.11.027

Predicting active site residue annotations in the Pfam database
journal, August 2007


BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs
journal, June 2015


Targeted metagenomics and ecology of globally important uncultured eukaryotic phytoplankton
journal, July 2010

  • Cuvelier, M. L.; Allen, A. E.; Monier, A.
  • Proceedings of the National Academy of Sciences, Vol. 107, Issue 33
  • DOI: 10.1073/pnas.1001665107

Taxator-tk: precise taxonomic assignment of metagenomes by fast approximation of evolutionary neighborhoods
journal, November 2014


Differential depth distribution of microbial function and putative symbionts through sediment-hosted aquifers in the deep terrestrial subsurface
journal, January 2018

  • Probst, Alexander J.; Ladd, Bethany; Jarett, Jessica K.
  • Nature Microbiology, Vol. 3, Issue 3
  • DOI: 10.1038/s41564-017-0098-y

Protein structure determination using metagenome sequence data
journal, January 2017


The Carbohydrate-Active EnZymes database (CAZy): an expert resource for Glycogenomics
journal, January 2009

  • Cantarel, B. L.; Coutinho, P. M.; Rancurel, C.
  • Nucleic Acids Research, Vol. 37, Issue Database
  • DOI: 10.1093/nar/gkn663

Gene Family Evolution Reflects Adaptation to Soil Environmental Stressors in the Genome of the Collembolan Orchesella cincta
journal, June 2016

  • Faddeeva-Vakhrusheva, Anna; Derks, Martijn F. L.; Anvar, Seyed Yahya
  • Genome Biology and Evolution, Vol. 8, Issue 7
  • DOI: 10.1093/gbe/evw134

Twenty years of the MEROPS database of proteolytic enzymes, their substrates and inhibitors
journal, November 2015

  • Rawlings, Neil D.; Barrett, Alan J.; Finn, Robert
  • Nucleic Acids Research, Vol. 44, Issue D1
  • DOI: 10.1093/nar/gkv1118

dbCAN: a web resource for automated carbohydrate-active enzyme annotation
journal, May 2012

  • Yin, Yanbin; Mao, Xizeng; Yang, Jincai
  • Nucleic Acids Research, Vol. 40, Issue W1
  • DOI: 10.1093/nar/gks479

Search and clustering orders of magnitude faster than BLAST
journal, August 2010


Genome-Resolved Meta-Omics Ties Microbial Dynamics to Process Performance in Biotechnology for Thiocyanate Degradation
journal, February 2017

  • Kantor, Rose S.; Huddy, Robert J.; Iyer, Ramsunder
  • Environmental Science & Technology, Vol. 51, Issue 5
  • DOI: 10.1021/acs.est.6b04477

Kingdom-Wide Analysis of Fungal Small Secreted Proteins (SSPs) Reveals their Potential Role in Host Association
journal, February 2016


ConPADE: Genome Assembly Ploidy Estimation from Next-Generation Sequencing Data
journal, April 2015


Biology of a widespread uncultivated archaeon that contributes to carbon fixation in the subsurface
journal, November 2014

  • Probst, Alexander J.; Weinmaier, Thomas; Raymann, Kasie
  • Nature Communications, Vol. 5, Issue 1
  • DOI: 10.1038/ncomms6497

Candida parapsilosis, an Emerging Fungal Pathogen
journal, October 2008

  • Trofa, D.; Gacser, A.; Nosanchuk, J. D.
  • Clinical Microbiology Reviews, Vol. 21, Issue 4
  • DOI: 10.1128/CMR.00013-08

Determining the quality and complexity of next-generation sequencing data without a reference genome
journal, December 2014


KEGG as a reference resource for gene and protein annotation
journal, October 2015

  • Kanehisa, Minoru; Sato, Yoko; Kawashima, Masayuki
  • Nucleic Acids Research, Vol. 44, Issue D1
  • DOI: 10.1093/nar/gkv1070

Small Genomes and Sparse Metabolisms of Sediment-Associated Bacteria from Four Candidate Phyla
journal, October 2013


Broadly Sampled Multigene Analyses Yield a Well-Resolved Eukaryotic Tree of Life
journal, July 2010

  • Parfrey, Laura Wegener; Grant, Jessica; Tekle, Yonas I.
  • Systematic Biology, Vol. 59, Issue 5
  • DOI: 10.1093/sysbio/syq037

Bioreactor microbial ecosystems for thiocyanate and cyanide degradation unravelled with genome-resolved metagenomics: Metagenomics of thiocyanate/cyanide biodegradation
journal, July 2015

  • Kantor, Rose S.; van Zyl, A. Wynand; van Hille, Robert P.
  • Environmental Microbiology, Vol. 17, Issue 12
  • DOI: 10.1111/1462-2920.12936

Large-scale machine learning for metagenomics sequence classification
journal, November 2015


RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies
journal, January 2014


IDBA-UD: a de novo assembler for single-cell and metagenomic sequencing data with highly uneven depth
journal, April 2012


Rfam 12.0: updates to the RNA families database
journal, November 2014

  • Nawrocki, Eric P.; Burge, Sarah W.; Bateman, Alex
  • Nucleic Acids Research, Vol. 43, Issue D1
  • DOI: 10.1093/nar/gku1063

The Chlamydomonas Genome Reveals the Evolution of Key Animal and Plant Functions
journal, October 2007

  • Merchant, S. S.; Prochnik, S. E.; Vallon, O.
  • Science, Vol. 318, Issue 5848, p. 245-250
  • DOI: 10.1126/science.1143609

The Paleozoic Origin of Enzymatic Lignin Decomposition Reconstructed from 31 Fungal Genomes
journal, June 2012


Single-Cell Genomics Reveals Organismal Interactions in Uncultivated Marine Protists
journal, May 2011


Binning metagenomic contigs by coverage and composition
journal, September 2014

  • Alneberg, Johannes; Bjarnason, Brynjar Smári; de Bruijn, Ino
  • Nature Methods, Vol. 11, Issue 11
  • DOI: 10.1038/nmeth.3103

Measurement of bacterial replication rates in microbial communities
journal, November 2016

  • Brown, Christopher T.; Olm, Matthew R.; Thomas, Brian C.
  • Nature Biotechnology, Vol. 34, Issue 12
  • DOI: 10.1038/nbt.3704

Gut bacteria are rarely shared by co-hospitalized premature infants, regardless of necrotizing enterocolitis development
journal, March 2015


Evidence for persistent and shared bacterial strains against a background of largely unique gut colonization in hospitalized premature infants
journal, June 2016

  • Raveh-Sadka, Tali; Firek, Brian; Sharon, Itai
  • The ISME Journal, Vol. 10, Issue 12
  • DOI: 10.1038/ismej.2016.83

Creating the CIPRES Science Gateway for inference of large phylogenetic trees
conference, November 2010

  • Miller, Mark A.; Pfeiffer, Wayne; Schwartz, Terri
  • 2010 Gateway Computing Environments Workshop (GCE)
  • DOI: 10.1109/GCE.2010.5676129

A new view of the tree of life
journal, April 2016


The genome sequence of the filamentous fungus Neurospora crassa
journal, April 2003

  • Galagan, James E.; Calvo, Sarah E.; Borkovich, Katherine A.
  • Nature, Vol. 422, Issue 6934
  • DOI: 10.1038/nature01554

Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes
journal, January 2001

  • Krogh, Anders; Larsson, Björn; von Heijne, Gunnar
  • Journal of Molecular Biology, Vol. 305, Issue 3
  • DOI: 10.1006/jmbi.2000.4315

Accessing the genomic information of unculturable oceanic picoeukaryotes by combining multiple single cells
journal, January 2017

  • Mangot, Jean-François; Logares, Ramiro; Sánchez, Pablo
  • Scientific Reports, Vol. 7, Issue 1
  • DOI: 10.1038/srep41498

dRep: a tool for fast and accurate genomic comparisons that enables improved genome recovery from metagenomes through de-replication
journal, July 2017

  • Olm, Matthew R.; Brown, Christopher T.; Brooks, Brandon
  • The ISME Journal, Vol. 11, Issue 12
  • DOI: 10.1038/ismej.2017.126

The Lipase Engineering Database: a navigation and analysis tool for protein families
journal, January 2003


Gene finding in novel genomes
journal, May 2004


Metagenomes of the Picoalga Bathycoccus from the Chile Coastal Upwelling
journal, June 2012


Diverse Lifestyles and Strategies of Plant Pathogenesis Encoded in the Genomes of Eighteen Dothideomycetes Fungi
journal, December 2012


MUSCLE: multiple sequence alignment with high accuracy and high throughput
journal, March 2004

  • Edgar, R. C.
  • Nucleic Acids Research, Vol. 32, Issue 5, p. 1792-1797
  • DOI: 10.1093/nar/gkh340

Thousands of microbial genomes shed light on interconnected biogeochemical processes in an aquifer system
journal, October 2016

  • Anantharaman, Karthik; Brown, Christopher T.; Hug, Laura A.
  • Nature Communications, Vol. 7, Issue 1
  • DOI: 10.1038/ncomms13219

Gene prediction in novel fungal genomes using an ab initio algorithm with unsupervised training
journal, October 2008

  • Ter-Hovhannisyan, V.; Lomsadze, A.; Chernoff, Y. O.
  • Genome Research, Vol. 18, Issue 12
  • DOI: 10.1101/gr.081612.108

Profile hidden Markov models
journal, October 1998


More Genes or More Taxa? The Relative Contribution of Gene Number and Taxon Number to Phylogenetic Accuracy
journal, March 2005

  • Rokas, Antonis; Carroll, Sean B.
  • Molecular Biology and Evolution, Vol. 22, Issue 5
  • DOI: 10.1093/molbev/msi121

Molecular profiling of microbial community structure and their CAZymes via metagenomics, from Tsomgo lake in the Eastern Himalayas
journal, April 2021


Comparative mitochondrial genome analysis reveals intron dynamics and gene rearrangements in two Trametes species
journal, January 2021


Extra virgin olive oil improved body weight and insulin sensitivity in high fat diet-induced obese LDLr−/−.Leiden mice without attenuation of steatohepatitis
journal, April 2021

  • Álvarez-Amor, Leticia; Sierra, Amparo Luque; Cárdenas, Antonio
  • Scientific Reports, Vol. 11, Issue 1
  • DOI: 10.1038/s41598-021-87761-3

UniProt: the Universal Protein knowledgebase
journal, January 2004


Database resources of the National Center for Biotechnology Information
journal, January 2006


Database resources of the National Center for Biotechnology Information
journal, October 2020

  • Sayers, Eric W.; Beck, Jeffrey; Bolton, Evan E.
  • Nucleic Acids Research, Vol. 49, Issue D1
  • DOI: 10.1093/nar/gkaa892

Database resources of the National Center for Biotechnology Information
journal, December 2007

  • Wheeler, D. L.; Barrett, T.; Benson, D. A.
  • Nucleic Acids Research, Vol. 36, Issue Database
  • DOI: 10.1093/nar/gkm1000

UniProt: the universal protein knowledgebase
journal, February 2018

  • UniProt Consortium, The
  • Nucleic Acids Research, Vol. 46, Issue 5
  • DOI: 10.1093/nar/gky092

Database resources of the National Center for Biotechnology Information
journal, November 2018

  • Sayers, Eric W.; Agarwala, Richa; Bolton, Evan E.
  • Nucleic Acids Research, Vol. 47, Issue D1
  • DOI: 10.1093/nar/gky1069

The Lipase Engineering Database – a navigation and analysis tool for protein families
collection, January 2003


Works referencing / citing this record:

Fungi in aquatic ecosystems
journal, March 2019

  • Grossart, Hans-Peter; Van den Wyngaert, Silke; Kagami, Maiko
  • Nature Reviews Microbiology, Vol. 17, Issue 6
  • DOI: 10.1038/s41579-019-0175-8

Building de novo reference genome assemblies of complex eukaryotic microorganisms from single nuclei
journal, January 2020

  • Montoliu-Nerin, Merce; Sánchez-García, Marisol; Bergin, Claudia
  • Scientific Reports, Vol. 10, Issue 1
  • DOI: 10.1038/s41598-020-58025-3

Combining morphology, behaviour and genomics to understand the evolution and ecology of microbial eukaryotes
journal, October 2019

  • Keeling, Patrick J.
  • Philosophical Transactions of the Royal Society B: Biological Sciences, Vol. 374, Issue 1786
  • DOI: 10.1098/rstb.2019.0085

Metabolic Capability and Phylogenetic Diversity of Mono Lake during a Bloom of the Eukaryotic Phototroph Picocystis sp. Strain ML
journal, August 2018

  • Stamps, Blake W.; Nunn, Heather S.; Petryshyn, Victoria A.
  • Applied and Environmental Microbiology, Vol. 84, Issue 21
  • DOI: 10.1128/aem.01171-18

Genome-resolved metagenomics of eukaryotic populations during early colonization of premature infants and in hospital rooms
journal, February 2019


Lipid analysis of CO2-rich subsurface aquifers suggests an autotrophy-based deep biosphere with lysolipids enriched in CPR bacteria
journal, March 2020

  • Probst, Alexander J.; Elling, Felix J.; Castelle, Cindy J.
  • The ISME Journal, Vol. 14, Issue 6
  • DOI: 10.1038/s41396-020-0624-4

Accurate and complete genomes from metagenomes
journal, March 2020

  • Chen, Lin-Xing; Anantharaman, Karthik; Shaiber, Alon
  • Genome Research, Vol. 30, Issue 3
  • DOI: 10.1101/gr.258640.119

MetaEuk—sensitive, high-throughput gene discovery, and annotation for large-scale eukaryotic metagenomics
journal, April 2020

