PanFP: Pangenome-based functional profiles for microbial communities
Abstract
For decades there has been increasing interest in understanding the relationships between microbial communities and ecosystem functions. Current DNA sequencing technologies allows for the exploration of microbial communities in two principle ways: targeted rRNA gene surveys and shotgun metagenomics. For large study designs, it is often still prohibitively expensive to sequence metagenomes at both the breadth and depth necessary to statistically capture the true functional diversity of a community. Although rRNA gene surveys provide no direct evidence of function, they do provide a reasonable estimation of microbial diversity, while being a very cost effective way to screen samples of interest for later shotgun metagenomic analyses. However, there is a great deal of 16S rRNA gene survey data currently available from diverse environments, and thus a need for tools to infer functional composition of environmental samples based on 16S rRNA gene survey data. As a result, we present a computational method called pangenome based functional profiles (PanFP), which infers functional profiles of microbial communities from 16S rRNA gene survey data for Bacteria and Archaea. PanFP is based on pangenome reconstruction of a 16S rRNA gene operational taxonomic unit (OTU) from known genes and genomes pooled from the OTU s taxonomic lineage.more »
- Authors:
-
- Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States); Univ. of Tennessee, Knoxville, TN (United States)
- Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)
- Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States); Colorado State Univ., Fort Collins, CO (United States)
- Publication Date:
- Research Org.:
- Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
- Sponsoring Org.:
- USDOE Office of Science (SC)
- OSTI Identifier:
- 1256793
- Grant/Contract Number:
- AC05-00OR22725
- Resource Type:
- Accepted Manuscript
- Journal Name:
- BMC Research Notes
- Additional Journal Information:
- Journal Volume: 8; Journal Issue: 1; Journal ID: ISSN 1756-0500
- Publisher:
- BioMed Central
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 59 BASIC BIOLOGICAL SCIENCES; microbial communities; metagenome; 16S rRNA survey; pangenome
Citation Formats
Jun, Se -Ran, Hauser, Loren John, Schadt, Christopher Warren, Gorin, Andrey A., and Robeson, Michael S. PanFP: Pangenome-based functional profiles for microbial communities. United States: N. p., 2015.
Web. doi:10.1186/s13104-015-1462-8.
Jun, Se -Ran, Hauser, Loren John, Schadt, Christopher Warren, Gorin, Andrey A., & Robeson, Michael S. PanFP: Pangenome-based functional profiles for microbial communities. United States. https://doi.org/10.1186/s13104-015-1462-8
Jun, Se -Ran, Hauser, Loren John, Schadt, Christopher Warren, Gorin, Andrey A., and Robeson, Michael S. Sat .
"PanFP: Pangenome-based functional profiles for microbial communities". United States. https://doi.org/10.1186/s13104-015-1462-8. https://www.osti.gov/servlets/purl/1256793.
@article{osti_1256793,
title = {PanFP: Pangenome-based functional profiles for microbial communities},
author = {Jun, Se -Ran and Hauser, Loren John and Schadt, Christopher Warren and Gorin, Andrey A. and Robeson, Michael S.},
abstractNote = {For decades there has been increasing interest in understanding the relationships between microbial communities and ecosystem functions. Current DNA sequencing technologies allows for the exploration of microbial communities in two principle ways: targeted rRNA gene surveys and shotgun metagenomics. For large study designs, it is often still prohibitively expensive to sequence metagenomes at both the breadth and depth necessary to statistically capture the true functional diversity of a community. Although rRNA gene surveys provide no direct evidence of function, they do provide a reasonable estimation of microbial diversity, while being a very cost effective way to screen samples of interest for later shotgun metagenomic analyses. However, there is a great deal of 16S rRNA gene survey data currently available from diverse environments, and thus a need for tools to infer functional composition of environmental samples based on 16S rRNA gene survey data. As a result, we present a computational method called pangenome based functional profiles (PanFP), which infers functional profiles of microbial communities from 16S rRNA gene survey data for Bacteria and Archaea. PanFP is based on pangenome reconstruction of a 16S rRNA gene operational taxonomic unit (OTU) from known genes and genomes pooled from the OTU s taxonomic lineage. From this lineage, we derive an OTU functional profile by weighting a pangenome s functional profile with the OTUs abundance observed in a given sample. We validated our method by comparing PanFP to the functional profiles obtained from the direct shotgun metagenomic measurement of 65 diverse communities via Spearman correlation coefficients. These correlations improved with increasing sequencing depth, within the range of 0.8 0.9 for the most deeply sequenced Human Microbiome Project mock community samples. PanFP is very similar in performance to another recently released tool, PICRUSt, for almost all of survey data analysed here. But, our method is unique in that any OTU building method can be used, as opposed to being limited to closed reference OTU picking strategies against specific reference sequence databases. In conclusion, we developed an automated computational method, which derives an inferred functional profile based on the 16S rRNA gene surveys of microbial communities. The inferred functional profile provides a cost effective way to study complex ecosystems through predicted comparative functional metagenomes and metadata analysis. All PanFP source code and additional documentation are freely available online at GitHub.},
doi = {10.1186/s13104-015-1462-8},
journal = {BMC Research Notes},
number = 1,
volume = 8,
place = {United States},
year = {Sat Sep 26 00:00:00 EDT 2015},
month = {Sat Sep 26 00:00:00 EDT 2015}
}
Figures / Tables:
Works referenced in this record:
How much metagenomic sequencing is enough to achieve a given goal?
journal, June 2013
- Ni, Jiajia; Yan, Qingyun; Yu, Yuhe
- Scientific Reports, Vol. 3, Issue 1
A Semi-Quantitative, Synteny-Based Method to Improve Functional Predictions for Hypothetical and Poorly Annotated Bacterial and Archaeal Genes
journal, October 2011
- Yelton, Alexis P.; Thomas, Brian C.; Simmons, Sheri L.
- PLoS Computational Biology, Vol. 7, Issue 10
Computational meta'omics for microbial community studies
journal, January 2013
- Segata, Nicola; Boernigen, Daniela; Tickle, Timothy L.
- Molecular Systems Biology, Vol. 9, Issue 1
Metabolic Reconstruction for Metagenomic Data and Its Application to the Human Microbiome
journal, June 2012
- Abubucker, Sahar; Segata, Nicola; Goll, Johannes
- PLoS Computational Biology, Vol. 8, Issue 6
The TIGRFAMs database of protein families
journal, January 2003
- Haft, D. H.
- Nucleic Acids Research, Vol. 31, Issue 1
Analysis, Optimization and Verification of Illumina-Generated 16S rRNA Gene Amplicon Surveys
journal, April 2014
- Nelson, Michael C.; Morrison, Hilary G.; Benjamino, Jacquelynn
- PLoS ONE, Vol. 9, Issue 4
Predictive functional profiling of microbial communities using 16S rRNA marker gene sequences
journal, August 2013
- Langille, Morgan G. I.; Zaneveld, Jesse; Caporaso, J. Gregory
- Nature Biotechnology, Vol. 31, Issue 9
An improved Greengenes taxonomy with explicit ranks for ecological and evolutionary analyses of bacteria and archaea
journal, December 2011
- McDonald, Daniel; Price, Morgan N.; Goodrich, Julia
- The ISME Journal, Vol. 6, Issue 3
UniProt Knowledgebase: a hub of integrated protein data
journal, January 2011
- Magrane, M.; Consortium, U.
- Database, Vol. 2011, Issue 0
Comparative Metagenomics of Toxic Freshwater Cyanobacteria Bloom Communities on Two Continents
journal, August 2012
- Steffen, Morgan M.; Li, Zhou; Effler, T. Chad
- PLoS ONE, Vol. 7, Issue 8
Prokaryotic taxonomy in the sequencing era - the polyphasic approach revisited: Prokaryotic taxonomy in the sequencing era
journal, October 2011
- Kämpfer, Peter; Glaeser, Stefanie P.
- Environmental Microbiology, Vol. 14, Issue 2
QIIME allows analysis of high-throughput community sequencing data
journal, April 2010
- Caporaso, J. Gregory; Kuczynski, Justin; Stombaugh, Jesse
- Nature Methods, Vol. 7, Issue 5
Metagenomics - a guide from sampling to data analysis
journal, February 2012
- Thomas, Torsten; Gilbert, Jack; Meyer, Folker
- Microbial Informatics and Experimentation, Vol. 2, Issue 1
Gene Ontology: tool for the unification of biology
journal, May 2000
- Ashburner, Michael; Ball, Catherine A.; Blake, Judith A.
- Nature Genetics, Vol. 25, Issue 1
KEGG for integration and interpretation of large-scale molecular data sets
journal, November 2011
- Kanehisa, M.; Goto, S.; Sato, Y.
- Nucleic Acids Research, Vol. 40, Issue D1
The Pfam protein families database
journal, November 2011
- Punta, M.; Coggill, P. C.; Eberhardt, R. Y.
- Nucleic Acids Research, Vol. 40, Issue D1
IMG: the integrated microbial genomes database and comparative analysis system
journal, December 2011
- Markowitz, V. M.; Chen, I. -M. A.; Palaniappan, K.
- Nucleic Acids Research, Vol. 40, Issue D1
Chapter 12: Human Microbiome Analysis
journal, December 2012
- Morgan, Xochitl C.; Huttenhower, Curtis
- PLoS Computational Biology, Vol. 8, Issue 12
Search and clustering orders of magnitude faster than BLAST
journal, August 2010
- Edgar, Robert C.
- Bioinformatics, Vol. 26, Issue 19, p. 2460-2461
A Primer on Metagenomics
journal, February 2010
- Wooley, John C.; Godzik, Adam; Friedberg, Iddo
- PLoS Computational Biology, Vol. 6, Issue 2
Trends and barriers to lateral gene transfer in prokaryotes
journal, October 2011
- Popa, Ovidiu; Dagan, Tal
- Current Opinion in Microbiology, Vol. 14, Issue 5
Diversity of 16S rRNA Genes within Individual Prokaryotic Genomes
journal, August 2010
- Pei, Anna Y.; Oberdorf, William E.; Nossa, Carlos W.
- Applied and Environmental Microbiology, Vol. 76, Issue 15
Diversity of 16S rRNA Genes within Individual Prokaryotic Genomes
journal, April 2010
- Pei, A. Y.; Oberdorf, W. E.; Nossa, C. W.
- Applied and Environmental Microbiology, Vol. 76, Issue 12
Hidden state prediction: a modification of classic ancestral state reconstruction algorithms helps unravel complex symbioses
journal, August 2014
- Zaneveld, Jesse R. R.; Thurber, Rebecca L. V.
- Frontiers in Microbiology, Vol. 5
Works referencing / citing this record:
The nasal and gut microbiome in Parkinson's disease and idiopathic rapid eye movement sleep behavior disorder: Nose and Gut Microbiome in PD and iRBD
journal, August 2017
- Heintz-Buschart, Anna; Pandey, Urvashi; Wicke, Tamara
- Movement Disorders, Vol. 33, Issue 1
Birth mode is associated with earliest strain-conferred gut microbiome functions and immunostimulatory potential
journal, November 2018
- Wampach, Linda; Heintz-Buschart, Anna; Fritz, Joëlle V.
- Nature Communications, Vol. 9, Issue 1
Best practices for analysing microbiomes
journal, May 2018
- Knight, Rob; Vrbanac, Alison; Taylor, Bryn C.
- Nature Reviews Microbiology, Vol. 16, Issue 7
Generating amplicon reads for microbial community assessment with next‐generation sequencing
journal, August 2019
- Gołębiewski, M.; Tretyn, A.
- Journal of Applied Microbiology, Vol. 128, Issue 2
Sample storage conditions induce post-collection biases in microbiome profiles
journal, December 2018
- Jenkins, Samir V.; Vang, Kieng B.; Gies, Allen
- BMC Microbiology, Vol. 18, Issue 1
Addressing Global Ruminant Agricultural Challenges Through Understanding the Rumen Microbiome: Past, Present, and Future
journal, September 2018
- Huws, Sharon A.; Creevey, Christopher J.; Oyama, Linda B.
- Frontiers in Microbiology, Vol. 9
Bioinformatics for Marine Products: An Overview of Resources, Bottlenecks, and Perspectives
journal, October 2019
- Ambrosino, Luca; Tangherlini, Michael; Colantuono, Chiara
- Marine Drugs, Vol. 17, Issue 10
Interest of bacterial pangenome analyses in clinical microbiology
journal, December 2020
- Anani, Hussein; Zgheib, Rita; Hasni, Issam
- Microbial Pathogenesis, Vol. 149
Birth mode is associated with earliest strain-conferred gut microbiome functions and immunostimulatory potential
journal, November 2018
- Wampach, Linda; Heintz-Buschart, Anna; Fritz, Joëlle V.
- Nature Communications, Vol. 9, Issue 1
Microbial communities of aquatic environments on Heard Island characterized by pyrotag sequencing and environmental data
journal, March 2017
- Allen, Michelle A.; Cavicchioli, Ricardo
- Scientific Reports, Vol. 7, Issue 1
Sample storage conditions induce post-collection biases in microbiome profiles
journal, December 2018
- Jenkins, Samir V.; Vang, Kieng B.; Gies, Allen
- BMC Microbiology, Vol. 18, Issue 1
Metagenomic Approaches to Investigate the Contribution of the Vineyard Environment to the Quality of Wine Fermentation: Potentials and Difficulties
journal, May 2018
- Stefanini, Irene; Cavalieri, Duccio
- Frontiers in Microbiology, Vol. 9