skip to main content
DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: A Novel Sparse Compositional Technique Reveals Microbial Perturbations

Abstract

ABSTRACT The central aims of many host or environmental microbiome studies are to elucidate factors associated with microbial community compositions and to relate microbial features to outcomes. However, these aims are often complicated by difficulties stemming from high-dimensionality, non-normality, sparsity, and the compositional nature of microbiome data sets. A key tool in microbiome analysis is beta diversity, defined by the distances between microbial samples. Many different distance metrics have been proposed, all with varying discriminatory power on data with differing characteristics. Here, we propose a compositional beta diversity metric rooted in a centered log-ratio transformation and matrix completion called robust Aitchison PCA. We demonstrate the benefits of compositional transformations upstream of beta diversity calculations through simulations. Additionally, we demonstrate improved effect size, classification accuracy, and robustness to sequencing depth over the current methods on several decreased sample subsets of real microbiome data sets. Finally, we highlight the ability of this new beta diversity metric to retain the feature loadings linked to sample ordinations revealing salient intercommunity niche feature importance. IMPORTANCE By accounting for the sparse compositional nature of microbiome data sets, robust Aitchison PCA can yield high discriminatory power and salient feature ranking between microbial niches. The software to performmore » this analysis is available under an open-source license and can be obtained at https://github.com/biocore/DEICODE ; additionally, a QIIME 2 plugin is provided to perform this analysis at https://library.qiime2.org/plugins/q2-deicode .« less

Authors:
; ; ; ORCiD logo; ; ; ;
Publication Date:
Research Org.:
Univ. of California, San Diego, CA (United States); Johns Hopkins Univ., Baltimore, MD (United States)
Sponsoring Org.:
USDOE Office of Science (SC)
OSTI Identifier:
1494211
Alternate Identifier(s):
OSTI ID: 1611890
Grant/Contract Number:  
SC0012658; SC0012586
Resource Type:
Published Article
Journal Name:
mSystems
Additional Journal Information:
Journal Name: mSystems Journal Volume: 4 Journal Issue: 1; Journal ID: ISSN 2379-5077
Publisher:
American Society for Microbiology
Country of Publication:
United States
Language:
English
Subject:
59 BASIC BIOLOGICAL SCIENCES; Microbiology; compositional; computational biology; matrix completion; microbiome; metagenomics

Citation Formats

Martino, Cameron, Morton, James T., Marotz, Clarisse A., Thompson, Luke R., Tripathi, Anupriya, Knight, Rob, Zengler, Karsten, and Neufeld, ed., Josh D. A Novel Sparse Compositional Technique Reveals Microbial Perturbations. United States: N. p., 2019. Web. https://doi.org/10.1128/mSystems.00016-19.
Martino, Cameron, Morton, James T., Marotz, Clarisse A., Thompson, Luke R., Tripathi, Anupriya, Knight, Rob, Zengler, Karsten, & Neufeld, ed., Josh D. A Novel Sparse Compositional Technique Reveals Microbial Perturbations. United States. https://doi.org/10.1128/mSystems.00016-19
Martino, Cameron, Morton, James T., Marotz, Clarisse A., Thompson, Luke R., Tripathi, Anupriya, Knight, Rob, Zengler, Karsten, and Neufeld, ed., Josh D. Tue . "A Novel Sparse Compositional Technique Reveals Microbial Perturbations". United States. https://doi.org/10.1128/mSystems.00016-19.
@article{osti_1494211,
title = {A Novel Sparse Compositional Technique Reveals Microbial Perturbations},
author = {Martino, Cameron and Morton, James T. and Marotz, Clarisse A. and Thompson, Luke R. and Tripathi, Anupriya and Knight, Rob and Zengler, Karsten and Neufeld, ed., Josh D.},
abstractNote = {ABSTRACT The central aims of many host or environmental microbiome studies are to elucidate factors associated with microbial community compositions and to relate microbial features to outcomes. However, these aims are often complicated by difficulties stemming from high-dimensionality, non-normality, sparsity, and the compositional nature of microbiome data sets. A key tool in microbiome analysis is beta diversity, defined by the distances between microbial samples. Many different distance metrics have been proposed, all with varying discriminatory power on data with differing characteristics. Here, we propose a compositional beta diversity metric rooted in a centered log-ratio transformation and matrix completion called robust Aitchison PCA. We demonstrate the benefits of compositional transformations upstream of beta diversity calculations through simulations. Additionally, we demonstrate improved effect size, classification accuracy, and robustness to sequencing depth over the current methods on several decreased sample subsets of real microbiome data sets. Finally, we highlight the ability of this new beta diversity metric to retain the feature loadings linked to sample ordinations revealing salient intercommunity niche feature importance. IMPORTANCE By accounting for the sparse compositional nature of microbiome data sets, robust Aitchison PCA can yield high discriminatory power and salient feature ranking between microbial niches. The software to perform this analysis is available under an open-source license and can be obtained at https://github.com/biocore/DEICODE ; additionally, a QIIME 2 plugin is provided to perform this analysis at https://library.qiime2.org/plugins/q2-deicode .},
doi = {10.1128/mSystems.00016-19},
journal = {mSystems},
number = 1,
volume = 4,
place = {United States},
year = {2019},
month = {2}
}

Journal Article:
Free Publicly Available Full Text
Publisher's Version of Record
https://doi.org/10.1128/mSystems.00016-19

Citation Metrics:
Cited by: 8 works
Citation information provided by
Web of Science

Save / Share:

Works referenced in this record:

Regularization for matrix completion
conference, June 2010

  • Keshavan, Raghunandan H.; Montanari, Andrea
  • 2010 IEEE International Symposium on Information Theory - ISIT
  • DOI: 10.1109/ISIT.2010.5513563

Succession of microbial consortia in the developing infant gut microbiome
journal, July 2010

  • Koenig, J. E.; Spor, A.; Scalfone, N.
  • Proceedings of the National Academy of Sciences, Vol. 108, Issue Supplement_1
  • DOI: 10.1073/pnas.1000081107

Intermittent Hypoxia and Hypercapnia, a Hallmark of Obstructive Sleep Apnea, Alters the Gut Microbiome and Metabolome
journal, June 2018


Erratum: Corrigendum: SparRec: An effective matrix completion framework of missing data imputation for GWAS
journal, November 2016

  • Jiang, Bo; Ma, Shiqian; Causey, Jason
  • Scientific Reports, Vol. 6, Issue 1
  • DOI: 10.1038/srep37365

Cryptic diversity of the symbiotic cyanobacterium Synechococcus spongiarum among sponge hosts
journal, June 2008


Balances: a New Perspective for Microbiome Analysis
journal, July 2018


Intestinal Colonization by a Lachnospiraceae Bacterium Contributes to the Development of Diabetes in Obese Mice
journal, January 2014


Multiple factor analysis: principal component analysis for multitable and multiblock data sets: Multiple factor analysis
journal, February 2013

  • Abdi, Hervé; Williams, Lynne J.; Valentin, Domininique
  • Wiley Interdisciplinary Reviews: Computational Statistics, Vol. 5, Issue 2
  • DOI: 10.1002/wics.1246

Probabilistic Principal Component Analysis
journal, August 1999

  • Tipping, Michael E.; Bishop, Christopher M.
  • Journal of the Royal Statistical Society: Series B (Statistical Methodology), Vol. 61, Issue 3, p. 611-622
  • DOI: 10.1111/1467-9868.00196

Structured Matrix Completion with Applications to Genomic Data Integration
journal, April 2016


Functional Intestinal Bile Acid 7α-Dehydroxylation by Clostridium scindens Associated with Protection from Clostridium difficile Infection in a Gnotobiotic Mouse Model
journal, December 2016

  • Studer, Nicolas; Desharnais, Lyne; Beutler, Markus
  • Frontiers in Cellular and Infection Microbiology, Vol. 6
  • DOI: 10.3389/fcimb.2016.00191

Biplots of compositional data
journal, October 2002

  • Aitchison, John; Greenacre, Michael
  • Journal of the Royal Statistical Society: Series C (Applied Statistics), Vol. 51, Issue 4
  • DOI: 10.1111/1467-9876.00275

Microbial community profiling for human microbiome projects: Tools, techniques, and challenges
journal, April 2009


Quantitative and Qualitative   Diversity Measures Lead to Different Insights into Factors That Structure Microbial Communities
journal, January 2007

  • Lozupone, C. A.; Hamady, M.; Kelley, S. T.
  • Applied and Environmental Microbiology, Vol. 73, Issue 5
  • DOI: 10.1128/AEM.01996-06

A phylogenetic transform enhances analysis of compositional microbiota data
journal, February 2017

  • Silverman, Justin D.; Washburne, Alex D.; Mukherjee, Sayan
  • eLife, Vol. 6
  • DOI: 10.7554/eLife.21887

Correlational Procedures for Research
journal, November 1979

  • Ginter, James L.; Thorndike, Robert M.
  • Journal of Marketing Research, Vol. 16, Issue 4
  • DOI: 10.2307/3150840

The Biological Observation Matrix (BIOM) format or: how I learned to stop worrying and love the ome-ome
journal, July 2012

  • McDonald, Daniel; Clemente, Jose C.; Kuczynski, Justin
  • GigaScience, Vol. 1, Issue 1
  • DOI: 10.1186/2047-217X-1-7

The development of numerical classification and ordination
journal, October 1980


On Information and Sufficiency
journal, March 1951

  • Kullback, S.; Leibler, R. A.
  • The Annals of Mathematical Statistics, Vol. 22, Issue 1
  • DOI: 10.1214/aoms/1177729694

Differential abundance analysis for microbial marker-gene surveys
journal, September 2013

  • Paulson, Joseph N.; Stine, O. Colin; Bravo, Héctor Corrada
  • Nature Methods, Vol. 10, Issue 12
  • DOI: 10.1038/nmeth.2658

Waste Not, Want Not: Why Rarefying Microbiome Data Is Inadmissible
journal, April 2014


Robust principal component analysis?
journal, May 2011


Sponge-mediated nitrification in tropical benthic communities
journal, January 1997


Pyrosequencing-Based Assessment of Soil pH as a Predictor of Soil Bacterial Community Structure at the Continental Scale
journal, June 2009

  • Lauber, C. L.; Hamady, M.; Knight, R.
  • Applied and Environmental Microbiology, Vol. 75, Issue 15
  • DOI: 10.1128/AEM.00335-09

Matrix completion from a few entries
conference, June 2009

  • Keshavan, Raghunandan H.; Oh, Sewoong; Montanari, Andrea
  • 2009 IEEE International Symposium on Information Theory - ISIT
  • DOI: 10.1109/ISIT.2009.5205567

Associating microbiome composition with environmental covariates using generalized UniFrac distances
journal, June 2012


Phylogenetic Placement of Exact Amplicon Sequences Improves Associations with Clinical Information
journal, April 2018


Distribution-Free and Robust Statistical Methods: Viable Alternatives to Parametric Statistics
journal, September 1993

  • Potvin, Catherine; Roff, Derek A.
  • Ecology, Vol. 74, Issue 6
  • DOI: 10.2307/1939920

The ecology and phylogeny of cyanobacterial symbionts in sponges
journal, June 2008


zCompositions — R package for multivariate imputation of left-censored data under a compositional approach
journal, April 2015

  • Palarea-Albaladejo, Javier; Martín-Fernández, Josep Antoni
  • Chemometrics and Intelligent Laboratory Systems, Vol. 143
  • DOI: 10.1016/j.chemolab.2015.02.019

Forensic identification using skin bacterial communities
journal, March 2010

  • Fierer, N.; Lauber, C. L.; Zhou, N.
  • Proceedings of the National Academy of Sciences, Vol. 107, Issue 14
  • DOI: 10.1073/pnas.1000162107

Matplotlib: A 2D Graphics Environment
journal, January 2007


Uncovering the Horseshoe Effect in Microbial Analyses
journal, February 2017


The sponge microbiome project
journal, August 2017


Deblur Rapidly Resolves Single-Nucleotide Community Sequence Patterns
journal, March 2017


Microbial community resemblance methods differ in their ability to detect biologically relevant patterns
journal, September 2010

  • Kuczynski, Justin; Liu, Zongzhi; Lozupone, Catherine
  • Nature Methods, Vol. 7, Issue 10
  • DOI: 10.1038/nmeth.1499

Logistic-Normal Distributions: Some Properties and Uses
journal, August 1980

  • Aitchison, J.; Shen, S. M.
  • Biometrika, Vol. 67, Issue 2
  • DOI: 10.2307/2335470

The gut microbiome in atherosclerotic cardiovascular disease
journal, October 2017


The Multivariate Poisson-Log Normal Distribution
journal, December 1989

  • Aitchison, J.; Ho, C. H.
  • Biometrika, Vol. 76, Issue 4
  • DOI: 10.2307/2336624

UniFrac: a New Phylogenetic Method for Comparing Microbial Communities
journal, December 2005


Integrated metabolism in sponge–microbe symbiosis revealed by genome-centered metatranscriptomics
journal, March 2017

  • Moitinho-Silva, Lucas; Díez-Vives, Cristina; Batani, Giampiero
  • The ISME Journal, Vol. 11, Issue 7
  • DOI: 10.1038/ismej.2017.25

Qiita: rapid, web-enabled microbiome meta-analysis
journal, October 2018

  • Gonzalez, Antonio; Navas-Molina, Jose A.; Kosciolek, Tomasz
  • Nature Methods, Vol. 15, Issue 10
  • DOI: 10.1038/s41592-018-0141-9