Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

A Bayesian nonparametric analysis for zero-inflated multivariate count data with application to microbiome study

Journal Article · · Journal of the Royal Statistical Society, Series C: Applied Statistics
DOI:https://doi.org/10.1111/rssc.12493· OSTI ID:1810768
 [1];  [2];  [2];  [3]
  1. Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)
  2. Univ. of California, Santa Barbara, CA (United States). Chemistry and Biochemistry Dept.
  3. Univ. of California, Santa Cruz, CA (United States). Dept. of Statistics
High-throughput sequencing technology has enabled researchers to profile microbial communities from a variety of environments, but analysis of multivariate taxon count data remains challenging. Here, we develop a Bayesian nonparametric (BNP) regression model with zero inflation to analyse multivariate count data from microbiome studies. A BNP approach flexibly models microbial associations with covariates, such as environmental factors and clinical characteristics. The model produces estimates for probability distributions which relate microbial diversity and differential abundance to covariates, and facilitates community comparisons beyond those provided by simple statistical tests. We compare the model to simpler models and popular alternatives in simulation studies, showing, in addition to these additional community-level insights, it yields superior parameter estimates and model fit in various settings. The model's utility is demonstrated by applying it to a chronic wound microbiome data set and a Human Microbiome Project data set, where it is used to compare microbial communities present in different environments.
Research Organization:
Sandia National Laboratories (SNL-NM), Albuquerque, NM (United States)
Sponsoring Organization:
National Institutes of Health (NIH); National Science Foundation (NSF); USDOE National Nuclear Security Administration (NNSA)
Grant/Contract Number:
AC04-94AL85000; NA0003525
OSTI ID:
1810768
Report Number(s):
SAND--2021-4709J; 695645
Journal Information:
Journal of the Royal Statistical Society, Series C: Applied Statistics, Journal Name: Journal of the Royal Statistical Society, Series C: Applied Statistics Journal Issue: 4 Vol. 70; ISSN 0035-9254
Publisher:
WileyCopyright Statement
Country of Publication:
United States
Language:
English

References (36)

A Bayesian mixture model for clustering and selection of feature occurrence rates under mean constraints: LI et al.
  • Li, Qiwei; Guindani, Michele; Reich, Brian J.
  • Statistical Analysis and Data Mining: The ASA Data Science Journal, Vol. 10, Issue 6 https://doi.org/10.1002/sam.11350
journal June 2017
Strain- and Species-Level Variation in the Microbiome of Diabetic Wounds Is Associated with Clinical Outcomes and Therapeutic Efficacy journal May 2019
Stick-breaking autoregressive processes journal June 2011
Temporal Stability in Chronic Wound Microbiota Is Associated With Poor Healing journal January 2017
Differential abundance analysis for microbial marker-gene surveys journal September 2013
Microbial predictors of healing and short-term effect of debridement on the microbiome of chronic wounds journal May 2020
MIMIX: A Bayesian Mixed-Effects Model for Microbiome Data From Designed Experiments journal July 2019
Bayesian Graphical Compositional Regression for Microbiome Data journal August 2019
edgeR: a Bioconductor package for differential expression analysis of digital gene expression data journal November 2009
A robust approach for identifying differentially abundant features in metagenomic samples journal March 2015
A two-part mixed-effects model for analyzing longitudinal microbiome compositional data journal May 2016
Generalized Spatial Dirichlet Process Models journal August 2007
Latent variable modeling for the microbiome journal June 2018
Zero-inflated generalized Dirichlet multinomial regression model for microbiome compositional data analysis journal June 2018
Bayesian variable selection for multivariate zero-inflated models: Application to microbiome count data journal December 2018
Bayesian measures of model complexity and fit
  • Spiegelhalter, David J.; Best, Nicola G.; Carlin, Bradley P.
  • Journal of the Royal Statistical Society: Series B (Statistical Methodology), Vol. 64, Issue 4 https://doi.org/10.1111/1467-9868.00353
journal October 2002
Spatial and temporal variability of the human microbiota journal July 2012
Comparison of Hierarchical Bayesian Models for Overdispersed Count Data using DIC and Bayes' Factors journal January 2009
Bayesian Nonparametric Nonproportional Hazards Survival Modeling journal February 2009
A Time-Series DDP for Functional Proteomics Profiles journal January 2012
Bayesian Model Choice: Asymptotics and Exact Calculations journal September 1994
Analysis of the chronic wound microbiota of 2,963 patients by 16S rDNA pyrosequencing journal December 2015
Topographical and Temporal Diversity of the Human Skin Microbiome journal May 2009
Modelling of zero-inflation improves inference of metagenomic gene count data journal November 2018
Negative binomial mixed models for analyzing microbiome count data journal January 2017
Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2 journal December 2014
Characterization of oral and gut microbiome temporal variability in hospitalized cancer patients journal February 2017
Gibbs Sampling Methods for Stick-Breaking Priors journal March 2001
An ANOVA Model for Dependent Random Measures journal March 2004
Bayesian Nonparametric Spatial Modeling With Dirichlet Process Mixing journal September 2005
Bayesian semiparametric inference for multivariate doubly-interval-censored data journal December 2010
Nonparametric Bayesian models through probit stick-breaking processes journal March 2011
Bayesian mixed effects models for zero-inflated compositions in microbiome data analysis journal March 2020
Waste Not, Want Not: Why Rarefying Microbiome Data Is Inadmissible journal April 2014
A Bayesian Semiparametric Regression Model for Joint Analysis of Microbiome Data journal March 2018
A longitudinal study of the diabetic skin and wound microbiome journal January 2017

Similar Records

A Graphical Model for Fusing Diverse Microbiome Data
Journal Article · Tue Sep 05 20:00:00 EDT 2023 · IEEE Transactions on Signal Processing · OSTI ID:2283181

Multivariate nonparametric trend assessment with environmental applications
Thesis/Dissertation · Tue Dec 31 23:00:00 EST 1991 · OSTI ID:6965432

Scalable Bayesian Nonparametric Clustering and Classification
Journal Article · Thu Jul 18 20:00:00 EDT 2019 · Journal of Computational and Graphical Statistics · OSTI ID:1566123