A Bayesian nonparametric analysis for zero-inflated multivariate count data with application to microbiome study
Journal Article
· Journal of the Royal Statistical Society, Series C: Applied Statistics
- Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)
- Univ. of California, Santa Barbara, CA (United States). Chemistry and Biochemistry Dept.
- Univ. of California, Santa Cruz, CA (United States). Dept. of Statistics
High-throughput sequencing technology has enabled researchers to profile microbial communities from a variety of environments, but analysis of multivariate taxon count data remains challenging. Here, we develop a Bayesian nonparametric (BNP) regression model with zero inflation to analyse multivariate count data from microbiome studies. A BNP approach flexibly models microbial associations with covariates, such as environmental factors and clinical characteristics. The model produces estimates for probability distributions that relate microbial diversity and differential abundance to covariates, and facilitates community comparisons beyond those provided by simple statistical tests. In simulation studies, we compare the model to simpler models and popular alternatives, showing that, beyond these community-level insights, it yields superior parameter estimates and model fit in various settings. The model's utility is demonstrated by applying it to a chronic wound microbiome data set and a Human Microbiome Project data set, where it is used to compare microbial communities present in different environments.
- Research Organization:
- Sandia National Laboratories (SNL-NM), Albuquerque, NM (United States)
- Sponsoring Organization:
- National Institutes of Health (NIH); National Science Foundation (NSF); USDOE National Nuclear Security Administration (NNSA)
- Grant/Contract Number:
- AC04-94AL85000; NA0003525
- OSTI ID:
- 1810768
- Report Number(s):
- SAND--2021-4709J; 695645
- Journal Information:
- Journal of the Royal Statistical Society, Series C: Applied Statistics; Vol. 70, Issue 4; ISSN 0035-9254
- Publisher:
- Wiley
- Country of Publication:
- United States
- Language:
- English
Similar Records
A Graphical Model for Fusing Diverse Microbiome Data
Journal Article
·
Tue Sep 05 20:00:00 EDT 2023
· IEEE Transactions on Signal Processing
·
OSTI ID:2283181
Multivariate nonparametric trend assessment with environmental applications
Thesis/Dissertation
·
Tue Dec 31 23:00:00 EST 1991
·
OSTI ID:6965432
Scalable Bayesian Nonparametric Clustering and Classification
Journal Article
·
Thu Jul 18 20:00:00 EDT 2019
· Journal of Computational and Graphical Statistics
·
OSTI ID:1566123