A Graphical Model for Fusing Diverse Microbiome Data
- University of Michigan, Ann Arbor, MI (United States); University of Michigan
- University of Michigan, Ann Arbor, MI (United States)
- University of Wisconsin–Madison, WI (United States); Wisconsin Institute for Discovery, Madison, WI (United States)
- Wisconsin Institute for Discovery, Madison, WI (United States)
- Wisconsin Institute for Discovery, Madison, WI (United States); University of Wisconsin–Madison, WI (United States)
This paper develops a Bayesian graphical model for fusing disparate types of count data. The motivating application is the study of bacterial communities from diverse high-dimensional features, in this case, transcripts, collected from different treatments. In such datasets, there are no explicit correspondences between the communities and each corresponds to different factors, making data fusion challenging. We introduce a flexible multinomial-Gaussian generative model for jointly modeling such count data. This latent variable model jointly characterizes the observed data through a common multivariate Gaussian latent space that parameterizes the set of multinomial probabilities of the transcriptome counts. The covariance matrix of the latent variables induces a covariance matrix of co-dependencies between all the transcripts, effectively fusing multiple data sources. We present a computationally scalable variational Expectation-Maximization (EM) algorithm for inferring the latent variables and the parameters of the model. Here, the inferred latent variables provide a common dimensionality reduction for visualizing the data and the inferred parameters provide a predictive posterior distribution. In addition to simulation studies that demonstrate the variational EM procedure, we apply our model to a bacterial microbiome dataset.
- Research Organization:
- Georgia Institute of Technology, Atlanta, GA (United States)
- Sponsoring Organization:
- USDOE National Nuclear Security Administration (NNSA)
- Grant/Contract Number:
- NA0003921
- OSTI ID:
- 2283181
- Journal Information:
- IEEE Transactions on Signal Processing, Journal Name: IEEE Transactions on Signal Processing Vol. 71; ISSN 1053-587X
- Publisher:
- IEEECopyright Statement
- Country of Publication:
- United States
- Language:
- English
Similar Records
Latent Dirichlet Allocation modeling of environmental microbiomes
Axiom Microbiome Array, the next generation microarray for high-throughput pathogen and microbiome analysis