DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Exact and efficient hybrid Monte Carlo algorithm for accelerated Bayesian inference of gene expression models from snapshots of single-cell transcripts

Abstract

Single cells exhibit a significant amount of variability in transcript levels, which arises from slow, stochastic transitions between gene expression states. Elucidating the nature of these states and understanding how transition rates are affected by different regulatory mechanisms require state-of-the-art methods to infer underlying models of gene expression from single cell data. A Bayesian approach to statistical inference is the most suitable method for model selection and uncertainty quantification of kinetic parameters using small data sets. However, this approach is impractical because current algorithms are too slow to handle typical models of gene expression. To solve this problem, we first show that time-dependent mRNA distributions of discrete-state models of gene expression are dynamic Poisson mixtures, whose mixing kernels are characterized by a piecewise deterministic Markov process. Here, we combined this analytical result with a kinetic Monte Carlo algorithm to create a hybrid numerical method that accelerates the calculation of time-dependent mRNA distributions by 1000-fold compared to current methods. We then integrated the hybrid algorithm into an existing Monte Carlo sampler to estimate the Bayesian posterior distribution of many different, competing models in a reasonable amount of time. We demonstrate that kinetic parameters can be reasonably constrained for modestly sampled datamore » sets if the model is known a priori. If there are many competing models, Bayesian evidence can rigorously quantify the likelihood of a model relative to other models from the data. We demonstrate that Bayesian evidence selects the true model and outperforms approximate metrics typically used for model selection.« less

Authors:
ORCiD logo [1]; ORCiD logo [2]
  1. Los Alamos National Lab. (LANL), Los Alamos, NM (United States)
  2. North Carolina State Univ., Raleigh, NC (United States)
Publication Date:
Research Org.:
Los Alamos National Laboratory (LANL), Los Alamos, NM (United States)
Sponsoring Org.:
USDOE Laboratory Directed Research and Development (LDRD) Program
OSTI Identifier:
1557757
Alternate Identifier(s):
OSTI ID: 1532586
Report Number(s):
LA-UR-18-31392
Journal ID: ISSN 0021-9606
Grant/Contract Number:  
89233218CNA000001
Resource Type:
Accepted Manuscript
Journal Name:
Journal of Chemical Physics
Additional Journal Information:
Journal Volume: 151; Journal Issue: 2; Journal ID: ISSN 0021-9606
Publisher:
American Institute of Physics (AIP)
Country of Publication:
United States
Language:
English
Subject:
59 BASIC BIOLOGICAL SCIENCES; Biological Science; Mathematics

Citation Formats

Lin, Yen Ting, and Buchler, Nicolas E. Exact and efficient hybrid Monte Carlo algorithm for accelerated Bayesian inference of gene expression models from snapshots of single-cell transcripts. United States: N. p., 2019. Web. doi:10.1063/1.5110503.
Lin, Yen Ting, & Buchler, Nicolas E. Exact and efficient hybrid Monte Carlo algorithm for accelerated Bayesian inference of gene expression models from snapshots of single-cell transcripts. United States. https://doi.org/10.1063/1.5110503
Lin, Yen Ting, and Buchler, Nicolas E. Sun . "Exact and efficient hybrid Monte Carlo algorithm for accelerated Bayesian inference of gene expression models from snapshots of single-cell transcripts". United States. https://doi.org/10.1063/1.5110503. https://www.osti.gov/servlets/purl/1557757.
@article{osti_1557757,
title = {Exact and efficient hybrid Monte Carlo algorithm for accelerated Bayesian inference of gene expression models from snapshots of single-cell transcripts},
author = {Lin, Yen Ting and Buchler, Nicolas E.},
abstractNote = {Single cells exhibit a significant amount of variability in transcript levels, which arises from slow, stochastic transitions between gene expression states. Elucidating the nature of these states and understanding how transition rates are affected by different regulatory mechanisms require state-of-the-art methods to infer underlying models of gene expression from single cell data. A Bayesian approach to statistical inference is the most suitable method for model selection and uncertainty quantification of kinetic parameters using small data sets. However, this approach is impractical because current algorithms are too slow to handle typical models of gene expression. To solve this problem, we first show that time-dependent mRNA distributions of discrete-state models of gene expression are dynamic Poisson mixtures, whose mixing kernels are characterized by a piecewise deterministic Markov process. Here, we combined this analytical result with a kinetic Monte Carlo algorithm to create a hybrid numerical method that accelerates the calculation of time-dependent mRNA distributions by 1000-fold compared to current methods. We then integrated the hybrid algorithm into an existing Monte Carlo sampler to estimate the Bayesian posterior distribution of many different, competing models in a reasonable amount of time. We demonstrate that kinetic parameters can be reasonably constrained for modestly sampled data sets if the model is known a priori. If there are many competing models, Bayesian evidence can rigorously quantify the likelihood of a model relative to other models from the data. We demonstrate that Bayesian evidence selects the true model and outperforms approximate metrics typically used for model selection.},
doi = {10.1063/1.5110503},
journal = {Journal of Chemical Physics},
number = 2,
volume = 151,
place = {United States},
year = {Sun Jul 14 00:00:00 EDT 2019},
month = {Sun Jul 14 00:00:00 EDT 2019}
}

Journal Article:
Free Publicly Available Full Text
Publisher's Version of Record

Citation Metrics:
Cited by: 4 works
Citation information provided by
Web of Science

Save / Share:

Works referenced in this record:

Real-Time Kinetics of Gene Activity in Individual Bacteria
journal, December 2005


Directed cell migration towards softer environments
journal, July 2022


Enhancer Histone Acetylation Modulates Transcriptional Bursting Dynamics of Neuronal Activity-Inducible Genes
journal, January 2019


The finite state projection algorithm for the solution of the chemical master equation
journal, January 2006

  • Munsky, Brian; Khammash, Mustafa
  • The Journal of Chemical Physics, Vol. 124, Issue 4
  • DOI: 10.1063/1.2145882

Mammalian Genes Are Transcribed with Widely Different Bursting Kinetics
journal, March 2011


Information Theory and an Extension of the Maximum Likelihood Principle
book, January 1998


Oxytocin facilitates valence-dependent valuation of social evaluation of the self
journal, August 2020


Systematic Identification of Signal-Activated Stochastic Gene Regulation
journal, January 2013


Dichotomous Markov Noise: Exact Results for Out-Of-Equilibrium Systems
journal, August 2006


Bursty Gene Expression in the Intact Mammalian Liver
journal, April 2015


Single-cell RNA sequencing technologies and bioinformatics pipelines
journal, August 2018

  • Hwang, Byungjin; Lee, Ji Hyun; Bang, Duhee
  • Experimental & Molecular Medicine, Vol. 50, Issue 8
  • DOI: 10.1038/s12276-018-0071-8

A general method for numerically simulating the stochastic time evolution of coupled chemical reactions
journal, December 1976


Transcription Factors Modulate c-Fos Transcriptional Bursts
journal, July 2014


Exact stochastic simulation of coupled chemical reactions
journal, December 1977

  • Gillespie, Daniel T.
  • The Journal of Physical Chemistry, Vol. 81, Issue 25
  • DOI: 10.1021/j100540a008

Distribution shapes govern the discovery of predictive models for gene regulation
journal, June 2018

  • Munsky, Brian; Li, Guoliang; Fox, Zachary R.
  • Proceedings of the National Academy of Sciences, Vol. 115, Issue 29
  • DOI: 10.1073/pnas.1804060115

Bayesian inference on stochastic gene transcription from flow cytometry data
journal, September 2018


State-dependent doubly weighted stochastic simulation algorithm for automatic characterization of stochastic biochemical rare events
journal, December 2011

  • Roh, Min K.; Daigle, Bernie J.; Gillespie, Dan T.
  • The Journal of Chemical Physics, Vol. 135, Issue 23
  • DOI: 10.1063/1.3668100

BayFish: Bayesian inference of transcription dynamics from population snapshots of single-molecule RNA FISH in single cells
journal, September 2017


Bursting noise in gene expression dynamics: linking microscopic and mesoscopic models
journal, January 2016

  • Lin, Yen Ting; Galla, Tobias
  • Journal of The Royal Society Interface, Vol. 13, Issue 114
  • DOI: 10.1098/rsif.2015.0772

Stochastic switching in biology: from genotype to phenotype
journal, February 2017

  • Bressloff, Paul C.
  • Journal of Physics A: Mathematical and Theoretical, Vol. 50, Issue 13
  • DOI: 10.1088/1751-8121/aa5db4

Weak convergence and optimal scaling of random walk Metropolis algorithms
journal, February 1997

  • Roberts, G. O.; Gelman, A.; Gilks, W. R.
  • The Annals of Applied Probability, Vol. 7, Issue 1
  • DOI: 10.1214/aoap/1034625254

Exact Distributions for Stochastic Gene Expression Models with Bursting and Feedback
journal, December 2014


A stochastic and dynamical view of pluripotency in mouse embryonic stem cells
journal, February 2018


A continuum model of transcriptional bursting
journal, February 2016

  • Corrigan, Adam M.; Tunnacliffe, Edward; Cannon, Danielle
  • eLife, Vol. 5
  • DOI: 10.7554/elife.13051

Nature, Nurture, or Chance: Stochastic Gene Expression and Its Consequences
journal, October 2008


Refining the weighted stochastic simulation algorithm
journal, May 2009

  • Gillespie, Dan T.; Roh, Min; Petzold, Linda R.
  • The Journal of Chemical Physics, Vol. 130, Issue 17
  • DOI: 10.1063/1.3116791

mRNA-Seq whole-transcriptome analysis of a single cell
journal, April 2009

  • Tang, Fuchou; Barbacioru, Catalin; Wang, Yangzhou
  • Nature Methods, Vol. 6, Issue 5
  • DOI: 10.1038/nmeth.1315

What shapes eukaryotic transcriptional bursting?
journal, January 2017

  • Nicolas, Damien; Phillips, Nick E.; Naef, Felix
  • Molecular BioSystems, Vol. 13, Issue 7
  • DOI: 10.1039/c7mb00154a

Real-Time Observation of Transcription Initiation and Elongation on an Endogenous Yeast Gene
journal, April 2011


Single-RNA counting reveals alternative modes of gene expression in yeast
journal, November 2008

  • Zenklusen, Daniel; Larson, Daniel R.; Singer, Robert H.
  • Nature Structural & Molecular Biology, Vol. 15, Issue 12
  • DOI: 10.1038/nsmb.1514

Bayes Factors
journal, June 1995

  • Kass, Robert E.; Raftery, Adrian E.
  • Journal of the American Statistical Association, Vol. 90, Issue 430
  • DOI: 10.2307/2291091

Transcriptional Bursting Diversifies the Behaviour of a Toggle Switch: Hybrid Simulation of Stochastic Gene Expression
journal, January 2013

  • Bokes, Pavol; King, John R.; Wood, Andrew T. A.
  • Bulletin of Mathematical Biology, Vol. 75, Issue 2
  • DOI: 10.1007/s11538-013-9811-z

Precise Developmental Gene Expression Arises from Globally Stochastic Transcriptional Activity
journal, August 2013


Refining the weighted stochastic simulation algorithm
journal, May 2009

  • Gillespie, Dan T.; Roh, Min; Petzold, Linda R.
  • The Journal of Chemical Physics, Vol. 130, Issue 17
  • DOI: 10.1063/1.3116791

Stochastic mRNA Synthesis in Mammalian Cells
journal, September 2006


Non-equilibrium Thermodynamics of Piecewise Deterministic Markov Processes
journal, October 2009

  • Faggionato, A.; Gabrielli, D.; Ribezzi Crivellari, M.
  • Journal of Statistical Physics, Vol. 137, Issue 2
  • DOI: 10.1007/s10955-009-9850-x

Modulation of transcriptional burst frequency by histone acetylation
journal, June 2018

  • Nicolas, Damien; Zoller, Benjamin; Suter, David M.
  • Proceedings of the National Academy of Sciences, Vol. 115, Issue 27
  • DOI: 10.1073/pnas.1722330115

Universally Sloppy Parameter Sensitivities in Systems Biology Models
journal, January 2005


Analytical distributions for stochastic gene expression
text, January 2008


Enhancer Control of Transcriptional Bursting
journal, July 2016


Universally Sloppy Parameter Sensitivities in Systems Biology Models
journal, January 2007

  • Gutenkunst, Ryan N.; Waterfall, Joshua J.; Casey, Fergal P.
  • PLoS Computational Biology, Vol. 3, Issue 10
  • DOI: 10.1371/journal.pcbi.0030189

Estimating the Marginal Likelihood Using the Arithmetic Mean Identity
journal, March 2017


Reversible jump Markov chain Monte Carlo computation and Bayesian model determination
journal, January 1995


Rigidity enhances a magic-number effect in polymer phase separation
journal, March 2020


Friendship stability in adolescence is associated with ventral striatum responses to vicarious rewards
journal, January 2021

  • Schreuders, Elisabeth; Braams, Barbara R.; Crone, Eveline A.
  • Nature Communications, Vol. 12, Issue 1
  • DOI: 10.1038/s41467-020-20042-1

Bayesian inference on stochastic gene transcription from flow cytometry data
text, January 2018

  • Tiberi, Simone; Walsh, Mark; Cavallaro, Massimo
  • Oxford University Press
  • DOI: 10.5167/uzh-207552

Dichotomous Markov Noise: Exact Results for Out-Of-Equilibrium Systems
journal, August 2006


Chance Fluctuations in mRNA Output in Mammalian Cells
journal, September 2006


Exact distributions for stochastic gene expression models with bursting and feedback
text, January 2014


An efficient and exact stochastic simulation method to analyze rare events in biochemical systems
journal, October 2008

  • Kuwahara, Hiroyuki; Mura, Ivan
  • The Journal of Chemical Physics, Vol. 129, Issue 16
  • DOI: 10.1063/1.2987701

Gene expression distribution deconvolution in single-cell RNA sequencing
journal, June 2018

  • Wang, Jingshu; Huang, Mo; Torre, Eduardo
  • Proceedings of the National Academy of Sciences, Vol. 115, Issue 28
  • DOI: 10.1073/pnas.1721085115

Analytical distributions for stochastic gene expression
journal, November 2008

  • Shahrezaei, V.; Swain, P. S.
  • Proceedings of the National Academy of Sciences, Vol. 105, Issue 45
  • DOI: 10.1073/pnas.0803850105

Systematic Identification of Signal-Activated Stochastic Gene Regulation
journal, January 2013


A general method for numerically simulating the stochastic time evolution of coupled chemical reactions
journal, December 1976


Enhancer Histone Acetylation Modulates Transcriptional Bursting Dynamics of Neuronal Activity-Inducible Genes
journal, January 2019


Computational methods for Bayesian model choice
conference, January 2009

  • Robert, C. P.; Wraith, D.; Goggans, Paul M.
  • BAYESIAN INFERENCE AND MAXIMUM ENTROPY METHODS IN SCIENCE AND ENGINEERING: The 29th International Workshop on Bayesian Inference and Maximum Entropy Methods in Science and Engineering, AIP Conference Proceedings
  • DOI: 10.1063/1.3275622

Regulation of noise in the expression of a single gene
journal, April 2002

  • Ozbudak, Ertugrul M.; Thattai, Mukund; Kurtser, Iren
  • Nature Genetics, Vol. 31, Issue 1
  • DOI: 10.1038/ng869

Estimating the Dimension of a Model
journal, March 1978


Bayes Factors
journal, June 1995


Stochastic Gene Expression in a Single Cell
journal, August 2002


Stochastic Gene Expression with a Multistate Promoter: Breaking Down Exact Distributions
journal, January 2019

  • Herbach, Ulysse
  • SIAM Journal on Applied Mathematics, Vol. 79, Issue 3
  • DOI: 10.1137/18M1181006

The finite state projection algorithm for the solution of the chemical master equation
journal, January 2006

  • Munsky, Brian; Khammash, Mustafa
  • The Journal of Chemical Physics, Vol. 124, Issue 4
  • DOI: 10.1063/1.2145882

Single-cell analysis of transcription kinetics across the cell cycle
journal, January 2016


Single-RNA counting reveals alternative modes of gene expression in yeast
journal, November 2008

  • Zenklusen, Daniel; Larson, Daniel R.; Singer, Robert H.
  • Nature Structural & Molecular Biology, Vol. 15, Issue 12
  • DOI: 10.1038/nsmb.1514

Accurate Chemical Master Equation Solution Using Multi-Finite Buffers
text, January 2017


Analytical distributions for stochastic gene expression
journal, November 2008

  • Shahrezaei, V.; Swain, P. S.
  • Proceedings of the National Academy of Sciences, Vol. 105, Issue 45
  • DOI: 10.1073/pnas.0803850105

mRNA-Seq whole-transcriptome analysis of a single cell
journal, April 2009

  • Tang, Fuchou; Barbacioru, Catalin; Wang, Yangzhou
  • Nature Methods, Vol. 6, Issue 5
  • DOI: 10.1038/nmeth.1315

What shapes eukaryotic transcriptional bursting?
journal, January 2017

  • Nicolas, Damien; Phillips, Nick E.; Naef, Felix
  • Molecular BioSystems, Vol. 13, Issue 7
  • DOI: 10.1039/c7mb00154a

A continuum model of transcriptional bursting
journal, February 2016

  • Corrigan, Adam M.; Tunnacliffe, Edward; Cannon, Danielle
  • eLife, Vol. 5
  • DOI: 10.7554/elife.13051

Efficient analysis of stochastic gene dynamics in the non-adiabatic regime using piecewise deterministic Markov processes
journal, January 2018

  • Lin, Yen Ting; Buchler, Nicolas E.
  • Journal of The Royal Society Interface, Vol. 15, Issue 138
  • DOI: 10.1098/rsif.2017.0804

Distribution shapes govern the discovery of predictive models for gene regulation
journal, June 2018

  • Munsky, Brian; Li, Guoliang; Fox, Zachary R.
  • Proceedings of the National Academy of Sciences, Vol. 115, Issue 29
  • DOI: 10.1073/pnas.1804060115

Real-Time Kinetics of Gene Activity in Individual Bacteria
journal, December 2005


Computational methods for Bayesian model choice
text, January 2009


Mammalian Genes Are Transcribed with Widely Different Bursting Kinetics
journal, March 2011


Universally Sloppy Parameter Sensitivities in Systems Biology Models
journal, January 2007

  • Gutenkunst, Ryan N.; Waterfall, Joshua J.; Casey, Fergal P.
  • PLoS Computational Biology, Vol. 3, Issue 10
  • DOI: 10.1371/journal.pcbi.0030189

Estimating the Marginal Likelihood Using the Arithmetic Mean Identity
journal, March 2017


Transcription Factors Modulate c-Fos Transcriptional Bursts
journal, July 2014


Accurate Chemical Master Equation Solution Using Multi-Finite Buffers
journal, January 2016

  • Cao, Youfang; Terebus, Anna; Liang, Jie
  • Multiscale Modeling & Simulation, Vol. 14, Issue 2
  • DOI: 10.1137/15m1034180

Piecewise-Deterministic Markov Processes: A General Class of Non-Diffusion Stochastic Models
journal, July 1984


Reversible jump Markov chain Monte Carlo computation and Bayesian model determination
journal, January 1995


BayFish: Bayesian inference of transcription dynamics from population snapshots of single-molecule RNA FISH in single cells
journal, September 2017


A stochastic and dynamical view of pluripotency in mouse embryonic stem cells
text, January 2017


Equation of State Calculations by Fast Computing Machines
journal, June 1953

  • Metropolis, Nicholas; Rosenbluth, Arianna W.; Rosenbluth, Marshall N.
  • The Journal of Chemical Physics, Vol. 21, Issue 6
  • DOI: 10.1063/1.1699114

Bursty Gene Expression in the Intact Mammalian Liver
journal, April 2015


Transcriptional Bursting Diversifies the Behaviour of a Toggle Switch: Hybrid Simulation of Stochastic Gene Expression
journal, January 2013

  • Bokes, Pavol; King, John R.; Wood, Andrew T. A.
  • Bulletin of Mathematical Biology, Vol. 75, Issue 2
  • DOI: 10.1007/s11538-013-9811-z

Non-equilibrium Thermodynamics of Piecewise Deterministic Markov Processes
journal, October 2009

  • Faggionato, A.; Gabrielli, D.; Ribezzi Crivellari, M.
  • Journal of Statistical Physics, Vol. 137, Issue 2
  • DOI: 10.1007/s10955-009-9850-x

Precise Developmental Gene Expression Arises from Globally Stochastic Transcriptional Activity
journal, August 2013


Gene expression distribution deconvolution in single-cell RNA sequencing
journal, June 2018

  • Wang, Jingshu; Huang, Mo; Torre, Eduardo
  • Proceedings of the National Academy of Sciences, Vol. 115, Issue 28
  • DOI: 10.1073/pnas.1721085115

Enhancer Control of Transcriptional Bursting
journal, July 2016


A stochastic and dynamical view of pluripotency in mouse embryonic stem cells
text, January 2017


Regulation of noise in the expression of a single gene
journal, April 2002

  • Ozbudak, Ertugrul M.; Thattai, Mukund; Kurtser, Iren
  • Nature Genetics, Vol. 31, Issue 1
  • DOI: 10.1038/ng869

Approximate Bayesian computation scheme for parameter inference and model selection in dynamical systems
journal, July 2008

  • Toni, Tina; Welch, David; Strelkowa, Natalja
  • Journal of The Royal Society Interface, Vol. 6, Issue 31
  • DOI: 10.1098/rsif.2008.0172

Computational methods for Bayesian model choice
conference, January 2009

  • Robert, C. P.; Wraith, D.; Goggans, Paul M.
  • BAYESIAN INFERENCE AND MAXIMUM ENTROPY METHODS IN SCIENCE AND ENGINEERING: The 29th International Workshop on Bayesian Inference and Maximum Entropy Methods in Science and Engineering, AIP Conference Proceedings
  • DOI: 10.1063/1.3275622

Nature, Nurture, or Chance: Stochastic Gene Expression and Its Consequences
journal, October 2008


Bayesian inference on stochastic gene transcription from flow cytometry data
journal, September 2018


An efficient and exact stochastic simulation method to analyze rare events in biochemical systems
journal, October 2008

  • Kuwahara, Hiroyuki; Mura, Ivan
  • The Journal of Chemical Physics, Vol. 129, Issue 16
  • DOI: 10.1063/1.2987701

Accelerated maximum likelihood parameter estimation for stochastic biochemical systems
journal, January 2012

  • Daigle, Bernie J.; Roh, Min K.; Petzold, Linda R.
  • BMC Bioinformatics, Vol. 13, Issue 1
  • DOI: 10.1186/1471-2105-13-68

Modulation of transcriptional burst frequency by histone acetylation
journal, June 2018

  • Nicolas, Damien; Zoller, Benjamin; Suter, David M.
  • Proceedings of the National Academy of Sciences, Vol. 115, Issue 27
  • DOI: 10.1073/pnas.1722330115

Imaging individual mRNA molecules using multiple singly labeled probes
journal, September 2008

  • Raj, Arjun; van den Bogaard, Patrick; Rifkin, Scott A.
  • Nature Methods, Vol. 5, Issue 10
  • DOI: 10.1038/nmeth.1253

Monte Carlo Sampling Methods Using Markov Chains and Their Applications
journal, April 1970


State-dependent doubly weighted stochastic simulation algorithm for automatic characterization of stochastic biochemical rare events
journal, December 2011

  • Roh, Min K.; Daigle, Bernie J.; Gillespie, Dan T.
  • The Journal of Chemical Physics, Vol. 135, Issue 23
  • DOI: 10.1063/1.3668100

Bursting through the cell cycle
journal, March 2016


The geometric foundations of Hamiltonian Monte Carlo
journal, November 2017

  • Betancourt, Michael; Byrne, Simon; Livingstone, Sam
  • Bernoulli, Vol. 23, Issue 4A
  • DOI: 10.3150/16-BEJ810