skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Machine learning for collective variable discovery and enhanced sampling in biomolecular simulation

Journal Article · · Molecular Physics

Classical molecular dynamics simulates the time evolution of molecular systems through the phase space spanned by the positions and velocities of the constituent atoms. Molecular-level thermodynamic, kinetic, and structural data extracted from the resulting trajectories provide valuable information for the understanding, engineering, and design of biological and molecular materials. The cost of simulating many-body atomic systems makes simulations of large molecules prohibitively expensive, and the high-dimensionality of the resulting trajectories presents a challenge for analysis. Driven by advances in algorithms, hardware, and data availability, there has been a flare of interest in recent years in the applications of machine learning – especially deep learning – to molecular simulation. Furthermore, these techniques have demonstrated great power and flexibility in both extracting mechanistic understanding of the important nonlinear collective variables governing the dynamics of a molecular system, and in furnishing good low-dimensional system representations with which to perform enhanced sampling or develop long-timescale dynamical models. It is the purpose of this article to introduce the key machine learning approaches, describe how they are married with statistical mechanical theory into domain-specific tools, and detail applications of these approaches in understanding and accelerating biomolecular simulation.

Research Organization:
Argonne National Laboratory (ANL), Argonne, IL (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Basic Energy Sciences (BES). Materials Sciences & Engineering Division
Grant/Contract Number:
AC02-06CH11357
OSTI ID:
1651117
Journal Information:
Molecular Physics, Vol. 118, Issue 5; ISSN 0026-8976
Publisher:
Taylor & FrancisCopyright Statement
Country of Publication:
United States
Language:
English
Citation Metrics:
Cited by: 77 works
Citation information provided by
Web of Science

References (150)

Reduced Models in Chemical Kinetics via Nonlinear Data-Mining journal January 2014
Charting molecular free-energy landscapes with an atlas of collective variables journal November 2016
A self-learning algorithm for biased molecular dynamics journal September 2010
Girsanov reweighting for metadynamics simulations journal August 2018
Using sketch-map coordinates to analyze and bias molecular dynamics simulations journal March 2012
Graph Laplacian Tomography From Unknown Random Projections journal October 2008
Nonphysical sampling distributions in Monte Carlo free-energy estimation: Umbrella sampling journal February 1977
Transferable Neural Networks for Enhanced Sampling of Protein Dynamics journal March 2018
Machine learning of single molecule free energy surfaces and the impact of chemistry and environment upon structure and dynamics journal March 2015
Molecular modeling studies of peptoid polymers journal January 2017
Nonlinear machine learning in simulations of soft and biological materials journal November 2017
Reaction coordinates and transition pathways of rare events via forward flux sampling journal October 2007
Large-amplitude nonlinear motions in proteins journal April 1992
Dynamical reweighting: Improved estimates of dynamical properties from simulations at multiple temperatures journal June 2011
Optimal use of data in parallel tempering simulations for the construction of discrete-state Markov models of biomolecular dynamics journal June 2011
Diffusion maps journal July 2006
VAMPnets for deep learning of molecular kinetics journal January 2018
Kinetic Distance and Kinetic Maps from Molecular Dynamics Simulation journal September 2015
Variational Koopman models: Slow collective variables and molecular kinetics from short off-equilibrium simulations journal April 2017
Commute Maps: Separating Slowly Mixing Molecular Configurations for Kinetic Modeling journal October 2016
Demonstrating the Transferability and the Descriptive Power of Sketch-Map journal February 2013
Collective variable discovery and enhanced sampling using autoencoders: Innovations in network architecture and error function design journal August 2018
How Fast-Folding Proteins Fold journal October 2011
Nonlinear Machine Learning of Patchy Colloid Self-Assembly Pathways and Mechanisms journal April 2014
A Study of the Morphology, Dynamics, and Folding Pathways of Ring Polymers with Supramolecular Topological Constraints Using Molecular Simulation and Nonlinear Manifold Learning journal January 2018
Discovering Mountain Passes via Torchlight: Methods for the Definition of Reaction Coordinates and Pathways in Complex Macromolecular Reactions journal April 2013
High-Resolution Markov State Models for the Dynamics of Trp-Cage Miniprotein Constructed Over Slow Folding Modes Identified by State-Free Reversible VAMPnets journal August 2019
Variational Approach to Molecular Kinetics journal March 2014
Unmasking Clever Hans predictors and assessing what machines really learn journal March 2019
SchNet – A deep learning architecture for molecules and materials journal June 2018
Deep clustering of protein folding simulations journal December 2018
Automated design of collective variables using supervised machine learning journal September 2018
Automatic Method for Identifying Reaction Coordinates in Complex Systems journal April 2005
Molecular enhanced sampling with autoencoders: On-the-fly collective variable discovery and accelerated free energy landscape exploration: Molecular Enhanced Sampling with Autoencoders: On-The-Fly Collective Variable Discovery and Accelerated Free Energy Landscape Exploration journal September 2018
Integrating diffusion maps with umbrella sampling: Application to alanine dipeptide journal April 2011
Spectral gap optimization of order parameters for sampling complex molecular systems journal February 2016
Collective motions in proteins: A covariance analysis of atomic fluctuations in molecular dynamics and normal mode simulations journal November 1991
Machine Calculation of Thermodynamic Properties of a Simple Fluid at Supercritical Temperatures journal December 1967
Design of therapeutic proteins with enhanced stability journal July 2009
Koopman operator-based model reduction for switched-system control of PDEs journal August 2019
Stochastic Gradient Descent Tricks book January 2012
Markov models of molecular kinetics: Generation and validation journal May 2011
Intrinsic map dynamics exploration for uncharted effective free-energy landscapes journal June 2017
Modeling Molecular Kinetics with tICA and the Kernel Trick journal January 2015
A Variational Approach to Modeling Slow Processes in Stochastic Dynamical Systems journal January 2013
Markov state models of biomolecular conformational dynamics journal April 2014
A variational conformational dynamics approach to the selection of collective variables in metadynamics journal November 2017
A Data–Driven Approximation of the Koopman Operator: Extending Dynamic Mode Decomposition journal June 2015
Finite approximations of Frobenius-Perron operators. A solution of Ulam's conjecture to multi-dimensional transformations journal April 1996
Collective Variables from Local Fluctuations journal May 2018
Reweighted autoencoded variational Bayes for enhanced sampling (RAVE) journal August 2018
Simplifying the representation of complex free-energy landscapes using sketch-map journal July 2011
Learning free energy landscapes using artificial neural networks journal March 2018
Enhancing Important Fluctuations: Rare Events and Metadynamics from a Conceptual Viewpoint journal May 2016
Data-Driven Model Reduction and Transfer Operator Approximation journal January 2018
Generalized Neural-Network Representation of High-Dimensional Potential-Energy Surfaces journal April 2007
Nonlinear discovery of slow molecular modes using state-free reversible VAMPnets journal June 2019
tICA-Metadynamics: Accelerating Metadynamics by Using Kinetically Selected Collective Variables journal May 2017
Markov state models from short non-equilibrium simulations—Analysis and correction of estimation bias journal March 2017
A generalized deep learning approach for local structure identification in molecular simulations journal January 2019
Universal approximation to nonlinear operators by neural networks with arbitrary activation functions and its application to dynamical systems journal July 1995
Enhanced sampling in molecular dynamics journal August 2019
Metadynamics: Metadynamics
  • Barducci, Alessandro; Bonomi, Massimiliano; Parrinello, Michele
  • Wiley Interdisciplinary Reviews: Computational Molecular Science, Vol. 1, Issue 5 https://doi.org/10.1002/wcms.31
journal February 2011
Systematic characterization of protein folding pathways using diffusion maps: Application to Trp-cage miniprotein journal February 2015
Simulating materials failure by using up to one billion atoms and the world's fastest computer: Brittle fracture journal April 2002
What Is the Relation Between Slow Feature Analysis and Independent Component Analysis? journal October 2006
Local elevation: A method for improving the searching properties of molecular dynamics simulation journal December 1994
On the Use of Experimental Observations to Bias Simulated Ensembles journal March 2012
Machine learning for molecular and materials science journal July 2018
Unconstrained enhanced sampling for free energy calculations of biomolecules: a review journal July 2016
Predicting slow structural transitions in macromolecular systems: Conformational flooding journal September 1995
NSF’s Inaugural Software Institutes: The Science Gateways Community Institute and the Molecular Sciences Software Institute journal September 2018
Perspective: Computational chemistry software and its advancement as illustrated through three grand challenge cases for molecular science journal November 2018
Neural Network and Nearest Neighbor Algorithms for Enhancing Sampling of Molecular Dynamics journal May 2017
The Protein Folding Problem journal June 2008
The reweighted path ensemble journal November 2010
Determination of reaction coordinates via locally scaled diffusion map journal March 2011
A Global Geometric Framework for Nonlinear Dimensionality Reduction journal December 2000
Nonlinear principal component analysis using autoassociative neural networks journal February 1991
Coarse-grained dynamics of an activity bump in a neural field model journal July 2007
Enhanced Sampling in Molecular Dynamics Using Metadynamics, Replica-Exchange, and Temperature-Acceleration journal December 2013
BayesWHAM: A Bayesian approach for free energy estimation, reweighting, and uncertainty quantification in the weighted histogram analysis method journal May 2017
Atomic-Level Characterization of the Structural Dynamics of Proteins journal October 2010
Toward Achieving Efficient and Accurate Ligand-Protein Unbinding with Deep Learning and Molecular Dynamics through RAVE journal November 2018
TweTriS: Twenty trillion-atom simulation journal January 2019
Studies in Molecular Dynamics. I. General Method journal August 1959
LIII. On lines and planes of closest fit to systems of points in space journal November 1901
Protein folding and de novo protein design for biotechnological applications journal February 2014
The perceptron: A probabilistic model for information storage and organization in the brain. journal January 1958
Obtaining reaction coordinates by likelihood maximization journal August 2006
Dynamic Mode Decomposition with Control journal January 2016
Molecular dynamics simulations in biology journal October 1990
Adaptive enhanced sampling by force-biasing using neural networks journal April 2018
ACS Central Science Virtual Issue on Machine Learning journal August 2018
Rapid Exploration of Configuration Space with Diffusion-Map-Directed Molecular Dynamics journal July 2013
Multiensemble Markov models of molecular thermodynamics and kinetics journal May 2016
Landmark diffusion maps (L-dMaps): Accelerated manifold learning out-of-sample extension journal July 2019
Simulating materials failure by using up to one billion atoms and the world's fastest computer: Work-hardening journal April 2002
Extending Data-Driven Koopman Analysis to Actuated Systems journal January 2016
Molecular Dynamics with On-the-Fly Machine Learning of Quantum-Mechanical Forces journal March 2015
Using Ulam's method to calculate entropy and other dynamical invariants journal January 1999
Calculation of absolute protein-ligand binding free energy from computer simulations journal May 2005
Metadynamics: a method to simulate rare events and reconstruct the free energy in biophysics, chemistry and material science journal November 2008
Extended dynamic mode decomposition with dictionary learning: A data-driven adaptive spectral decomposition of the Koopman operator journal October 2017
Hessian eigenmaps: Locally linear embedding techniques for high-dimensional data journal April 2003
Metadynamics with Discriminants: A Tool for Understanding Chemistry journal August 2018
A Computational Method to Extract Macroscopic Variables and Their Dynamics in Multiscale Systems journal January 2014
Markov State Models: From an Art to a Science journal February 2018
Multiresolution Dynamic Mode Decomposition journal January 2016
Nonlinear Dimensionality Reduction by Locally Linear Embedding journal December 2000
Modeling and enhanced sampling of molecular systems with smooth and nonlinear data-driven collective variables journal December 2013
Everything you wanted to know about Markov State Models but were afraid to ask journal September 2010
Capabilities and limitations of time-lagged autoencoders for slow mode discovery in dynamical systems journal August 2019
Hierarchical Time-Lagged Independent Component Analysis: Computing Slow Modes and Reaction Coordinates for Large Molecular Systems journal November 2016
Improvements in Markov State Model Construction Reveal Many Non-Native Interactions in the Folding of NTL9 journal March 2013
Time-Scale Separation from Diffusion-Mapped Delay Coordinates journal January 2013
Rational design of patchy colloids via landscape engineering journal January 2018
Density of States–Based Molecular Simulations journal July 2012
On Convergence of Extended Dynamic Mode Decomposition to the Koopman Operator journal November 2017
Beating the Millisecond Barrier in Molecular Dynamics Simulations journal January 2015
Phase Transition for a Hard Sphere System journal November 1957
Graphite-diamond phase coexistence study employing a neural-network mapping of the ab initio potential energy surface journal March 2010
Boltzmann generators: Sampling equilibrium states of many-body systems with deep learning journal September 2019
The Protein-Folding Problem, 50 Years On journal November 2012
Nonlinear dimensionality reduction in molecular simulation: The diffusion map approach journal June 2011
Systematic determination of order parameters for chain dynamics using diffusion maps journal July 2010
Deflation reveals dynamical structure in nondominant reaction coordinates journal August 2019
Low-dimensional, free-energy landscapes of protein-folding reactions by nonlinear dimensionality reduction journal June 2006
Time-lagged autoencoders: Deep learning of slow collective variables for molecular kinetics journal June 2018
Fuzzy spectral clustering by PCCA+: application to Markov state models and data classification journal May 2013
Discovery Through the Computational Microscope journal October 2009
PotentialNet for Molecular Property Prediction journal November 2018
Recent advances in machine learning towards multiscale soft materials design journal March 2019
Dynamic Mode Decomposition book January 2016
On Pixel-Wise Explanations for Non-Linear Classifier Decisions by Layer-Wise Relevance Propagation journal July 2015
Dynamic mode decomposition for large and streaming datasets journal November 2014
Girsanov reweighting for path ensembles and Markov state models journal June 2017
Relations Between two sets of Variates journal December 1936
Variants of Dynamic Mode Decomposition: Boundary Condition, Koopman, and Fourier Analyses journal April 2012
Extensions to the likelihood maximization approach for finding reaction coordinates journal July 2007
Separation of a mixture of independent signals using time delayed correlations journal June 1994
Escaping free-energy minima journal September 2002
Geometric diffusions as a tool for harmonic analysis and structure definition of data: Diffusion maps journal May 2005
Folding a small protein using harmonic linear discriminant analysis journal November 2018
Tensor-based dynamic mode decomposition journal June 2018
Unsupervised Learning of Image Manifolds by Semidefinite Programming journal May 2006
De-biasing the dynamic mode decomposition for applied Koopman spectral analysis of noisy datasets journal April 2017
Adaptive biasing force method for scalar and vector free energy calculations journal April 2008
Identification of slow molecular order parameters for Markov model construction journal July 2013
Machine Learning of Coarse-Grained Molecular Dynamics Force Fields journal April 2019