U.S. Department of Energy
Office of Scientific and Technical Information

DeepDriveMD: Deep-Learning Driven Adaptive Molecular Simulations for Protein Folding

Conference
Simulations of biological macromolecules are important in understanding the physical basis of complex processes such as protein folding. However, even with increasing computational capacity and specialized architectures, simulating protein folding at atomistic scales remains challenging. This stems from two factors: the high dimensionality of protein conformational landscapes, and the inability of atomistic molecular dynamics (MD) simulations to sample these landscapes sufficiently to observe folding events. Machine learning/deep learning (ML/DL) techniques, when combined with atomistic MD simulations, offer the opportunity to overcome these limitations by: (1) effectively reducing the dimensionality of MD simulations to automatically build latent representations that correspond to biophysically relevant reaction coordinates (RCs), and (2) driving MD simulations to automatically sample potentially novel conformational states based on these RCs. We examine how coupling DL approaches with MD simulations can lead to effective approaches to fold small proteins on supercomputers. In particular, we study the computational costs and effectiveness of scaling DL-coupled MD workflows implemented using RADICAL-Cybertools in folding two prototypical systems, namely Fs-peptide and the fast-folding variant of the villin headpiece protein. We demonstrate that a DL-coupled MD workflow is able to effectively learn latent representations and drive adaptive simulations. Compared to traditional MD-based approaches, our approach achieves an effective performance gain of at least 2.3x in sampling the folded states. Together, our study provides a quantitative basis for understanding how coupling DL approaches to MD simulations can lead to effective performance gains and reduced times to solution on supercomputing resources.
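The two steps named in the abstract, building a low-dimensional latent coordinate from simulation frames and then restarting simulations from sparsely sampled regions of that coordinate, can be illustrated with a toy loop. The sketch below is not the authors' workflow and does not use RADICAL-Cybertools: a random walk stands in for MD, a leading principal axis (found by power iteration) stands in for the deep-learning encoder, and "most extreme latent values" is used as a crude proxy for novel states. All function names and parameters are hypothetical.

```python
import random


def run_toy_md(start, n_steps, rng, step=0.1):
    """Stand-in for one MD segment: an unbiased random walk from `start`."""
    frames, x = [], list(start)
    for _ in range(n_steps):
        x = [xi + rng.gauss(0.0, step) for xi in x]
        frames.append(tuple(x))
    return frames


def first_principal_axis(frames, n_iter=50):
    """Stand-in for a learned encoder: leading PCA axis via power iteration."""
    dim = len(frames[0])
    mean = [sum(f[d] for f in frames) / len(frames) for d in range(dim)]
    centered = [[f[d] - mean[d] for d in range(dim)] for f in frames]
    axis = [1.0] * dim
    for _ in range(n_iter):
        # Apply the (unnormalized) covariance without forming it: X^T (X v).
        proj = [sum(c[d] * axis[d] for d in range(dim)) for c in centered]
        new = [sum(p * c[d] for p, c in zip(proj, centered)) for d in range(dim)]
        norm = sum(v * v for v in new) ** 0.5
        if norm == 0.0:
            break
        axis = [v / norm for v in new]
    return mean, axis


def encode(frame, mean, axis):
    """Project one frame onto the latent axis (a 1D 'reaction coordinate')."""
    return sum((x - m) * a for x, m, a in zip(frame, mean, axis))


def pick_restarts(frames, latents, n_restarts):
    """Pick frames with the most extreme latent values as restart points."""
    ranked = sorted(range(len(frames)), key=lambda i: abs(latents[i]), reverse=True)
    return [frames[i] for i in ranked[:n_restarts]]


def adaptive_sampling(n_rounds=5, n_walkers=4, n_steps=200, dim=3, seed=0):
    """Alternate simulation, latent-space fitting, and restart selection."""
    rng = random.Random(seed)
    starts = [(0.0,) * dim for _ in range(n_walkers)]
    all_frames = []
    for _ in range(n_rounds):
        for s in starts:
            all_frames.extend(run_toy_md(s, n_steps, rng))
        mean, axis = first_principal_axis(all_frames)
        latents = [encode(f, mean, axis) for f in all_frames]
        starts = pick_restarts(all_frames, latents, n_walkers)
    return all_frames


if __name__ == "__main__":
    frames = adaptive_sampling()
    print(len(frames), "frames collected")
```

In a real setting the random walk would be replaced by an MD engine, the principal axis by a trained model, and the restart rule by a proper outlier or density criterion in latent space; the loop structure is the only part this sketch is meant to convey.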
Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE
DOE Contract Number:
AC05-00OR22725
OSTI ID:
1649043
Country of Publication:
United States
Language:
English

Similar Records

Deep clustering of protein folding simulations
Journal Article · Thu Dec 20 19:00:00 EST 2018 · BMC Bioinformatics · OSTI ID:1513368

A Deep Learning-Driven Sampling Technique to Explore the Phase Space of an RNA Stem-Loop
Journal Article · Sun Oct 06 20:00:00 EDT 2024 · Journal of Chemical Theory and Computation · OSTI ID:2586552

Integrating solvation shell structure in experimentally driven molecular dynamics using x-ray solution scattering data
Journal Article · Fri May 29 00:00:00 EDT 2020 · Journal of Chemical Physics · OSTI ID:1635268
