skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Predicting Small Molecule Transfer Free Energies by Combining Molecular Dynamics Simulations and Deep Learning

Journal Article · · Journal of Chemical Information and Modeling
ORCiD logo [1];  [2];  [1];  [2];  [1];  [3];  [2];  [1]; ORCiD logo [1]
  1. Biochemical and Biophysical Systems Group, Biosciences and Biotechnology Division, Lawrence Livermore National Laboratory, 7000 East Avenue, Livermore, California, United States
  2. Global Security Computing Applications, Lawrence Livermore National Laboratory, 7000 East Avenue, Livermore, California, United States
  3. Center for Applied Scientific Computing, Lawrence Livermore National Laboratory, 7000 East Avenue, Livermore, California, United States

Accurately predicting small molecule partitioning and hydrophobicity is critical in the drug discovery process. There are many heterogeneous chemical environments within a cell and entire human body. For example, drugs must be able to cross the hydrophobic cellular membrane to reach their intracellular targets, and hydrophobicity is an important driving force for drug–protein binding. Atomistic molecular dynamics (MD) simulations are routinely used to calculate free energies of small molecules binding to proteins, crossing lipid membranes, and solvation but are computationally expensive. Machine learning (ML) and empirical methods are also used throughout drug discovery but rely on experimental data, limiting the domain of applicability. We present atomistic MD simulations calculating 15,000 small molecule free energies of transfer from water to cyclohexane. This large data set is used to train ML models that predict the free energies of transfer. We show that a spatial graph neural network model achieves the highest accuracy, followed closely by a 3D-convolutional neural network, and shallow learning based on the chemical fingerprint is significantly less accurate. A mean absolute error of ~4 kJ/mol compared to the MD calculations was achieved for our best ML model. We also show that including data from the MD simulation improves the predictions, tests the transferability of each model to a diverse set of molecules, and show multitask learning improves the predictions. This work provides insight into the hydrophobicity of small molecules and ML cheminformatics modeling, and our data set will be useful for designing and testing future ML cheminformatics methods.

Research Organization:
Lawrence Livermore National Laboratory (LLNL), Livermore, CA (United States)
Sponsoring Organization:
USDOE National Nuclear Security Administration (NNSA); USDOE Laboratory Directed Research and Development (LDRD) Program; American Heart Association (AHA)
Grant/Contract Number:
AC52-07NA27344
OSTI ID:
1657273
Alternate ID(s):
OSTI ID: 1729734
Report Number(s):
LLNL-JRNL-774697
Journal Information:
Journal of Chemical Information and Modeling, Journal Name: Journal of Chemical Information and Modeling; ISSN 1549-9596
Publisher:
American Chemical SocietyCopyright Statement
Country of Publication:
United States
Language:
English

References (36)

Identification and Prediction of Promiscuous Aggregating Inhibitors among Known Drugs journal October 2003
Sampling errors in free energy simulations of small molecules in lipid bilayers journal October 2016
Perspective on the Martini model journal January 2013
ADME Evaluation in Drug Discovery. 2. Prediction of Partition Coefficient by Atom-Additive Approach Based on Atom-Weighted Solvent Accessible Surface Areas journal May 2003
Fluorinated Alcohols’ Effects on Lipid Bilayer Properties journal August 2018
Blind prediction of cyclohexane–water distribution coefficients from the SAMPL5 challenge journal September 2016
Basic ingredients of free energy calculations: A review journal January 2009
Alcohol's Effects on Lipid Bilayer Properties journal August 2011
Extended-Connectivity Fingerprints journal April 2010
Distribution of Amino Acids in a Lipid Bilayer from Computer Simulations journal May 2008
The Properties of Known Drugs. 1. Molecular Frameworks journal January 1996
Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings journal January 1997
Lessons learned about steered molecular dynamics simulations and free energy calculations
  • Boubeta, Fernando Martín; Contestín García, Rocío María; Lorenzo, Ezequiel Norberto
  • Chemical Biology & Drug Design, Vol. 93, Issue 6 https://doi.org/10.1111/cbdd.13485
journal February 2019
Drug metabolism and pharmacokinetics, the blood-brain barrier, and central nervous system drug discovery journal October 2005
Advancing Drug Discovery through Enhanced Free Energy Calculations journal July 2017
Dissecting Machine-Learning Prediction of Molecular Activity: Is an Applicability Domain Needed for Quantitative Structure–Activity Relationship Models Based on Deep Neural Networks? journal November 2018
PotentialNet for Molecular Property Prediction journal November 2018
Small Molecule Hydration Free Energies in Explicit Solvent: An Extensive Test of Fixed-Charge Atomistic Simulations journal January 2009
Predicting a Drug’s Membrane Permeability: A Computational Model Validated With in Vitro Permeability Assay Data journal May 2017
The MARTINI Coarse-Grained Force Field: Extension to Proteins journal April 2008
The MARTINI Force Field:  Coarse Grained Model for Biomolecular Simulations journal July 2007
Molecular Dynamics Fingerprints (MDFP): Machine Learning from MD Data To Predict Free-Energy Differences journal April 2017
In silico screening of drug-membrane thermodynamics reveals linear relations between bulk partitioning and the potential of mean force journal September 2017
Transfer of Arginine into Lipid Bilayers Is Nonadditive journal July 2011
Calculating the free energy of transfer of small solutes into a model lipid membrane: Comparison between metadynamics and umbrella sampling journal October 2015
Statistical Convergence of Equilibrium Properties in Simulations of Molecular Solutes Embedded in Lipid Bilayers journal October 2011
Hydrophobicity scales: a thermodynamic looking glass into lipid–protein interactions journal December 2011
Lead- and drug-like compounds: the rule-of-five revolution journal December 2004
The COSMO and COSMO-RS solvation models: The COSMO and COSMO-RS solvation models journal April 2011
State of the art and prospects of methods for determination of lipophilicity of chemical compounds journal April 2019
Automatic Chemical Design Using a Data-Driven Continuous Representation of Molecules journal January 2018
Molecular dynamics simulations and drug discovery journal October 2011
Characterizing the Chemical Space of ERK2 Kinase Inhibitors Using Descriptors Computed from Molecular Dynamics Trajectories journal May 2017
The Importance of Membrane Defects—Lessons from Simulations journal June 2014
Hydrophobic hydration from small to large lengthscales: Understanding and manipulating the crossover journal June 2005
A Method to Predict Blood-Brain Barrier Permeability of Drug-Like Compounds Using Molecular Dynamics Simulations journal August 2014

Similar Records

Transferable Dynamic Molecular Charge Assignment Using Deep Neural Networks
Journal Article · Tue Jul 31 00:00:00 EDT 2018 · Journal of Chemical Theory and Computation · OSTI ID:1657273

GDB-9-Ex: Quantum chemical prediction of UV/Vis absorption spectra for GDB-9 molecules
Dataset · Tue Nov 22 00:00:00 EST 2022 · OSTI ID:1657273

Using Computationally-Determined Properties for Machine Learning Prediction of Self-Diffusion Coefficients in Pure Liquids
Journal Article · Thu Nov 18 00:00:00 EST 2021 · Journal of Physical Chemistry. B, Condensed Matter, Materials, Surfaces, Interfaces and Biophysical Chemistry · OSTI ID:1657273