DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Machine Learning Force Field Parameters from Ab Initio Data

Journal Article · · Journal of Chemical Theory and Computation
 [1];  [2]; ORCiD logo [3];  [4];  [4]; ORCiD logo [5];  [5];  [3]; ORCiD logo [6]
  1. Argonne National Laboratory (ANL), Argonne, IL (United States). Argonne Leadership Computing Facility (ALCF)
  2. University of Chicago, IL (United States)
  3. National Institutes of Health (NIH), Bethesda, MD (United States)
  4. Argonne National Laboratory (ANL), Argonne, IL (United States). Center for Nanoscale Materials (CNM)
  5. Argonne National Laboratory (ANL), Argonne, IL (United States). Center for Nanoscale Materials (CNM); University of Chicago, IL (United States)
  6. University of Chicago, IL (United States); Argonne National Laboratory (ANL), Argonne, IL (United States). Center for Nanoscale Materials (CNM)

Machine learning (ML) techniques with the genetic algorithm (GA) have been applied to determine a polarizable force field parameters using only ab initio data from quantum mechanics (QM) calculations of molecular clusters at the MP2/6-31G(d,p), DFMP2(fc)/jul-cc-pVDZ, and DFMP2(fc)/jul-cc-pVTZ levels to predict experimental condensed phase properties (i.e., density and heat of vaporization). The performance of this ML/GA approach is demonstrated on 4943 dimer electrostatic potentials and 1250 cluster interaction energies for methanol. Excellent agreement between the training data set from QM calculations and the optimized force field model was achieved. Our results were further improved by introducing an offset factor during the machine learning process to compensate for the discrepancy between the QM calculated energy and the energy reproduced by optimized force field, while maintaining the local “shape” of the QM energy surface. Throughout the machine learning process, experimental observables were not involved in the objective function, but were only used for model validation. The best model, optimized from the QM data at the DFMP2(fc)/jul-cc-pVTZ level, appears to perform even better than the original AMOEBA force field (amoeba09.prm), which was optimized empirically to match liquid properties. The present effort shows the possibility of using machine learning techniques to develop descriptive polarizable force field using only QM data. The ML/GA strategy to optimize force fields parameters described here could easily be extended to other molecular systems.

Research Organization:
Argonne National Laboratory (ANL), Argonne, IL (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Basic Energy Sciences (BES); University of Chicago Research Computing Center; National Institutes of Health (NIH)
Grant/Contract Number:
AC02-06CH11357
OSTI ID:
1415482
Journal Information:
Journal of Chemical Theory and Computation, Journal Name: Journal of Chemical Theory and Computation Journal Issue: 9 Vol. 13; ISSN 1549-9618
Publisher:
American Chemical SocietyCopyright Statement
Country of Publication:
United States
Language:
English

References (69)

Psi4: an open-source ab initio electronic structure program : Psi4: an electronic structure program
  • Turney, Justin M.; Simmonett, Andrew C.; Parrish, Robert M.
  • Wiley Interdisciplinary Reviews: Computational Molecular Science, Vol. 2, Issue 4 https://doi.org/10.1002/wcms.93
journal October 2011
Publisher’s Note: “Dispersion-corrected Møller–Plesset second-order perturbation theory” [J. Chem. Phys. 131, 094106 (2009)] journal September 2009
Monte carlo study of liquid water with two- and three-body ab initio potentials journal June 2009
Representation of Ion–Protein Interactions Using the Drude Polarizable Force-Field journal September 2014
Structures of hydrogen bond networks formed by a few tens of methanol molecules in the gas phase: size-selective infrared spectroscopy of neutral and protonated methanol clusters journal January 2013
Distributed multipole analysis Methods and applications journal January 2002
Benchmark database of accurate (MP2 and CCSD(T) complete basis set limit) interaction energies of small model complexes, DNA base pairs, and amino acid pairs journal January 2006
Calculation of the reaction field due to off-center point multipoles journal July 1997
Use of a non-pair-additive intermolecular potential function to fit quantum-mechanical data on water molecule interactions journal September 1977
The origin of deficiency of the supermolecule second-order Møller-Plesset approach for evaluating interaction energies journal October 2007
Accurate modeling of the intramolecular electrostatic energy of proteins journal July 1995
All-Atom Empirical Potential for Molecular Modeling and Dynamics Studies of Proteins journal April 1998
Transferable Next-Generation Force Fields from Simple Liquids to Complex Materials journal February 2015
Flexible, ab initio potential, and dipole moment surfaces for water. I. Tests and applications for clusters up to the 22-mer journal March 2011
Determination of best-fit potential parameters for a reactive force field using a genetic algorithm journal June 2011
Distributed multipole analysis: Methods and applications journal December 1985
The possibility of obtaining an effective pairwise additive intermolecular potential via an ab initio route by fitting to a cooperative model of condensed-phase configurations journal September 1991
Molecular modeling and dynamics studies with explicit inclusion of electronic polarizability: theory and applications journal August 2009
Distributed Multipole Analysis:  Stability for Large Basis Sets journal September 2005
A simple polarizable model of water based on classical Drude oscillators journal September 2003
Steepest descent methods for multicriteria optimization journal August 2000
CI study of the water dimer potential surface journal February 1976
eReaxFF: A Pseudoclassical Treatment of Explicit Electrons within Reactive Force Field Simulations journal July 2016
First-Principles Many-Body Force Fields from the Gas Phase to Liquid: A “Universal” Approach journal April 2014
An efficient newton-like method for molecular mechanics energy minimization of large molecules journal October 1987
Modeling induced polarization with classical Drude oscillators: Theory and molecular dynamics simulation algorithm journal August 2003
Application of Gaussian Electrostatic Model (GEM) Distributed Multipoles in the AMOEBA Force Field journal September 2012
Ab Initio -Based Bond Order Potential to Investigate Low Thermal Conductivity of Stanene Nanostructures journal September 2016
Non-additivity in water-ion-water interactions journal December 1980
Roulette-wheel selection via stochastic acceptance journal March 2012
An Empirical Polarizable Force Field Based on the Classical Drude Oscillator Model: Development History and Recent Applications journal January 2016
Accurate modeling of intermolecular forces: a systematic Møller-Plesset study of the argon dimer using correlation consistent basis sets journal March 1993
The Structure of Electronic Excitation Levels in Insulating Crystals journal August 1937
Genetic algorithms for modelling and optimisation journal December 2005
Getting the Right Answers for the Right Reasons: Toward Predictive Molecular Simulations of Water with Many-Body Potential Energy Functions journal August 2016
Nonconvex minimization calculations and the conjugate gradient method book January 1984
The structure of liquid methanol revisited: a neutron diffraction experiment at −80 °C and +25 °C journal April 1999
Scaled MP3 Non-Covalent Interaction Energies Agree Closely with Accurate CCSD(T) Benchmark Data journal January 2009
Distributed multipole analysis, or how to describe a molecular charge distribution journal October 1981
A new water potential including polarization: Application to gas‐phase, liquid, and crystal properties of water journal June 1990
Automation of AMOEBA polarizable force field parameterization for small molecules journal February 2012
Ab initio calculation of electrostatic multipoles with Wannier functions for large-scale biomolecular simulations journal March 2004
Polarizable Atomic Multipole-Based Molecular Mechanics for Organic Molecules journal August 2011
Recent applications and developments of charge equilibration force fields for modeling dynamical charges in classical molecular dynamics simulations journal March 2012
Self-consistent molecular orbital methods. 21. Small split-valence basis sets for first-row elements journal January 1980
Strike a balance: Optimization of backbone torsion parameters of AMBER polarizable force field for simulations of proteins and peptides journal January 2006
Global optimization of parameters in the reactive force field ReaxFF for SiOH journal July 2013
Current Status of the AMOEBA Polarizable Force Field journal March 2010
The calculation of small molecular interactions by the differences of separate total energies. Some procedures with reduced errors journal January 2002
Describing Noncovalent Interactions beyond the Common Approximations: How Accurate Is the “Gold Standard,” CCSD(T) at the Complete Basis Set Limit? journal April 2013
Polarizable Atomic Multipole Water Model for Molecular Mechanics Simulation journal May 2003
Towards accurate prediction of catalytic activity in IrO 2 nanoclusters via first principles-based variable charge force field journal January 2015
ReaxFF:  A Reactive Force Field for Hydrocarbons journal October 2001
Physically-Motivated Force Fields from Symmetry-Adapted Perturbation Theory journal February 2013
The representation of van der Waals (vdW) interactions in molecular mechanics force fields: potential form, combination rules, and vdW parameters journal September 1992
Effective force fields for condensed phase systems from ab initio molecular dynamics simulation: A new method for force-matching journal June 2004
Maximally localized generalized Wannier functions for composite energy bands journal November 1997
Empirical force fields for biological macromolecules: Overview and issues journal January 2004
Algorithms for calculating excluded volume and its derivatives as a function of molecular conformation and their use in energy minimization journal April 1991
Levels of symmetry adapted perturbation theory (SAPT). I. Efficiency and performance for interaction energies journal March 2014
Analysis and Application of Potential Energy Smoothing and Search Methods for Global Optimization journal November 1998
Predictions of the Properties of Water from First Principles journal March 2007
Gaussian basis sets for use in correlated molecular calculations. I. The atoms boron through neon and hydrogen journal January 1989
MP2 energy evaluation by direct methods journal December 1988
Next-Generation Force Fields from Symmetry-Adapted Perturbation Theory journal May 2016
Development of a Modified Embedded Atom Force Field for Zirconium Nitride Using Multi-Objective Evolutionary Optimization journal July 2016
Dynamical fluctuating charge force fields: Application to liquid water journal October 1994
Calculations of electrostatic properties in proteins journal December 1987
Describing the Diverse Geometries of Gold from Nanoclusters to Bulk—A First-Principles-Based Hybrid Bond-Order Potential journal June 2016

Cited By (1)

The TensorMol-0.1 model chemistry: a neural network augmented with long-range physics journal January 2018