skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: A Look Inside the Black Box: Using graph-theoretical descriptors to interpret a Continuous-Filter Convolutional Neural Network (CF-CNN) trained on the global and local minimum energy structures of neutral water clusters

Journal Article · · Journal of Chemical Physics
DOI:https://doi.org/10.1063/5.0009933· OSTI ID:1644767

A Continuous Filter Convolutional Neural Network (CF-CNN) was trained to predict the potential energy of water cluster networks \ce{(H2O)_{\textit{N}}}, \textit{N}=10--30, corresponding to local minima lying within 5 kcal/mol from the putative minima taken from a newly published database containing over 5 million unique networks. The chemical sampling space of the database was characterized using chemical descriptors derived from graph theory, which led to the identification of important trends in the topology, connectivity, polygon structures associated with the various networks as a function of cluster size. The resulting graphs are available alongside the original database at \url{https://sites.uw.edu/wdbase/}. The CF-CNN trained on a subset of 500,000 networks for (\textit{N}=10, 30) yielded a mean absolute error of 0.002$$\pm$$0.002 kcal/mol per water molecule, giving the trained CF-CNN the highest accuracy of any neural network-based surrogate model to date. In addition, clusters of sizes not included in the training set exhibited errors of the same magnitude, indicating that the CF-CNN ptotocol is general enough to accurately predict energies of networks for both smaller and larger sizes than those used during training. The graph-theoretical descriptors were developed in order to analyze the properties of the full database and interpret the predictive power of the CF-CNN. Using topology measures, such as the Wiener index and the average shortest path length along with two similarity measures, we showed that all networks from the test set were within the range of the ones from the training set, suggesting that the training set covered the chemical space of interest quite well. Our graph analysis suggests that the mean degree and number of polygons for networks with larger errors tend to lie further from the mean than those with lower errors. The generality of the used CF-CNN was thus demonstrated, while the use of the graph-theoretical descriptors assisted in interpreting the predicted results.

Research Organization:
Pacific Northwest National Lab. (PNNL), Richland, WA (United States)
Sponsoring Organization:
USDOE
DOE Contract Number:
AC05-76RL01830
OSTI ID:
1644767
Report Number(s):
PNNL-SA-152462
Journal Information:
Journal of Chemical Physics, Vol. 153, Issue 2
Country of Publication:
United States
Language:
English

References (44)

Concepts of Artificial Intelligence for Computer-Assisted Drug Discovery journal July 2019
Inverse molecular design using machine learning: Generative models for matter engineering journal July 2018
Machine Learning in Computer-Aided Synthesis Planning journal April 2018
First Principles Neural Network Potentials for Reactive Simulations of Large Molecular and Condensed Systems journal August 2017
ANI-1: an extensible neural network potential with DFT accuracy at force field computational cost journal January 2017
Hierarchical modeling of molecular energies using a deep neural network journal June 2018
Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations journal February 2019
Atlas of putative minima and low-lying energy networks of water clusters n = 3–25 journal December 2019
Development of transferable interaction models for water. IV. A flexible, all-atom polarizable potential (TTM2-F) based on geometry dependent charges derived from an ab initio monomer dipole moment surface journal January 2002
The Flexible, Polarizable, Thole-Type Interaction Potential for Water (TTM2-F) Revisited journal March 2006
Development of transferable interaction potentials for water. V. Extension of the flexible, polarizable, Thole-type model potential ( TTM3-F , v. 3.0) to describe the vibrational spectra of water clusters and liquid water journal February 2008
Towards an ab initio flexible potential for water, and post-harmonic quantum vibrational analysis of water clusters journal May 2010
Development of a “First-Principles” Water Potential with Flexible Monomers. III. Liquid Phase Properties journal July 2014
A quantitative account of quantum effects in liquid water journal October 2006
Quantum effects in liquid water from an ab initio -based polarizable force field journal August 2007
Infrared Spectroscopy and Hydrogen-Bond Dynamics of Liquid Water from Centroid Molecular Dynamics with an Ab Initio-Based Force Field journal October 2009
Nuclear Quantum Effects in the Reorientation of Water journal July 2010
Effective force fields for condensed phase systems from ab initio molecular dynamics simulation: A new method for force-matching journal June 2004
Dynamic force matching: A method for constructing dynamical coarse-grained models with realistic time dependence journal April 2015
Machine learning of accurate energy-conserving molecular force fields journal May 2017
Structure-based sampling and self-correcting machine learning for accurate calculations of potential energy surfaces and vibrational levels journal June 2017
De novo exploration and self-guided learning of potential-energy surfaces journal October 2019
A Density-Functional Theory-Based Neural Network Potential for Water Clusters Including van der Waals Corrections journal April 2013
Representing the potential-energy surface of protonated water clusters by high-dimensional neural network potentials journal January 2015
High-dimensional neural network potentials for solvation: The case of protonated water clusters in helium journal March 2018
Generalized Neural-Network Representation of High-Dimensional Potential-Energy Surfaces journal April 2007
Atom-centered symmetry functions for constructing high-dimensional neural network potentials journal February 2011
Library-Based LAMMPS Implementation of High-Dimensional Neural Network Potentials journal January 2019
Parallel Multistream Training of High-Dimensional Neural Network Potentials journal April 2019
Constructing high-dimensional neural network potentials: A tutorial review journal March 2015
Quantum-chemical insights from deep tensor neural networks journal January 2017
SchNet – A deep learning architecture for molecules and materials journal June 2018
SchNetPack: A Deep Learning Toolbox For Atomistic Systems journal November 2018
Global Optimization by Basin-Hopping and the Lowest Energy Structures of Lennard-Jones Clusters Containing up to 110 Atoms journal July 1997
Finding low energy minima of (H2O)25 and (H2O)30 with temperature basin paving Monte Carlo method with effective fragment potential: New ‘global minimum’ and graph theoretical characterization of low energy structures journal October 2013
Understanding the structure and hydrogen bonding network of (H 2 O) 32 and (H 2 O) 33 : an improved Monte Carlo temperature basin paving (MCTBP) method and quantum theory of atoms in molecules (QTAIM) analysis journal January 2017
Hydrogen bonding definitions and dynamics in liquid water journal May 2007
On the use of graph invariants for efficiently generating hydrogen bond topologies and predicting physical properties of water clusters and ice journal February 2001
Hydrogen-Bond Networks in Water Clusters (H 2 O) 20 : An Exhaustive Quantum-Chemical Analysis journal February 2010
Low-Energy Networks of the T-Cage (H 2 O) 24 Cluster and Their Use in Constructing Periodic Unit Cells of the Structure I (sI) Hydrate Lattice journal June 2009
Collective dynamics of ‘small-world’ networks journal June 1998
Hydrogen-bond patterns in liquid water journal November 1973
Promise and challenge of high-performance computing, with examples from molecular modelling
  • Dunning, Thom H.; Harrison, Robert J.; Feller, David
  • Philosophical Transactions of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences, Vol. 360, Issue 1795 https://doi.org/10.1098/rsta.2002.0984
journal April 2002
High-level ab initio calculations for the four low-lying families of minima of (H[sub 2]O)[sub 20]. I. Estimates of MP2/CBS binding energies and comparison with empirical potentials journal January 2004