DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Parsimonious Potential Energy Surface Expansions Using Dictionary Learning with Multipass Greedy Selection

Journal Article · · Journal of Physical Chemistry Letters

Potential energy surfaces fit with basis set expansions have been shown to provide accurate representations of electronic energies and have enabled a variety of high-accuracy dynamics, kinetics, and spectroscopy applications. The number of terms in these expansions scales poorly with system size, a drawback that challenges their use for systems with more than similar to 10 atoms. A solution is presented here using dictionary learning. Subsets of the full set of conventional basis functions are optimized using a newly developed multipass greedy regression method inspired by forward and backward selection methods from the statistics, signal processing, and machine learning literatures. Here, the optimized representations have accuracies comparable to the full set but are 1 or more orders of magnitude smaller, and notably, the number of terms in the optimized multipass greedy expansions scales approximately linearly with the number of atoms.

Research Organization:
Argonne National Laboratory (ANL), Argonne, IL (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Basic Energy Sciences (BES). Chemical Sciences, Geosciences & Biosciences Division
Grant/Contract Number:
AC02-06CH11357
OSTI ID:
1854517
Journal Information:
Journal of Physical Chemistry Letters, Journal Name: Journal of Physical Chemistry Letters Journal Issue: 37 Vol. 12; ISSN 1948-7185
Publisher:
American Chemical SocietyCopyright Statement
Country of Publication:
United States
Language:
English

References (42)

The Elements of Statistical Learning book January 2009
Sparse and Redundant Representations book January 2010
The development of a new Morse-oscillator based rotation-vibration Hamiltonian for H3+ journal July 1985
Perspectives on CUR decompositions journal May 2020
Termolecular chemistry facilitated by radical-radical recombinations and its impact on flame speed predictions journal January 2021
Neural Network Potential Energy Surfaces for Small Molecules and Reactions journal October 2020
Efficient Generation of Permutationally Invariant Potential Energy Surfaces for Large Molecules journal March 2020
Permutationally Invariant, Reproducing Kernel-Based Potential Energy Surfaces for Polyatomic Molecules: From Formaldehyde to Acetone journal July 2020
Permutationally Invariant Polynomial Expansions with Unrestricted Complexity journal September 2021
Automating the Development of High-Dimensional Reactive Potential Energy Surfaces with the robosurfer Program System journal November 2019
Data-Driven Many-Body Models for Molecular Fluids: CO 2 /H 2 O Mixtures as a Case Study journal March 2020
Global Sensitivity Analysis with Small Sample Sizes: Ordinary Least Squares Approach journal January 2017
Sparsity Facilitates Chemical-Reaction Selection for Engine Simulations journal August 2018
Parameterization Strategies for Intermolecular Potentials for Predicting Trajectory-Based Collision Parameters journal March 2019
Anharmonic Rovibrational Partition Functions at High Temperatures: Tests of Reduced-Dimensional Models for Systems with up to Three Fluxional Modes journal June 2019
High-Fidelity Potential Energy Surfaces for Gas-Phase and Gas–Surface Scattering Processes from Machine Learning journal June 2020
Breaking the Coupled Cluster Barrier for Machine-Learned Potentials of Large Molecules: The Case of 15-Atom Acetylacetone journal May 2021
Development of a “First Principles” Water Potential with Flexible Monomers: Dimer Potential Energy Surface, VRT Spectrum, and Second Virial Coefficient journal November 2013
Unimolecular dissociation dynamics of vibrationally activated CH3CHOO Criegee intermediates to OH radical products journal April 2016
Full-dimensional potential energy surface for acetylacetone and tunneling splittings journal January 2021
Global ab initio ground-state potential energy surface of N 4 journal July 2013
Communication: A benchmark-quality, full-dimensional ab initio potential energy surface for Ar-HOCO journal April 2014
On the accuracy of the MB-pol many-body potential for water: Interaction energies, vibrational frequencies, and classical thermodynamic and dynamical properties from clusters to liquid water and ice journal November 2016
A fragmented, permutationally invariant polynomial approach for potential energy surfaces of large molecules: Application to N -methyl acetamide journal April 2019
Full and fragmented permutationally invariant polynomial potential energy surfaces for trans and cis N -methyl acetamide and isomerization saddle points journal August 2019
Permutationally invariant polynomial potential energy surfaces for tropolone and H and D atom tunneling dynamics journal July 2020
CUR matrix decompositions for improved data analysis journal January 2009
Permutationally invariant potential energy surfaces in high dimensionality journal October 2009
Single- and multireference electronic structure calculations for constructing potential energy surfaces journal June 2016
Potential energy surfaces from high fidelity fitting of ab initio points: the permutation invariant polynomial - neural network approach journal July 2016
MIP-BOOST: Efficient and Effective L 0 Feature Selection for Linear Regression journal January 2021
Dictionaries for Sparse Representation Modeling journal June 2010
Dictionary Learning journal March 2011
Signal Recovery From Random Measurements Via Orthogonal Matching Pursuit journal December 2007
Adaptive Forward-Backward Greedy Algorithm for Learning Sparse Representations journal July 2011
Regression shrinkage and selection via the lasso: a retrospective: Regression Shrinkage and Selection via the Lasso journal April 2011
Regression Shrinkage and Selection Via the Lasso journal January 1996
An Improved Approximation Algorithm for the Column Subset Selection Problem conference December 2013
Updating the Inverse of a Matrix journal June 1989
Near-Optimal Column-Based Matrix Reconstruction journal January 2014
On the Optimality of the Backward Greedy Algorithm for the Subset Selection Problem journal January 2000
Statistical Learning with Sparsity book May 2015