Fourier series of atomic radial distribution functions: A molecular fingerprint for machine learning models of quantum chemical properties
Journal Article
·
· International Journal of Quantum Chemistry
- University of Basel (Switzerland); Argonne National Laboratory (ANL), Argonne, IL (United States). Argonne Leadership Computing Facility (ALCF)
- University of Basel (Switzerland)
- Argonne National Laboratory (ANL), Argonne, IL (United States). Mathematics and Computer Science Division; University of Texas, Austin, TX (United States)
Here we introduce a fingerprint representation of molecules based on a Fourier series of atomic radial distribution functions. This fingerprint is unique (except for chirality), continuous, and differentiable with respect to atomic coordinates and nuclear charges. It is invariant with respect to translation, rotation, and nuclear permutation, and requires no preconceived knowledge about chemical bonding, topology, or electronic orbitals. As such, it meets many important criteria for a good molecular representation, suggesting its usefulness for machine learning models of molecular properties trained across chemical compound space. To assess the performance of this new descriptor, we have trained machine learning models of molecular enthalpies of atomization for training sets with up to 10 k organic molecules, drawn at random from a published set of 134 k organic molecules with an average atomization enthalpy of over 1770 kcal/mol. We validate the descriptor on all remaining molecules of the 134 k set. For a training set of 10 k molecules, the fingerprint descriptor achieves a mean absolute error of 8.0 kcal/mol. This is slightly worse than the performance attained using the Coulomb matrix, another popular alternative, reaching 6.2 kcal/mol for the same training and test sets.
- Research Organization:
- Argonne National Laboratory (ANL), Argonne, IL (United States)
- Sponsoring Organization:
- Swiss National Science Foundation; USDOE Laboratory Directed Research and Development (LDRD) Program; USDOE Office of Science (SC)
- Grant/Contract Number:
- AC02-06CH11357
- OSTI ID:
- 1392322
- Journal Information:
- International Journal of Quantum Chemistry, Journal Name: International Journal of Quantum Chemistry Journal Issue: 16 Vol. 115; ISSN 0020-7608
- Publisher:
- WileyCopyright Statement
- Country of Publication:
- United States
- Language:
- English
Neural Networks for the Prediction of Organic Chemistry Reactions
|
journal | October 2016 |
A Universal 3D Voxel Descriptor for Solid-State Material Informatics with Deep Convolutional Neural Networks
|
journal | December 2017 |
| The octet rule in chemical space: Generating virtual molecules | text | January 2017 |
Similar Records
Quantum-Chemically Informed Machine Learning: Prediction of Energies of Organic Molecules with 10 to 14 Non-hydrogen Atoms
Machine Learning for Prediction of Thermodynamic Descriptors
Machine Learning of Parameters for Accurate Semiempirical Quantum Chemical Calculations
Journal Article
·
Sun Jun 14 20:00:00 EDT 2020
· Journal of Physical Chemistry. A, Molecules, Spectroscopy, Kinetics, Environment, and General Theory
·
OSTI ID:1656872
Machine Learning for Prediction of Thermodynamic Descriptors
Technical Report
·
Fri Sep 29 00:00:00 EDT 2023
·
OSTI ID:2203236
Machine Learning of Parameters for Accurate Semiempirical Quantum Chemical Calculations
Journal Article
·
Tue May 12 00:00:00 EDT 2015
· Journal of Chemical Theory and Computation
·
OSTI ID:1392016