DOE PAGES, U.S. Department of Energy
Office of Scientific and Technical Information

Title: Many-body expansion based machine learning models for octahedral transition metal complexes

Journal Article · Machine Learning: Science and Technology

Abstract: Graph-based machine learning (ML) models for material properties show great potential to accelerate virtual high-throughput screening of large chemical spaces. However, in their simplest forms, graph-based models do not include any 3D information and are unable to distinguish stereoisomers such as those arising from different orderings of ligands around a metal center in coordination complexes. In this work we present a modification to revised autocorrelation descriptors, a molecular graph featurization method, for predicting spin state dependent properties of octahedral transition metal complexes (TMCs). Inspired by analytical semi-empirical models for TMCs, the new modeling strategy is based on the many-body expansion (MBE) and allows one to tune the captured stereoisomer information by changing the truncation order of the MBE. We present the necessary modifications to include this approach in two commonly used ML methods, kernel ridge regression and feed-forward neural networks. On a test set composed of all possible isomers of binary TMCs, the best MBE models achieve mean absolute errors (MAEs) of 2.75 kcal mol⁻¹ on spin-splitting energies and 0.26 eV on frontier orbital energy gaps, a 30%–40% reduction in error compared to models based on our previous approach. We also observe improved generalization to previously unseen ligands where the best-performing models exhibit MAEs of 4.00 kcal mol⁻¹ (i.e. a 0.73 kcal mol⁻¹ reduction) on the spin-splitting energies and 0.53 eV (i.e. a 0.10 eV reduction) on the frontier orbital energy gaps. Because the new approach incorporates insights from electronic structure theory, such as ligand additivity relationships, these models exhibit systematic generalization from homoleptic to heteroleptic complexes, allowing for efficient screening of TMC search spaces.
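
As a rough illustration of why the truncation order controls how much stereoisomer information is captured, the toy sketch below enumerates one-body and two-body terms for an octahedral complex. It is not the authors' featurization or model: the site numbering, the `TRANS_PAIRS` table, the function names, and the ligand labels "A" and "B" are all hypothetical choices made for this example. It only shows the combinatorics: terms that depend on single ligands cannot tell a cis isomer from a trans isomer, while terms that depend on ligand pairs and their relative placement can.

```python
from collections import Counter
from itertools import combinations

# Assumed site labelling (hypothetical): sites (0, 1), (2, 3) and (4, 5)
# sit opposite each other on the octahedron; every other pair is cis.
TRANS_PAIRS = {(0, 1), (2, 3), (4, 5)}


def one_body_terms(ligands):
    """Count ligands by identity: all a first-order truncation can see."""
    return Counter(ligands)


def two_body_terms(ligands):
    """Count ligand pairs, labelled by their cis/trans relation."""
    terms = Counter()
    for i, j in combinations(range(6), 2):
        relation = "trans" if (i, j) in TRANS_PAIRS else "cis"
        pair = tuple(sorted((ligands[i], ligands[j])))
        terms[(pair, relation)] += 1
    return terms


# Two isomers of a binary complex with four "A" and two "B" ligands.
trans_isomer = ["B", "B", "A", "A", "A", "A"]  # B on sites 0 and 1 (trans)
cis_isomer = ["B", "A", "B", "A", "A", "A"]    # B on sites 0 and 2 (cis)

# First order: identical ligand counts, so the isomers are indistinguishable.
print(one_body_terms(trans_isomer) == one_body_terms(cis_isomer))
# Second order: the pair terms differ, so the isomers are distinguishable.
print(two_body_terms(trans_isomer) != two_body_terms(cis_isomer))
```

In this toy picture, a model built only on the one-body counts would assign both isomers the same prediction, whereas one that also uses the pair terms has the information needed to separate them.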

Sponsoring Organization:
USDOE
Grant/Contract Number:
NA0003965
OSTI ID:
2496813
Journal Information:
Journal Name: Machine Learning: Science and Technology; Journal Volume: 5; Journal Issue: 4; ISSN 2632-2153
Publisher:
IOP Publishing
Country of Publication:
United Kingdom
Language:
English
