OSTI.GOV · U.S. Department of Energy
Office of Scientific and Technical Information

Title: Navigating Transition-Metal Chemical Space: Artificial Intelligence for First-Principles Design

Journal Article · Accounts of Chemical Research

Conspectus: The variability of chemical bonding in open-shell transition-metal complexes not only motivates their study as functional materials and catalysts but also challenges conventional computational modeling tools. Here, tailoring ligand chemistry can alter preferred spin or oxidation states as well as electronic structure properties and reactivity, creating vast regions of chemical space to explore when designing new materials atom by atom. Although first-principles density functional theory (DFT) remains the workhorse of computational chemistry in mechanism deduction and property prediction, it is of limited use here. DFT is both far too computationally costly for widespread exploration of transition-metal chemical space and also prone to inaccuracies that limit its predictive performance for localized d electrons in transition-metal complexes. These challenges starkly contrast with the well-trodden regions of small-organic-molecule chemical space, where the analytical forms of molecular mechanics force fields and semiempirical theories have for decades accelerated the discovery of new molecules, accurate DFT functional performance has been demonstrated, and gold-standard methods from correlated wavefunction theory can predict experimental results to chemical accuracy. The combined promise of transition-metal chemical space exploration and lack of established tools has mandated a distinct approach. In this Account, we outline the path we charted in exploration of transition-metal chemical space starting from the first machine learning (ML) models (i.e., artificial neural network and kernel ridge regression) and representations for the prediction of open-shell transition-metal complex properties.
The distinct importance of the immediate coordination environment of the metal center as well as the lack of low-level methods to accurately predict structural properties in this coordination environment first motivated and then benefited from these ML models and representations. Once developed, the recipe for prediction of geometric, spin state, and redox potential properties was straightforwardly extended to a diverse range of other properties, including in catalysis, computational “feasibility”, and the gas separation properties of periodic metal–organic frameworks. Interpretation of selected features most important for model prediction revealed new ways to encapsulate design rules and confirmed that models were robustly mapping essential structure–property relationships. Encountering the special challenge of ensuring that good model performance could generalize to new discovery targets motivated investigation of how to best carry out model uncertainty quantification. Distance-based approaches, whether in model latent space or in carefully engineered feature space, provided intuitive measures of the domain of applicability. With all of these pieces together, ML can be harnessed as an engine to tackle the large-scale exploration of transition-metal chemical space needed to satisfy multiple objectives using efficient global optimization methods. In practical terms, bringing these artificial intelligence tools to bear on the problems of transition-metal chemical space exploration has resulted in ML-model assessments of large, multimillion compound spaces in minutes and validated new design leads in weeks instead of decades.

Research Organization:
Univ. of Minnesota, Minneapolis, MN (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Basic Energy Sciences (BES); US Department of the Navy, Office of Naval Research (ONR); Defense Advanced Research Projects Agency (DARPA); National Science Foundation (NSF)
Grant/Contract Number:
SC0012702; SC0018096; N00014-17-1-2956; N00014-18-1-2434; N00014-20-1-2150; D18AP00039; ACI-1547580; CBET-1704266; CBET-1846426; 1122374
OSTI ID:
1781598
Alternate ID(s):
OSTI ID: 1777836
Journal Information:
Accounts of Chemical Research, Vol. 54, Issue 3; ISSN 0001-4842
Publisher:
American Chemical Society
Country of Publication:
United States
Language:
English
