DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Data Analytics for Catalysis Predictions: Are We Ready Yet?

Journal Article · · ACS Catalysis

Catalysis informatics has received tremendous attention in recent years as a tool to design catalysts and discover unique descriptors that capture the relationships between chemical properties and catalytic performance. One of the stop-gaps in understanding catalytic effects, which is often ignored and limits the deployment of data science tools, relates to the lack of uniform data. The catalytic cleavage of C–X (X= H, C, N, and O) bonds is relevant to many fundamental catalytic processes. In this Perspective, we performed data analytics on four groups of C–X cleavage reactions that are common in production, upcycling, or reactive separation: the C–C cleavage in cyclopropyl alcohol, the C–H cleavage in hydroacylation reactions, the C–O cleavage in β-O-4 linkages, and the C–N cleavage in amides, using experimental data collected from the literature to understand their underlying correlations. Experimental variables of high impact are identified for each reaction by dimensionality reduction methods. We highlight the urgent need for experimental data sets that include full details on the reaction conditions, such as reagent concentration, reaction temperature, or time in machine-readable forms. We discuss the potential improvement of the data of these reactions and promising approaches such as autonomous experiments to fill the gaps in unbiased experimental data. Finally, we also address the early stage consideration of separation aspects in the experimental design of efficient catalytic systems for these fundamental examples of chemical reactivity.

Research Organization:
Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
Sponsoring Organization:
USDOE Office of Energy Efficiency and Renewable Energy (EERE), Office of Sustainable Transportation. Bioenergy Technologies Office (BETO); USDOE Laboratory Directed Research and Development (LDRD) Program
Grant/Contract Number:
AC05-76RL01830; AC05-00OR22725
OSTI ID:
2448368
Report Number(s):
PNNL-SA--192178
Journal Information:
ACS Catalysis, Journal Name: ACS Catalysis Journal Issue: 10 Vol. 14; ISSN 2155-5435
Publisher:
American Chemical Society (ACS)Copyright Statement
Country of Publication:
United States
Language:
English

References (143)

Simultaneous catalysis and product separation by cross-linked enzyme crystals journal January 2001
A Highly Active Catalyst System for Intermolecular Hydroacylation journal September 2000
Chelation‐Controlled Intermolecular Hydroacylation: Direct Addition of Alkyl Aldehydes to Functionalized Alkenes journal December 2003
A Second‐Generation Catalyst for Intermolecular Hydroacylation of Alkenes and Alkynes Using β‐S‐Substituted Aldehydes: The Role of a Hemilabile P‐O‐P Ligand journal November 2006
Biofuels and Biomass-To-Liquid Fuels in the Biorefinery: Catalytic Conversion of Lignocellulosic Biomass using Porous Materials journal October 2008
Non-Oxidative Vanadium-Catalyzed C-O Bond Cleavage: Application to Degradation of Lignin Model Compounds journal April 2010
Regioselective and Stereospecific Cross-Coupling of Primary Allylic Amines with Boronic Acids and Boronates through Palladium-Catalyzed CN Bond Cleavage journal February 2012
Beyond Directing Groups: Transition-Metal-Catalyzed CH Activation of Simple Arenes journal September 2012
Synthesis of Biaryls through Nickel‐Catalyzed Suzuki–Miyaura Coupling of Amides by Carbon–Nitrogen Bond Cleavage journal April 2016
Achieving Digital Catalysis: Strategies for Data Acquisition, Storage and Use journal May 2023
A Primer about Machine Learning in Catalysis – A Tutorial with Code journal May 2020
Design of an Accurate Machine Learning Algorithm to Predict the Binding Energies of Several Adsorbates on Multiple Sites of Metal Surfaces journal July 2020
Accurate prediction of binding energies for two‐dimensional catalytic materials using machine learning journal September 2020
Active Learning A Neural Network Model For Gold Clusters & Bulk From Sparse First Principles Training Data journal September 2020
Revisiting Machine Learning Predictions for Oxidative Coupling of Methane (OCM) based on Literature Data journal September 2020
Open Data in Catalysis: From Today's Big Picture to the Future of Small Data journal December 2020
A Unified Research Data Infrastructure for Catalysis Research – Challenges and Concepts journal March 2021
Lignin Depolymerization and Conversion A Review of Thermochemical Methods journal November 2010
Towards Quantitative Catalytic Lignin Depolymerization journal April 2011
Twisted Amides: From Obscurity to Broadly Useful Transition-Metal-Catalyzed Reactions by N−C Amide Bond Activation journal February 2017
Cleavage of CO Bonds in Lignin Model Compounds Catalyzed by Methyldioxorhenium in Homogeneous Phase journal January 2014
Tungsten Carbide: A Remarkably Efficient Catalyst for the Selective Cleavage of Lignin C−O Bonds journal October 2016
Resin adsorption application for product separation and catalyst recycling in coupled enzymatic catalysis to produce 1,3‐propanediol and dihydroxyacetone for repeated batch journal September 2013
C-C Bond Activation book January 2014
A theoretical study of the mechanism for peptide hydrolysis by thermolysin journal March 2002
Enzyme immobilization by adsorption: a review journal June 2014
Effective C–O Bond Cleavage of Lignin β-O-4 Model Compounds: A New RuHCl(CO)(PPh3)3/KOH Catalytic System journal May 2014
Principal components analysis (PCA) journal March 1993
Regioselective opening of substituted (cyclopropylmethyl)lithiums derived from cyclopropylmethyl iodides journal October 1998
Biocatalytic membrane reactors: applications and perspectives journal August 2000
Catalytic polymeric membranes: Preparation and application journal July 2006
From models to lignin: Transition metal catalysis for selective bond cleavage reactions journal January 2016
Molecular Basis of C–N Bond Cleavage by the Glycyl Radical Enzyme Choline Trimethylamine-Lyase journal October 2016
Artificial intelligence in reaction prediction and chemical synthesis journal June 2022
Autonomous chemical science and engineering enabled by self-driving laboratories journal June 2022
Selective cleavage of ether C-O bond in lignin-derived compounds over Ru system under different H-sources journal January 2021
Effect of functional groups on hydrogenolysis of lignin model compounds journal December 2016
Reaction prediction via atomistic simulation: from quantum mechanics to machine learning journal January 2021
Environmental assessment of enzyme use in industrial production – a literature review journal March 2013
Next-Generation Experimentation with Self-Driving Laboratories journal June 2019
Status and Challenges of Density Functional Theory journal April 2020
Autonomous Chemical Experiments: Challenges and Perspectives on Establishing a Self-Driving Lab journal August 2022
Metal-Catalyzed Carbon–Carbon Bond Cleavage of Unstrained Alcohols journal July 2020
Creating Stereocenters within Acyclic Systems by C–C Bond Cleavage of Cyclopropanes journal July 2020
Computational Methods in Heterogeneous Catalysis journal December 2020
Machine Learning for Chemical Reactions journal June 2021
Transition-Metal-Catalyzed Cleavage of C–N Single Bonds journal October 2015
Metal–Organic Cooperative Catalysis in C–H and C–C Bond Activation journal January 2017
Recent Methodologies That Exploit C–C Single-Bond Cleavage of Strained Ring Systems by Transition Metal Complexes journal January 2017
Catalytic Enantioselective Transformations Involving C–H Bond Cleavage by Transition-Metal Complexes journal February 2017
Formation and Cleavage of C–C Bonds by Enzymatic Oxidation–Reduction Reactions journal June 2018
Cleavage of Si–H, B–H, and C–H Bonds by Metal–Ligand Cooperation journal August 2019
σ-H–H, σ-C–H, and σ-Si–H Bond Activation Catalyzed by Metal Nanoparticles journal October 2019
Automated Chemical Reaction Extraction from Scientific Literature journal June 2021
Machine Learning of Reaction Properties via Learned Representations of the Condensed Graph of Reaction journal November 2021
ReactionDataExtractor 2.0: A Deep Learning Approach for Data Extraction from Chemical Reaction Schemes journal September 2023
ChemDataExtractor: A Toolkit for Automated Extraction of Chemical Information from the Scientific Literature journal October 2016
An Umpolung Strategy for the Synthesis of β-Aminoketones via Copper-Catalyzed Electrophilic Amination of Cyclopropanols journal April 2015
Sterically Controlled Pd-Catalyzed Chemoselective Ketone Synthesis via N–C Cleavage in Twisted Amides journal August 2015
Suzuki–Miyaura Cross-Coupling of N -Acylpyrroles and Pyrazoles: Planar, Electronically Activated Amides in Catalytic N–C Cleavage journal June 2017
Open Catalyst 2020 (OC20) Dataset and Community Challenges journal May 2021
Site- and Regioselective Silaborative C–C Cleavage of 1-Alkyl-2-Methylenecyclopropanes Using a Platinum Catalyst with a Sterically Demanding Silylboronic Ester journal April 2015
Toward Benchmarking in Catalysis Science: Best Practices, Challenges, and Opportunities journal March 2016
An Adventure in Sustainable Cross-Coupling of Phenols and Derivatives via Carbon–Oxygen Bond Cleavage journal December 2016
“Cut and Sew” Transformations via Transition-Metal-Catalyzed Carbon–Carbon Bond Activation journal January 2017
Extracting Knowledge from Data through Catalysis Informatics journal June 2018
Ru-Catalyzed Hydrogenolysis of Lignin: Base-Dependent Tunability of Monomeric Phenols and Mechanistic Study journal March 2019
Discovering New Chemistry with an Autonomous Robotic Platform Driven by a Reactivity-Seeking Neural Network journal November 2021
Prediction of Organic Reaction Outcomes Using Machine Learning journal April 2017
Molecular Transformer: A Model for Uncertainty-Calibrated Chemical Reaction Prediction journal August 2019
Chemical Recycling of Carbon Fiber Reinforced Epoxy Resin Composites via Selective Cleavage of the Carbon–Nitrogen Bond journal November 2015
Efficient and Mild Transfer Hydrogenolytic Cleavage of Aromatic Ether Bonds in Lignin-Derived Compounds over Ru/C journal January 2018
ReO x /AC-Catalyzed Cleavage of C–O Bonds in Lignin Model Compounds and Alkaline Lignins journal November 2018
Designing Catalysts for Functionalization of Unactivated C–H Bonds Based on the CH Activation Reaction journal January 2012
Chemical literature data extraction: The CLiDE Project journal May 1993
Algorithm for Reaction Classification journal October 2013
The Chemistry of Cyclopropanols journal July 2003
Polymeric Membranes in Catalytic Reactors journal August 2002
Recent Advances in Transition-Metal-Catalyzed Functionalization of Unstrained Carbon–Carbon Bonds journal July 2014
Cyclopropanol chemistry journal December 1974
Rhodium-Catalyzed C−C Bond Formation via Heteroatom-Directed C−H Bond Activation journal February 2010
Transition Metal Catalyzed Alkene and Alkyne Hydroacylation journal October 2009
Palladium-Catalyzed Ligand-Directed C−H Functionalization Reactions journal February 2010
C—H Bond Activation in Transition Metal Species from a Computational Perspective journal February 2010
The Catalytic Valorization of Lignin for the Production of Renewable Chemicals journal June 2010
Hydrolytic Cleavage of β-O-4 Ether Bonds of Lignin Model Compounds in an Ionic Liquid with Metal Chlorides journal January 2011
A Novel Chelation-Assisted Hydroesterification of Alkenes via Ruthenium Catalysis journal January 2002
Ruthenium-Catalyzed Carbon−Carbon Bond Formation via the Cleavage of an Unreactive Aryl Carbon−Nitrogen Bond in Aniline Derivatives with Organoboronates journal April 2007
C–N Bond Cleavage of Allylic Amines via Hydrogen Bond Activation with Alcohol Solvents in Pd-Catalyzed Allylic Alkylation of Carbonyl Compounds journal November 2011
Ruthenium Hydride-Catalyzed Addition of Aldehydes to Dienes Leading to β,γ-Unsaturated Ketones journal October 2008
Cleavage of C−N Bonds in Aniline Derivatives on a Ruthenium Center and Its Relevance to Catalytic C−C Bond Formation journal May 2009
Catalysis of a Flavoenzyme-Mediated Amide Hydrolysis journal April 2010
The Open Reaction Database journal November 2021
Reversible Twisting of Primary Amides via Ground State N–C(O) Destabilization: Highly Twisted Rotationally Inverted Acyclic Amides journal January 2018
Rhodium-Catalyzed Intermolecular Chelation Controlled Alkene and Alkyne Hydroacylation:  Synthetic Scope of β-S-Substituted Aldehyde Substrates journal June 2006
Double-Chelation-Assisted Rh-Catalyzed Intermolecular Hydroacylation journal March 2003
Chelation-Controlled Intermolecular Alkene and Alkyne Hydroacylation: The Utility of β-Thioacetal Aldehydes journal May 2005
Direct Intermolecular Hydroacylation of N,N-Dialkylacrylamides with Aldehydes Catalyzed by a Cationic Rhodium(I)/dppb Complex journal March 2007
First Intermolecular Hydroacylation of 1,3-Dienes with Aldehydes Catalyzed by Ruthenium journal April 1998
Dissecting the catalytic triad of a serine protease journal April 1988
Cloud labs: where robots do the research journal June 2022
Conversion of amides to esters by the nickel-catalysed activation of amide C–N bonds journal July 2015
C–H bond activation enables the rapid construction and late-stage diversification of functional molecules journal April 2013
Nickel-catalysed Suzuki–Miyaura coupling of amides journal November 2015
Stille coupling via C–N bond cleavage journal September 2016
Machine learning in chemical reaction space journal October 2020
Quantitative interpretation explains machine learning models for chemical reaction prediction and uncovers bias journal March 2021
Inferring experimental procedures from text-based representations of chemical reactions journal May 2021
Autonomous platforms for data-driven organic synthesis journal February 2022
Catalysis-Hub.org, an open electronic structure database for surface reactions journal May 2019
Discovery of novel chemical reactions by deep generative recurrent neural network journal February 2021
Mapping the space of chemical reactions using attention-based neural networks journal January 2021
Leveraging large language models for predictive chemistry journal February 2024
The rise of self-driving labs in chemical and materials sciences journal January 2023
Enzymatic functionalization of carbon–hydrogen bonds journal January 2011
Unprecedented organocatalytic reduction of lignin model compounds to phenols and primary alcohols using hydrosilanes journal January 2014
Homogeneous catalysis for the conversion of biomass and biomass-derived platform chemicals journal January 2014
Understanding the chemical transformations of lignin during ionic liquid pretreatment journal January 2014
Acylative Suzuki coupling of amides: acyl-nitrogen activation via synergy of independently modifiable activating groups journal January 2015
Iron-catalysed oxidative cleavage of lignin and β-O-4 lignin model compounds with peroxides in DMSO journal January 2015
Oxidative conversion of lignin and lignin model compounds catalyzed by CeO 2 -supported Pd nanoparticles journal January 2015
Cleavage of the lignin β-O-4 ether bond via a dehydroxylation–hydrogenation strategy over a NiMo sulfide catalyst journal January 2016
Recent advances in transition metal-catalysed hydroacylation of alkenes and alkynes journal January 2016
Deep learning for chemical reaction prediction journal January 2018
Synthesis of γ-keto sulfones by copper-catalyzed oxidative sulfonylation of tertiary cyclopropanols journal January 2017
Machine learning for predicting product distributions in catalytic regioselective reactions journal January 2018
“Found in Translation”: predicting outcomes of complex organic chemistry reactions using neural sequence-to-sequence models journal January 2018
Au–Pd alloy cooperates with covalent triazine frameworks for the catalytic oxidative cleavage of β-O-4 linkages journal January 2019
Prospects and challenges for autonomous catalyst discovery viewed from an experimental perspective journal January 2022
The design and optimization of heterogeneous catalysts using computational methods journal January 2024
Rhodium-catalysed hydroacylation or reductive aldol reactions: a ligand dependent switch of reactivity journal January 2008
Palladium-catalyzed cross-coupling of cyclopropanol-derived ketone homoenolates with aryl bromides journal January 2013
Review: The Materials Chemistry of Inorganic Catalysts journal January 2001
Discovery of a Novel Enzyme, Isonitrile Hydratase, Involved in Nitrogen-Carbon Triple Bond Cleavage journal April 2001
Prediction of chemical reaction yields using deep learning journal March 2021
Lignin Valorization: Improving Lignin Processing in the Biorefinery journal May 2014
Predicting reaction performance in C–N cross-coupling using machine learning journal February 2018
Polyethylene upcycling to long-chain alkylaromatics by tandem hydrogenolysis/aromatization journal October 2020
The central role of density functional theory in the AI age journal July 2023
A machine-learning tool to predict substrate-adaptive conditions for Pd-catalyzed C–N couplings journal September 2023
Selective Removal of Nitrogen from Quinoline and Petroleum by Pseudomonas ayucida IGTN9m journal February 2000
Automatic vs. manual curation of a multi-source chemical dictionary: the impact on text mining journal March 2010
Automated reaction database and reaction network analysis: extraction of reaction templates using cheminformatics journal March 2018