Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Machine Learning for the Discovery, Design, and Engineering of Materials

Journal Article · · Annual Review of Chemical and Biomolecular Engineering
 [1];  [1];  [2]
  1. Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA,, ,; Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
  2. Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA,, ,

Machine learning (ML) has become a part of the fabric of high-throughput screening and computational discovery of materials. Despite its increasingly central role, challenges remain in fully realizing the promise of ML. This is especially true for the practical acceleration of the engineering of robust materials and the development of design strategies that surpass trial and error or high-throughput screening alone. Depending on the quantity being predicted and the experimental data available, ML can either outperform physics-based models, be used to accelerate such models, or be integrated with them to improve their performance. We cover recent advances in algorithms and in their application that are starting to make inroads toward ( a) the discovery of new materials through large-scale enumerative screening, ( b) the design of materials through identification of rules and principles that govern materials properties, and ( c) the engineering of practical materials by satisfying multiple objectives. We conclude with opportunities for further advancement to realize ML as a widespread tool for practical computational materials design.

Research Organization:
Univ. of Minnesota, Minneapolis, MN (United States); Massachusetts Inst. of Technology (MIT), Cambridge, MA (United States)
Sponsoring Organization:
USDOE Office of Science (SC)
DOE Contract Number:
SC0012702; SC0018096
OSTI ID:
1980843
Journal Information:
Annual Review of Chemical and Biomolecular Engineering, Vol. 13, Issue 1; ISSN 1947-5438
Publisher:
Annual Reviews
Country of Publication:
United States
Language:
English

References (142)

The high-throughput highway to computational materials design journal February 2013
Commentary: The Materials Project: A materials genome approach to accelerating materials innovation journal July 2013
Perspective: Materials informatics and big data: Realization of the “fourth paradigm” of science in materials science journal April 2016
Autonomous Molecular Design: Then and Now journal March 2019
Optimal Heck Cross-Coupling Catalysis: A Pseudo-Pharmaceutical Approach journal December 2003
Combinatorial Explosion in Homogeneous Catalysis: Screening 60,000 Cross-Coupling Reactions journal December 2004
Topological Mapping of Bidentate Ligands: A Fast Approach for Screening Homogeneous Catalysts journal December 2005
Machine-learning models for combinatorial catalyst discovery journal December 2004
Quantum chemistry structures and properties of 134 kilo molecules journal August 2014
ChemML : A machine learning and informatics program package for the analysis, mining, and modeling of chemical and materials data journal January 2020
Semi-supervised machine-learning classification of materials synthesis procedures journal July 2019
Virtual screening of inorganic materials synthesis parameters with deep learning journal December 2017
Materials Synthesis Insights from Scientific Literature via Text Extraction and Machine Learning journal October 2017
Opportunities and challenges of text mining in materials research journal March 2021
Capturing chemical intuition in synthesis of metal-organic frameworks journal February 2019
Using Machine Learning and Data Mining to Leverage Community Knowledge for the Engineering of Stable Metal–Organic Frameworks journal October 2021
Computational Discovery of Transition-metal Complexes: From High-throughput Screening to Machine Learning journal July 2021
ChemDataExtractor: A Toolkit for Automated Extraction of Chemical Information from the Scientific Literature journal October 2016
Automated Building of Organometallic Complexes from 3D Fragments journal July 2014
molSimplify: A toolkit for automating discovery in inorganic chemistry journal July 2016
Strategies and Software for Machine Learning Accelerated Discovery in Transition Metal Chemistry journal September 2018
Matminer: An open source toolkit for materials data mining journal September 2018
A general-purpose machine learning framework for predicting properties of inorganic materials journal August 2016
The Cambridge Structural Database
  • Groom, Colin R.; Bruno, Ian J.; Lightfoot, Matthew P.
  • Acta Crystallographica Section B Structural Science, Crystal Engineering and Materials, Vol. 72, Issue 2, p. 171-179 https://doi.org/10.1107/S2052520616003954
journal April 2016
Large-scale screening of hypothetical metal–organic frameworks journal November 2011
Topologically Guided, Automated Construction of Metal–Organic Frameworks and Their Evaluation for Energy-Related Applications journal September 2017
Advances, Updates, and Analytics for the Computation-Ready, Experimental Metal–Organic Framework Database: CoRE MOF 2019 journal November 2019
Materials Design and Discovery with High-Throughput Density Functional Theory: The Open Quantum Materials Database (OQMD) journal September 2013
tmQM Dataset—Quantum Geometries and Properties of 86k Transition Metal Complexes journal November 2020
Resolving Transition Metal Chemical Space: Feature Selection for Machine Learning and Structure–Property Relationships journal November 2017
Predicting electronic structure properties of transition metal complexes with neural networks journal January 2017
Combinatorial screening for new materials in unconstrained composition space with machine learning journal March 2014
Enumeration of 166 Billion Organic Small Molecules in the Chemical Universe Database GDB-17 journal November 2012
Ring Closure To Form Metal Chelates in 3D Fragment-Based de Novo Design journal August 2015
Accurate Multiobjective Design in a Space of Millions of Transition Metal Complexes with Neural-Network-Driven Efficient Global Optimization journal March 2020
A generalized method for constructing hypothetical nanoporous materials of any net topology from graph theory journal January 2016
Construction and Characterization of Structure Models of Crystalline Porous Polymers journal April 2014
Understanding the diversity of the metal-organic framework ecosystem journal August 2020
Computational Discovery of New Zeolite-Like Materials journal October 2009
Design and Assembly of Virtual Homogeneous Catalyst Libraries –Towardsin silico Catalyst Optimisation journal February 2006
Machine Learning Accelerates the Discovery of Design Rules and Exceptions in Stable Metal–Oxo Intermediate Formation journal July 2019
Rapid Detection of Strong Correlation with Machine Learning for Transition-Metal Complex High-Throughput Screening journal August 2020
Local Descriptors of Dynamic and Nondynamic Correlation journal May 2017
A quantitative uncertainty metric controls error in neural network-driven chemical discovery journal January 2019
Evaluating topologically diverse metal–organic frameworks for cryo-adsorbed hydrogen storage journal January 2016
In silico Design of Porous Polymer Networks: High-Throughput Screening for Methane Storage Materials journal March 2014
Learning from Failure: Predicting Electronic Structure Calculation Outcomes with Machine Learning Models journal March 2019
Design of efficient molecular organic light-emitting diodes by a high-throughput virtual screening and experimental approach journal August 2016
Data-driven design of metal–organic frameworks for wet flue gas CO2 capture journal December 2019
Designing in the Face of Uncertainty: Exploiting Electronic Structure and Machine Learning Models for Discovery in Inorganic Chemistry journal March 2019
Computational Ligand Descriptors for Catalyst Design journal October 2018
Statistical Modeling of a Ligand Knowledge Base journal November 2006
Conformation oftripod Metal Templates in CH3C(CH2PPh2)3MLn (n = 2, 3): Neural Networks in Conformational Analysis journal July 1996
Unsupervised machine learning in atomistic simulations, between predictions and understanding journal April 2019
Stochastic Voyages into Uncharted Chemical Space Produce a Representative Library of All Possible Drug-Like Compounds journal May 2013
3D-QSAR as a Tool for Understanding and Improving Single-Site Polymerization Catalysts. A Review journal June 2014
The Development of Multidimensional Analysis Tools for Asymmetric Catalysis and Beyond journal May 2016
Quantitative Structure–Selectivity Relationships in Enantioselective Catalysis: Past, Present, and Future journal December 2019
Predicting reaction performance in C–N cross-coupling using machine learning journal February 2018
Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization conference October 2017
Automatic Chemical Design Using a Data-Driven Continuous Representation of Molecules journal January 2018
Improved Chemical Prediction from Scarce Data Sets via Latent Space Enrichment journal April 2019
Robust Fuzzy Principal Component Analysis (FPCA). A Comparative Study Concerning Interaction of Carbon−Hydrogen Bonds with Molybdenum−Oxo Bonds journal November 2002
Performance of Metal-Catalyzed Hydrodebromination of Dibromomethane Analyzed by Descriptors Derived from Statistical Learning journal April 2020
UMAP: Uniform Manifold Approximation and Projection journal September 2018
Quantum-mechanical transition-state model combined with machine learning provides catalyst design features for selective Cr olefin oligomerization journal January 2020
Polymerization Activity Prediction of Zirconocene Single-Site Catalysts Using 3D Quantitative Structure–Activity Relationship Modeling journal February 2012
Machine learning dihydrogen activation in the chemical space surrounding Vaska's complex journal January 2020
Data-Driven Advancement of Homogeneous Nickel Catalyst Activity for Aryl Ether Cleavage journal May 2020
Machine learning meets volcano plots: computational discovery of cross-coupling catalysts journal January 2018
Search for Catalysts by Inverse Design: Artificial Intelligence, Mountain Climbers, and Alchemists journal October 2018
Applications of the Cambridge Structural Database to molecular inorganic chemistry journal May 2002
Phosphorus ligand exchange equilibriums on zerovalent nickel. Dominant role for steric effects journal May 1970
Steric effects of phosphorus ligands in organometallic chemistry and homogeneous catalysis journal June 1977
Towards the online computer-aided design of catalytic pockets journal September 2019
Prediction of higher-selectivity catalysts by computer-driven workflow and machine learning journal January 2019
Stereocartography:  A Computational Mapping Technique That Can Locate Regions of Maximum Stereoinduction around Chiral Catalysts journal November 2002
A Simple Approach for Predicting the Spin State of Homoleptic Fe(II) Tris-diimine Complexes journal April 2017
Machine learning guided design of single-molecule magnets for magnetocaloric applications journal June 2019
Seeing Is Believing: Experimental Spin States from Machine Learning Model Structure Predictions journal March 2020
Machine Learning Approach for Prediction of Reaction Yield with Simulated Catalyst Parameters journal March 2018
Interactive-quantum-chemical-descriptors enabling accurate prediction of an activation energy through machine learning journal August 2020
Prediction of catalytic activities of bis(imino)pyridine metal complexes by machine learning journal February 2020
Leveraging Cheminformatics Strategies for Inorganic Discovery: Application to Redox Potential Design journal April 2017
Development of a Computer-Guided Workflow for Catalyst Optimization. Descriptor Validation, Subset Selection, and Training Set Analysis journal June 2020
A Universal Machine Learning Algorithm for Large-Scale Screening of Materials journal February 2020
Crystal Graph Convolutional Neural Networks for an Accurate and Interpretable Prediction of Material Properties journal April 2018
Atomic Property Weighted Radial Distribution Functions Descriptors of Metal–Organic Frameworks for the Prediction of Gas Uptake Capacity journal July 2013
Predicting oxidation and spin states by high-dimensional neural networks: Applications to lithium manganese oxide spinels journal October 2020
Surfing Multiple Conformation-Property Landscapes via Machine Learning: Designing Single-Ion Magnetic Anisotropy journal February 2020
Machine Learning Using Combined Structural and Chemical Descriptors for Prediction of Methane Adsorption Performance of Metal Organic Frameworks (MOFs) journal September 2017
Data-Driven Approaches Can Overcome the Cost–Accuracy Trade-Off in Multireference Diagnostics journal June 2020
Fast and Accurate Modeling of Molecular Atomization Energies with Machine Learning journal January 2012
Chemically intuited, large-scale screening of MOFs by machine learning techniques journal October 2017
Role of Pore Chemistry and Topology in the CO 2 Capture Capabilities of MOFs: From Molecular Simulation to Machine Learning journal August 2018
Addressing Challenges of Identifying Geometrically Diverse Sets of Crystalline Porous Materials journal December 2011
Algorithms and tools for high-throughput geometry-based analysis of crystalline porous materials journal February 2012
Machine learning to tame divergent density functional approximations: a new path to consensus materials design principles journal January 2021
Optimizing Open Iron Sites in Metal–Organic Frameworks for Ethane Oxidation: A First-Principles Study journal April 2017
Anthropogenic biases in chemical reaction data hinder exploratory inorganic synthesis journal September 2019
Machine-learning-assisted materials discovery using failed experiments journal May 2016
Optimization of Catalyst for Methanol Synthesis by a Combinatorial Approach Using a Parallel Activity Test and Genetic Algorithm Assisted by a Neural Network journal July 2003
Toward computational screening in heterogeneous catalysis: Pareto-optimal methanation catalysts journal April 2006
Optimisation of olefin epoxidation catalysts with the application of high-throughput and genetic algorithms assisted by artificial neural networks (softcomputing techniques) journal January 2005
Thermochromic sensor design based on Fe(II) spin crossover/polymers hybrid materials and artificial neural networks as a tool in modelling journal March 2015
Intelligent Selection of Metal–Organic Framework Arrays for Methane Sensing via Genetic Algorithms journal May 2019
Unsupervised word embeddings capture latent knowledge from materials science literature journal July 2019
Machine learning meets continuous flow chemistry: Automated optimization towards the Pareto front of multiple objectives journal November 2018
Prediction of water stability of metal–organic frameworks using machine learning journal November 2020
Evolution of Catalysts Directed by Genetic Algorithms in a Plug-Based Microfluidic Device Tested with Oxidation of Methane by Oxygen journal February 2010
Optimization of Cu oxide catalysts for methanol synthesis by combinatorial tools using 96 well microplates, artificial neural network and genetic algorithm journal June 2004
In silico discovery of metal-organic frameworks for precombustion CO 2 capture using a genetic algorithm journal October 2016
Accelerating Chemical Discovery with Machine Learning: Simulated Evolution of Spin Crossover Complexes with an Artificial Neural Network journal February 2018
Genetic algorithms for computational materials discovery accelerated by machine learning journal April 2019
Genetic algorithms in chemometrics and chemistry: a review journal January 2001
Discovery and Optimization of Materials Using Evolutionary Approaches journal May 2016
Materials design by evolutionary optimization of functional groups in metal-organic frameworks journal November 2016
Computational Screening of Trillions of Metal–Organic Frameworks for High-Performance Methane Storage journal May 2021
DENOPTIM: Software for Computational de Novo Design of Organic and Inorganic Molecules journal September 2019
Comparison of Single- And Multiobjective Design of Experiment in Combinatorial Chemistry for the Selective Dehydrogenation of Propane journal September 2009
Insights into Multi-Objective Design of Metal–Organic Frameworks journal August 2013
Multi-objective Optimization for Materials Discovery via Adaptive Design journal February 2018
Accelerated Discovery of Large Electrostrains in BaTiO 3 -Based Piezoelectrics Using Active Learning journal January 2018
Assessing the frontier: Active learning, model accuracy, and multi-objective candidate discovery and optimization journal July 2020
Recent advances in surrogate-based optimization journal January 2009
Application of a genetic algorithm and a neural network for the discovery and optimization of new solid catalytic materials journal February 2004
Artificial neural network-aided design of a multi-component catalyst for methane oxidative coupling journal October 2001
Catalyst design for methane oxidative coupling by using artificial neural network and hybrid genetic algorithm journal January 2003
An Evolutionary Algorithm for de Novo Optimization of Functional Transition Metal Compounds journal May 2012
Supervised machine learning for prediction of zirconocene-catalyzed α-olefin polymerization journal December 2019
Machine learning and molecular descriptors enable rational solvent selection in asymmetric catalysis journal January 2019
Identifying high-performance catalytic conditions for carbon dioxide reduction to dimethoxymethane by multivariate modelling journal January 2019
Predicting New TiO 2 Phases with Low Band Gaps by a Multiobjective Global Optimization Approach journal January 2014
Efficient search of compositional space for hybrid organic–inorganic perovskites via Bayesian optimization journal September 2018
Text Mining Metal–Organic Framework Papers journal January 2018
Design of Electroceramic Materials Using Artificial Neural Networks and Multiobjective Evolutionary Algorithms journal January 2008
Adding Stochastic Negative Examples into Machine Learning Improves Molecular Bioactivity Prediction journal November 2020
Inverse design of porous materials using artificial neural networks journal January 2020
Inverse design of nanoporous crystalline reticular materials with deep generative models journal January 2021
Generative adversarial networks (GAN) based efficient sampling of chemical composition space for inverse design of inorganic materials journal June 2020
Discovering Relationships between OSDAs and Zeolites through Data Mining and Generative Neural Networks journal April 2021
Big Data Meets Quantum Chemistry Approximations: The Δ-Machine Learning Approach journal April 2015