Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Representations of Materials for Machine Learning

Journal Article · · Annual Review of Materials Research
 [1];  [2];  [2];  [2];  [2];  [2];  [2]
  1. Massachusetts Inst. of Technology (MIT), Cambridge, MA (United States); OSTI
  2. Massachusetts Inst. of Technology (MIT), Cambridge, MA (United States)

High-throughput data generation methods and machine learning (ML) algorithms have given rise to a new era of computational materials science by learning the relations between composition, structure, and properties and by exploiting such relations for design. However, to build these connections, materials data must be translated into a numerical form, called a representation, that can be processed by an ML model. Data sets in materials science vary in format (ranging from images to spectra), size, and fidelity. Predictive models vary in scope and properties of interest. Here, we review context-dependent strategies for constructing representations that enable the use of materials as inputs or outputs for ML models. Furthermore, we discuss how modern ML techniques can learn representations from data and transfer chemical and physical information between tasks. Finally, we outline high-impact questions that have not been fully resolved and thus require further investigation.

Research Organization:
Massachusetts Inst. of Technology (MIT), Cambridge, MA (United States)
Sponsoring Organization:
USDOE Advanced Research Projects Agency - Energy (ARPA-E)
Grant/Contract Number:
AR0001220
OSTI ID:
2422344
Journal Information:
Annual Review of Materials Research, Journal Name: Annual Review of Materials Research Journal Issue: 1 Vol. 53; ISSN 1531-7331
Publisher:
Annual ReviewsCopyright Statement
Country of Publication:
United States
Language:
English

References (154)

Finding the Next Superhard Material through Ensemble Learning journal December 2020
Direct Prediction of Phonon Density of States With Euclidean Neural Networks journal March 2021
The Limited Predictive Power of the Pauling Rules journal March 2020
Fourier series of atomic radial distribution functions: A molecular fingerprint for machine learning models of quantum chemical properties
  • von Lilienfeld, O. Anatole; Ramakrishnan, Raghunathan; Rupp, Matthias
  • International Journal of Quantum Chemistry, Vol. 115, Issue 16 https://doi.org/10.1002/qua.24912
journal April 2015
Crystal structure representations for machine learning models of formation energies journal April 2015
Generative Models for Automatic Chemical Design book January 2020
Persistent-homology-based machine learning: a survey and a comparative study journal February 2022
Microstructure Generation via Generative Adversarial Network for Heterogeneous, Topologically Complex 3D Materials journal December 2020
Generalized cluster description of multicomponent systems journal November 1984
Quantifying and connecting atomic and crystallographic grain boundary structure using local environment representation and dimensionality reduction techniques journal December 2018
AFLOWLIB.ORG: A distributed materials properties repository from high-throughput ab initio calculations journal June 2012
Matminer: An open source toolkit for materials data mining journal September 2018
Transfer learning for materials informatics using crystal graph convolutional neural network journal April 2021
USPEX—Evolutionary crystal structure prediction journal December 2006
From the Sabatier principle to a predictive theory of transition-metal heterogeneous catalysis journal August 2015
Inverse Design of Solid-State Materials via a Continuous Representation journal November 2019
Crystal Site Feature Embedding Enables Exploration of Large Chemical Spaces journal August 2020
An invertible crystallographic representation for general inverse design of inorganic crystals with targeted properties journal January 2022
Genetic algorithm-guided deep learning of grain boundary diagrams: Addressing the challenge of five degrees of freedom journal September 2020
Accelerating materials discovery with Bayesian optimization and graph deep learning journal December 2021
A Universal Machine Learning Model for Elemental Grain Boundary Energies journal September 2022
High-Throughput Machine-Learning-Driven Synthesis of Full-Heusler Compounds journal October 2016
How Chemical Composition Alone Can Predict Vibrational Free Energies and Entropies of Solids journal July 2017
Graph Networks as a Universal Machine Learning Framework for Molecules and Crystals journal April 2019
Machine Learning Force Fields journal March 2021
Physics-Inspired Structural Representations for Molecules and Materials journal July 2021
Gaussian Process Regression for Materials and Molecules journal August 2021
3-D Inorganic Crystal Structure Generation and Property Prediction via Representation Learning journal August 2020
High-Throughput Screening Approach for Nanoporous Materials Genome Using Topological Data Analysis: Application to Zeolites journal June 2018
Descriptor-Based Approach for the Prediction of Cation Vacancy Formation Energies and Transition Levels journal October 2017
Predicting the Band Gaps of Inorganic Solids by Machine Learning journal March 2018
Convolutional Neural Network of Atomic Surface Structures To Predict Binding Energies for High-Throughput Screening of Catalysts journal July 2019
Generative Model for the Inverse Design of Metasurfaces journal September 2018
Computational Discovery of New 2D Materials Using Deep Learning Generative Models journal May 2021
Open Catalyst 2020 (OC20) Dataset and Community Challenges journal May 2021
Open Challenges in Developing Generalizable Large-Scale Machine-Learning Models for Catalyst Discovery journal July 2022
The Open Catalyst 2022 (OC22) Dataset and Challenges for Oxide Electrocatalysts journal February 2023
Generative Adversarial Networks for Crystal Structure Prediction journal July 2020
Automatic Chemical Design Using a Data-Driven Continuous Representation of Molecules journal January 2018
Predicting Materials Properties with Little Data Using Shotgun Transfer Learning journal September 2019
Machine Learning-Enabled Design of Point Defects in 2D Materials for Quantum and Neuromorphic Information Processing journal September 2020
The Principles Determining the Structure of Complex Ionic Crystals journal April 1929
Disentangling Structural Confusion through Machine Learning: Structure Prediction and Polymorphism of Equiatomic Ternary Phases ABC journal November 2017
Universal fragment descriptors for predicting properties of inorganic crystals journal June 2017
The Open Quantum Materials Database (OQMD): assessing the accuracy of DFT formation energies journal December 2015
A general-purpose machine learning framework for predicting properties of inorganic materials journal August 2016
Insightful classification of crystal structures using deep learning journal July 2018
Enhancing materials property prediction by leveraging computational and experimental data using deep transfer learning journal November 2019
Quantitative prediction of grain boundary thermal conductivities from local atomic environments journal April 2020
Predicting materials properties without crystal structure: deep representation learning from stoichiometry journal December 2020
Machine learned features from density of states for accurate adsorption energy prediction journal January 2021
Cross-property deep transfer learning framework for enhanced predictive analytics on small materials data journal November 2021
SpookyNet: Learning force fields with electronic degrees of freedom and nonlocal effects journal December 2021
Density of states prediction for materials discovery via contrastive learning from probabilistic embeddings journal February 2022
E(3)-equivariant graph neural networks for data-efficient and accurate interatomic potentials journal May 2022
Excited state non-adiabatic dynamics of large photoswitchable molecules using a chemically transferable machine learning potential journal June 2022
BIGDML—Towards accurate quantum machine learning force fields for materials journal June 2022
Adsorbate chemical environment-based machine learning framework for heterogeneous catalysis journal October 2022
Learning local equivariant representations for large-scale atomistic dynamics journal February 2023
Predicting defect behavior in B2 intermetallics by merging ab initio modeling and machine learning journal December 2016
Discovering the building blocks of atomic systems using machine learning: application to grain boundaries journal August 2017
Machine learning modeling of superconducting critical temperature journal June 2018
A machine learning approach to model solute grain boundary segregation journal November 2018
Machine-learned multi-system surrogate models for materials prediction journal April 2019
Generative adversarial networks (GAN) based efficient sampling of chemical composition space for inverse design of inorganic materials journal June 2020
A critical examination of compound stability predictions from machine-learned formation energies journal July 2020
Benchmarking materials property prediction methods: the Matbench test set and Automatminer reference algorithm journal September 2020
A general and transferable deep learning framework for predicting phase formation in materials journal January 2021
Topological representations of crystalline compounds for the machine-learning prediction of materials properties journal February 2021
Constrained crystals deep convolutional generative adversarial network for the inverse design of crystal structures journal May 2021
Compositionally restricted attention-based network for materials property predictions journal May 2021
Benchmarking graph neural networks for materials chemistry journal June 2021
Graph neural networks for an accurate and interpretable prediction of the properties of polycrystalline materials journal July 2021
AtomSets as a hierarchical transfer learning framework for small and large materials datasets journal October 2021
Atomistic Line Graph Neural Network for improved materials property predictions journal November 2021
Approaches for handling high-dimensional cluster expansions of ionic systems journal June 2022
Data-augmentation for graph neural network learning of the relaxed energies of unrelaxed structures journal September 2022
Graph similarity drives zeolite diffusionless transformations and intergrowth journal October 2019
Mechanism of collective interstitial ordering in Fe–C alloys journal May 2020
A data-science approach to predict the heat capacity of nanoporous materials journal October 2022
Theory-guided design of catalytic materials using scaling relationships and reactivity descriptors journal November 2019
Machine learning for alloys journal July 2021
Human- and machine-centred designs of molecules and materials for sustainability and decarbonization journal August 2022
Unsupervised word embeddings capture latent knowledge from materials science literature journal July 2019
ElemNet: Deep Learning the Chemistry of Materials From Only Elemental Composition journal December 2018
Machine learning with persistent homology and chemical word embeddings improves prediction accuracy and interpretability in metal-organic frameworks journal April 2021
Active learning across intermetallics to guide discovery of electrocatalysts for CO2 reduction and H2 evolution journal September 2018
Operando identification of site-dependent water oxidation activity on ruthenium dioxide single-crystal surfaces journal May 2020
Topological methods for data modelling journal November 2020
Inverse design of nanoporous crystalline reticular materials with deep generative models journal January 2021
A geometric-information-enhanced crystal graph network for predicting properties of materials journal September 2021
Graph neural networks for materials science and chemistry journal November 2022
Learning properties of ordered and disordered materials from multi-fidelity data journal January 2021
A universal graph deep learning interatomic potential for the periodic table journal November 2022
Comparing molecules and solids across structural and alchemical space journal January 2016
ANI-1: an extensible neural network potential with DFT accuracy at force field computational cost journal January 2017
Machine learning material properties from the periodic table using convolutional neural networks journal January 2018
Graph convolutional neural networks with global attention for improved materials property prediction journal January 2020
Machine-enabled inverse design of inorganic solid materials: promises and challenges journal January 2020
Multi-scale approach for the prediction of atomic scale properties journal January 2021
Data-driven machine learning model for the prediction of oxygen vacancy formation energy of metal oxide materials journal January 2021
Multi-fidelity prediction of molecular optical peaks with deep learning journal January 2022
A transferable machine-learning scheme from pure metals to alloys for predicting adsorption energies journal January 2022
Atom-centered symmetry functions for constructing high-dimensional neural network potentials journal February 2011
Commentary: The Materials Project: A materials genome approach to accelerating materials innovation journal July 2013
SchNet – A deep learning architecture for molecules and materials journal June 2018
Atom-density representations for machine learning journal April 2019
Screening billions of candidates for solid lithium-ion conductors: A transfer learning approach for small data journal June 2019
Improved accuracy and transferability of molecular-orbital-based machine learning: Organics, transition-metal complexes, non-covalent interactions, and transition states journal February 2021
Materials representation and transfer learning for multi-property prediction journal June 2021
Representations and strategies for transferable machine learning improve model performance in chemical discovery journal February 2022
Transfer learning using attentions across atomic systems with graph neural networks (TAAG) journal May 2022
Learning atoms for materials discovery journal June 2018
Machine learning determination of atomic dynamics at grain boundaries journal October 2018
CLEASE: a versatile and user-friendly implementation of cluster expansion method journal May 2019
VoroTop : Voronoi cell topology visualization and analysis toolkit journal December 2017
Learning physical descriptors for materials science by compressed sensing journal February 2017
Machine learning for the modeling of interfaces in energy storage and conversion materials journal July 2019
Unified representation of molecules and crystals for machine learning journal November 2022
Atomic structure generation from reconstructing structural fingerprints journal November 2022
Ab initiomolecular dynamics for liquid metals journal January 1993
Crystal structure prediction via particle-swarm optimization journal September 2010
On representing chemical environments journal May 2013
Combinatorial screening for new materials in unconstrained composition space with machine learning journal March 2014
How to represent crystal structures for machine learning: Towards fast prediction of electronic properties journal May 2014
Including crystal structure attributes in machine learning models of formation energies via Voronoi tessellations journal July 2017
Atomic cluster expansion for accurate and transferable interatomic potentials journal January 2019
Fast and Accurate Modeling of Molecular Atomization Energies with Machine Learning journal January 2012
Big Data of Materials Science: Critical Role of the Descriptor journal March 2015
Machine Learning Energies of 2 Million Elpasolite ( A B C 2 D 6 ) Crystals journal September 2016
Crystal Graph Convolutional Neural Networks for an Accurate and Interpretable Prediction of Material Properties journal April 2018
Generalized Neural-Network Representation of High-Dimensional Potential-Energy Surfaces journal April 2007
Achieving DFT accuracy with a machine-learning interatomic potential: Thermomechanics and defects in bcc ferromagnetic iron journal January 2018
SISSO: A compressed-sensing method for identifying the best low-dimensional descriptor in an immensity of offered candidates journal August 2018
Developing an improved crystal graph convolutional neural network framework for accelerated materials discovery journal June 2020
Orbital graph convolutional neural network for material property prediction journal September 2020
Finding symmetry breaking order parameters with Euclidean neural networks journal January 2021
Voronoi cell analysis: The shapes of particle systems journal June 2022
Prediction of interface structures and energies via virtual screening journal November 2016
Machine learning unifies the modeling of materials and molecules journal December 2017
New tolerance factor to predict the stability of perovskite oxides and halides journal February 2019
Inverse design of porous materials using artificial neural networks journal January 2020
Crystal graph attention networks for the prediction of stable materials journal December 2021
Finding optimal surface sites on heterogeneous catalysts by counting nearest neighbors journal October 2015
Perovskites in catalysis and electrocatalysis journal November 2017
What Is High-Throughput Virtual Screening? A Perspective from Organic Materials Discovery journal July 2015
CRC Handbook of Chemistry and Physics book June 2016
Machine learning modeling of superconducting critical temperature dataset January 2022
Examining graph neural networks for crystal structures: limitations and opportunities for capturing periodicity preprint October 2022
Density of States Prediction for Materials Discovery via Contrastive Learning from Probabilistic Embeddings dataset January 2022
Learning local equivariant representations for large-scale atomistic dynamics dataset January 2022
A data-science approach to predict the heat capacity of nanoporous materials dataset January 2022
Graph neural network modeling of vacancy formation enthalpy for materials discovery and its application in solar thermochemical water splitting posted_content May 2022
Deep Generative Models for Materials Discovery and Machine Learning-Accelerated Innovation journal March 2022

Similar Records

Reliable and explainable machine-learning methods for accelerated material discovery
Journal Article · Wed Nov 13 23:00:00 EST 2019 · npj Computational Materials · OSTI ID:1619662

Representations and strategies for transferable machine learning improve model performance in chemical discovery
Journal Article · Mon Feb 14 23:00:00 EST 2022 · Journal of Chemical Physics · OSTI ID:1979037