DOE PAGES · U.S. Department of Energy
Office of Scientific and Technical Information

Title: Density of states prediction for materials discovery via contrastive learning from probabilistic embeddings

Journal Article · Nature Communications

Abstract: Machine learning for materials discovery has largely focused on predicting an individual scalar rather than multiple related properties, where spectral properties are an important example. Fundamental spectral properties include the phonon density of states (phDOS) and the electronic density of states (eDOS), which individually or collectively are the origins of a breadth of materials observables and functions. Building upon the success of graph attention networks for encoding crystalline materials, we introduce a probabilistic embedding generator specifically tailored to the prediction of spectral properties. Coupled with supervised contrastive learning, our materials-to-spectrum (Mat2Spec) model outperforms state-of-the-art methods for predicting ab initio phDOS and eDOS for crystalline materials. We demonstrate Mat2Spec’s ability to identify eDOS gaps below the Fermi energy, validating predictions with ab initio calculations and thereby discovering candidate thermoelectrics and transparent conductors. Mat2Spec is an exemplar framework for predicting spectral properties of materials via strategically incorporated machine learning techniques.
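
As a rough illustration of the gap-identification idea mentioned in the abstract, the sketch below scans a sampled eDOS for energy windows below the Fermi energy where the DOS is effectively zero. It is not Mat2Spec's procedure, which the abstract does not detail; the function name `find_gaps_below_fermi`, the `threshold` and `min_width` parameters, and the toy data are all assumptions made for illustration.

```python
from typing import List, Tuple


def find_gaps_below_fermi(
    energies: List[float],
    dos: List[float],
    fermi_energy: float = 0.0,
    threshold: float = 1e-3,
    min_width: float = 0.1,
) -> List[Tuple[float, float]]:
    """Return (start, end) energy windows below the Fermi energy
    where the DOS stays under `threshold`.

    Hypothetical helper: assumes `energies` is sorted ascending and
    aligned element-by-element with `dos`.
    """
    gaps = []
    start = None  # energy at which the current low-DOS run began
    last = None   # most recent energy inside the current run
    for energy, value in zip(energies, dos):
        in_gap = value < threshold and energy < fermi_energy
        if in_gap:
            if start is None:
                start = energy
            last = energy
        elif start is not None:
            # The run just ended; keep it only if it is wide enough.
            if last - start >= min_width:
                gaps.append((start, last))
            start = None
    # Close a run that extends to the end of the sampled range.
    if start is not None and last - start >= min_width:
        gaps.append((start, last))
    return gaps


# Toy eDOS on a coarse grid (energies relative to the Fermi energy, in eV).
energies = [-3.0, -2.5, -2.0, -1.5, -1.0, -0.5, 0.0, 0.5]
dos = [1.2, 0.9, 0.0, 0.0, 0.0, 0.8, 1.1, 0.0]
gaps = find_gaps_below_fermi(energies, dos, min_width=0.5)
print(gaps)
```

In this toy case the zero-DOS point at 0.5 eV is ignored because it lies above the Fermi energy; only the window spanning the grid points from -2.0 to -1.0 eV is reported. In practice a predicted spectrum would be noisy, so the threshold and minimum width would need tuning to the energy grid and the model's error.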

Research Organization:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Sponsoring Organization:
Toyota Research Institute; USDOE; USDOE Office of Science (SC), Basic Energy Sciences (BES)
Grant/Contract Number:
AC02-05CH11231; SC0020383
OSTI ID:
1845605
Journal Information:
Nature Communications, Vol. 13, Issue 1; ISSN 2041-1723
Publisher:
Nature Publishing Group
Country of Publication:
United Kingdom
Language:
English
