Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Enhancing materials property prediction by leveraging computational and experimental data using deep transfer learning

Journal Article · · Nature Communications
Abstract

The current predictive modeling techniques applied to Density Functional Theory (DFT) computations have helped accelerate the process of materials discovery by providing significantly faster methods to scan materials candidates, thereby reducing the search space for future DFT computations and experiments. However, in addition to prediction error against DFT-computed properties, such predictive models also inherit the DFT-computation discrepancies against experimentally measured properties. To address this challenge, we demonstrate that using deep transfer learning, existing large DFT-computational data sets (such as the Open Quantum Materials Database (OQMD)) can be leveraged together with other smaller DFT-computed data sets as well as available experimental observations to build robust prediction models. We build a highly accurate model for predicting formation energy of materials from their compositions; using an experimental data set of $$1,643$$ 1 , 643 observations, the proposed approach yields a mean absolute error (MAE) of $$0.07$$ 0.07 eV/atom, which is significantly better than existing machine learning (ML) prediction modeling based on DFT computations and is comparable to the MAE of DFT-computation itself.

Research Organization:
National Inst. of Standards and Technology (NIST), Gaithersburg, MD (United States); Northwestern Univ., Evanston, IL (United States)
Sponsoring Organization:
Dept. of Commerce (United States); USDOE; USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR) (SC-21)
Grant/Contract Number:
SC0014330; SC0019358
OSTI ID:
1619594
Alternate ID(s):
OSTI ID: 1624217
Journal Information:
Nature Communications, Journal Name: Nature Communications Journal Issue: 1 Vol. 10; ISSN 2041-1723
Publisher:
Nature Publishing GroupCopyright Statement
Country of Publication:
United Kingdom
Language:
English

References (49)

Data-Driven Model for Estimation of Friction Coefficient Via Informatics Methods journal May 2012
Materials Design and Discovery with High-Throughput Density Functional Theory: The Open Quantum Materials Database (OQMD) journal September 2013
Recent progress in metallurgical thermochemistry journal January 1969
Understanding NOx formation in nonpremixed flames: Experiments and modeling journal January 1995
A high-throughput infrastructure for density functional theory calculations journal June 2011
AFLOWLIB.ORG: A distributed materials properties repository from high-throughput ab initio calculations journal June 2012
Matminer: An open source toolkit for materials data mining journal September 2018
An introduction to ROC analysis journal June 2006
Extracting Grain Orientations from EBSD Patterns of Polycrystalline Materials Using Convolutional Neural Networks journal October 2018
From Organized High-Throughput Data to Phenomenological Theory using Machine Learning: The Example of Dielectric Breakdown journal February 2016
High-Throughput Machine-Learning-Driven Synthesis of Full-Heusler Compounds journal October 2016
Predicting the Thermodynamic Stability of Solids Combining Density Functional Theory and Machine Learning journal May 2017
How Chemical Composition Alone Can Predict Vibrational Free Energies and Entropies of Solids journal July 2017
Rapid and Accurate Machine Learning Recognition of High Performing Metal Organic Frameworks for CO 2 Capture journal August 2014
Deep learning journal May 2015
Machine-learning-assisted materials discovery using failed experiments journal May 2016
Universal fragment descriptors for predicting properties of inorganic crystals journal June 2017
The Open Quantum Materials Database (OQMD): assessing the accuracy of DFT formation energies journal December 2015
A general-purpose machine learning framework for predicting properties of inorganic materials journal August 2016
Computational screening of high-performance optoelectronic materials using OptB88vdW and TB-mBJ formalisms journal May 2018
Accelerating materials property predictions using machine learning journal September 2013
MoleculeNet: a benchmark for molecular machine learning journal January 2018
Optimizing transition states via kernel-based machine learning journal May 2012
Commentary: The Materials Project: A materials genome approach to accelerating materials innovation journal July 2013
Perspective: Materials informatics and big data: Realization of the “fourth paradigm” of science in materials science journal April 2016
SchNet – A deep learning architecture for molecules and materials journal June 2018
Machine learning of molecular electronic properties in chemical compound space journal September 2013
Comparison of theory with quenching experiments for the entropy and enthalpy of vacancy formation in Si and Ge journal October 1976
Properties of intrinsic point defects in silicon determined by zinc diffusion experiments under nonequilibrium conditions journal December 1995
Formation enthalpies by mixing GGA and GGA + U calculations journal July 2011
Combinatorial screening for new materials in unconstrained composition space with machine learning journal March 2014
Predicting density functional theory total energies and enthalpies of formation of metal-nonmetal compounds by linear regression journal February 2016
Prediction model of band gap for inorganic compounds by combination of density functional theory calculations and machine learning techniques journal March 2016
Representation of compounds for machine-learning prediction of physical properties journal April 2017
Including crystal structure attributes in machine learning models of formation energies via Voronoi tessellations journal July 2017
Elastic properties of bulk and low-dimensional materials using van der Waals density functional journal July 2018
Machine Learning Energies of 2 Million Elpasolite ( A B C 2 D 6 ) Crystals journal September 2016
Diffusion of Ge below the Si(100) Surface: Theory and Experiment journal March 2000
Matrix- and tensor-based recommender systems for the discovery of currently unknown inorganic compounds journal January 2018
Machine learning with force-field-inspired descriptors for materials: Fast screening and mapping energy landscape journal August 2018
Nobel Lecture: Electronic structure of matter—wave functions and density functionals journal October 1999
A Survey on Transfer Learning journal October 2010
Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning journal May 2016
IRNet: A General Purpose Deep Residual Regression Framework for Materials Discovery
  • Jha, Dipendra; Ward, Logan; Yang, Zijiang
  • KDD '19: The 25th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining https://doi.org/10.1145/3292500.3330703
conference July 2019
Exploration of data science techniques to predict fatigue strength of steel from composition and processing parameters journal April 2014
Phase Diagrams of the Elements book December 1991
Deep materials informatics: Applications of deep learning in materials science journal June 2019
Toward Computational Materials Design: The Impact of Density Functional Theory on Materials Research journal September 2006
Dropout from Higher Education: A Theoretical Synthesis of Recent Research journal March 1975