DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Cross-property deep transfer learning framework for enhanced predictive analytics on small materials data

Journal Article · · Nature Communications

Abstract Artificial intelligence (AI) and machine learning (ML) have been increasingly used in materials science to build predictive models and accelerate discovery. For selected properties, availability of large databases has also facilitated application of deep learning (DL) and transfer learning (TL). However, unavailability of large datasets for a majority of properties prohibits widespread application of DL/TL. We present a cross-property deep-transfer-learning framework that leverages models trained on large datasets to build models on small datasets of different properties. We test the proposed framework on 39 computational and two experimental datasets and find that the TL models with only elemental fractions as input outperform ML/DL models trained from scratch even when they are allowed to use physical attributes as input, for 27/39 (≈ 69%) computational and both the experimental datasets. We believe that the proposed framework can be widely useful to tackle the small data challenge in applying AI/ML in materials science.

Research Organization:
Northwestern University, Evanston, IL (United States)
Sponsoring Organization:
National Institute of Standards and Technology (NIST); USDOE; USDOE Office of Science (SC)
Grant/Contract Number:
SC0014330; SC0019358; SC0021399
OSTI ID:
1830279
Journal Information:
Nature Communications, Journal Name: Nature Communications Journal Issue: 1 Vol. 12; ISSN 2041-1723
Publisher:
Nature Publishing GroupCopyright Statement
Country of Publication:
United Kingdom
Language:
English

References (61)

Accelerated Materials Design of Lithium Superionic Conductors Based on First-Principles Calculations and Machine Learning Algorithms journal April 2013
Adaptive machine learning framework to accelerate ab initio molecular dynamics journal December 2014
Expanding Materials Selection Via Transfer Learning for High-Temperature Oxide Selection journal November 2020
Material structure-property linkages using three-dimensional convolutional neural networks journal March 2018
AFLOWLIB.ORG: A distributed materials properties repository from high-throughput ab initio calculations journal June 2012
Inverse design of composite metal oxide optical materials based on deep transfer learning and global optimization journal February 2021
Transfer learning for materials informatics using crystal graph convolutional neural network journal April 2021
Machine learning in materials science: From explainable predictions to autonomous design journal June 2021
Deep Convolutional Neural Networks with transfer learning for computer vision-based data-driven pavement distress detection journal December 2017
Atomistic calculations and materials informatics: A review journal June 2017
Computational Data-Driven Materials Discovery journal February 2021
Data-Driven Strategies for Accelerated Materials Design journal February 2021
Graph Networks as a Universal Machine Learning Framework for Molecules and Crystals journal April 2019
Generating Focused Molecule Libraries for Drug Discovery with Recurrent Neural Networks journal December 2017
Predicting Materials Properties with Little Data Using Shotgun Transfer Learning journal September 2019
Challenges for Density Functional Theory journal December 2011
Accelerated search for materials with targeted properties by adaptive design journal April 2016
The high-throughput highway to computational materials design journal February 2013
The Open Quantum Materials Database (OQMD): assessing the accuracy of DFT formation energies journal December 2015
A general-purpose machine learning framework for predicting properties of inorganic materials journal August 2016
Double-slit photoelectron interference in strong-field ionization of the neon dimer journal January 2019
Predicting materials properties without crystal structure: deep representation learning from stoichiometry journal December 2020
Machine learning in materials informatics: recent applications and prospects journal December 2017
Reliable and explainable machine-learning methods for accelerated material discovery journal November 2019
Machine learning enabled autonomous microstructural characterization in 3D samples journal January 2020
The joint automated repository for various integrated simulations (JARVIS) for data-driven materials design journal November 2020
Theoretical prediction of high melting temperature for a Mo–Ru–Ta–W HCP multiprincipal element alloy journal January 2021
A general and transferable deep learning framework for predicting phase formation in materials journal January 2021
Machine-learned potentials for next-generation matter simulations journal May 2021
Accelerating the discovery of materials for clean energy in the era of smart automation journal April 2018
Machine learning for molecular and materials science journal July 2018
Bayesian-Driven First-Principles Calculations for Accelerating Exploration of Fast Ion Conductors for Rechargeable Battery Application journal April 2018
Plasma Hsp90 levels in patients with systemic sclerosis and relation to lung and skin involvement: a cross-sectional and longitudinal study journal January 2021
A predictive machine learning approach for microstructure optimization and materials design journal June 2015
Holistic computational structure screening of more than 12 000 candidates for solid lithium-ion conductor materials journal January 2017
Commentary: The Materials Project: A materials genome approach to accelerating materials innovation journal July 2013
Perspective: Materials informatics and big data: Realization of the “fourth paradigm” of science in materials science journal April 2016
SchNet – A deep learning architecture for molecules and materials journal June 2018
Screening billions of candidates for solid lithium-ion conductors: A transfer learning approach for small data journal June 2019
Perspective on integrating machine learning into computational chemistry and materials science journal June 2021
Machine learning of molecular electronic properties in chemical compound space journal September 2013
Data mining for materials: Computational experiments with A B compounds journal March 2012
Machine learning with systematic density-functional theory calculations: Application to melting temperatures of single- and binary-component solids journal February 2014
Combinatorial screening for new materials in unconstrained composition space with machine learning journal March 2014
Prediction model of band gap for inorganic compounds by combination of density functional theory calculations and machine learning techniques journal March 2016
Representation of compounds for machine-learning prediction of physical properties journal April 2017
Big Data of Materials Science: Critical Role of the Descriptor journal March 2015
Prediction of Low-Thermal-Conductivity Compounds with First-Principles Anharmonic Lattice-Dynamics Calculations and Bayesian Optimization journal November 2015
Machine Learning Energies of 2 Million Elpasolite ( A B C 2 D 6 ) Crystals journal September 2016
Crystal Graph Convolutional Neural Networks for an Accurate and Interpretable Prediction of Material Properties journal April 2018
Knowledge-transfer-based cost-effective search for interface structures: A case study on fcc-Al [110] tilt grain boundary journal November 2018
Inverse molecular design using machine learning: Generative models for matter engineering journal July 2018
IRNet: A General Purpose Deep Residual Regression Framework for Materials Discovery
  • Jha, Dipendra; Ward, Logan; Yang, Zijiang
  • KDD '19: The 25th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining https://doi.org/10.1145/3292500.3330703
conference July 2019
Materials Informatics: The Materials “Gene” and Big Data journal July 2015
Opportunities and Challenges for Machine Learning in Materials Science journal July 2020
Handbook of Parametric and Nonparametric Statistical Procedures book August 2003
Deep materials informatics: Applications of deep learning in materials science journal June 2019
Materials science with large-scale data and informatics: Unlocking new opportunities journal May 2016
Hyperopt-Sklearn: Automatic Hyperparameter Configuration for Scikit-Learn conference January 2014
Transfer Learning book January 2010
Transfer Learning to Accelerate Interface Structure Searches journal December 2017