DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Simultaneously improving accuracy and computational cost under parametric constraints in materials property prediction tasks

Journal Article · · Journal of Cheminformatics

Abstract Modern data mining techniques using machine learning (ML) and deep learning (DL) algorithms have been shown to excel in the regression-based task of materials property prediction using various materials representations. In an attempt to improve the predictive performance of the deep neural network model, researchers have tried to add more layers as well as develop new architectural components to create sophisticated and deep neural network models that can aid in the training process and improve the predictive ability of the final model. However, usually, these modifications require a lot of computational resources, thereby further increasing the already large model training time, which is often not feasible, thereby limiting usage for most researchers. In this paper, we study and propose a deep neural network framework for regression-based problems comprising of fully connected layers that can work with any numerical vector-based materials representations as model input. We present a novel deep regression neural network, iBRNet, with branched skip connections and multiple schedulers, which can reduce the number of parameters used to construct the model, improve the accuracy, and decrease the training time of the predictive model. We perform the model training using composition-based numerical vectors representing the elemental fractions of the respective materials and compare their performance against other traditional ML and several known DL architectures. Using multiple datasets with varying data sizes for training and testing, We show that the proposed iBRNet models outperform the state-of-the-art ML and DL models for all data sizes. We also show that the branched structure and usage of multiple schedulers lead to fewer parameters and faster model training time with better convergence than other neural networks. Scientific contribution: The combination of multiple callback functions in deep neural networks minimizes training time and maximizes accuracy in a controlled computational environment with parametric constraints for the task of materials property prediction.

Sponsoring Organization:
USDOE
OSTI ID:
2305758
Alternate ID(s):
OSTI ID: 2472008
Journal Information:
Journal of Cheminformatics, Journal Name: Journal of Cheminformatics Journal Issue: 1 Vol. 16; ISSN 1758-2946
Publisher:
Springer Science + Business MediaCopyright Statement
Country of Publication:
United Kingdom
Language:
English

References (40)

Theoretical prediction of high melting temperature for a Mo–Ru–Ta–W HCP multiprincipal element alloy journal January 2021
Materials Informatics: The Materials “Gene” and Big Data journal July 2015
A general-purpose machine learning framework for predicting properties of inorganic materials journal August 2016
Deep materials informatics: Applications of deep learning in materials science journal June 2019
Atomistic calculations and materials informatics: A review journal June 2017
Developing an improved crystal graph convolutional neural network framework for accelerated materials discovery journal June 2020
SchNet – A deep learning architecture for molecules and materials journal June 2018
Commentary: The Materials Project: A materials genome approach to accelerating materials innovation journal July 2013
Evolution of artificial intelligence for application in contemporary materials science journal August 2023
Unsupervised word embeddings capture latent knowledge from materials science literature journal July 2019
Importance of input data normalization for the application of neural networks to complex industrial problems journal June 1997
Machine Learning Energies of 2 Million Elpasolite ( A B C 2 D 6 ) Crystals journal September 2016
Graph Networks as a Universal Machine Learning Framework for Molecules and Crystals journal April 2019
AFLOWLIB.ORG: A distributed materials properties repository from high-throughput ab initio calculations journal June 2012
Predicting materials properties without crystal structure: deep representation learning from stoichiometry journal December 2020
Physics-based Data-Augmented Deep Learning for Enhanced Autogenous Shrinkage Prediction on Experimental Dataset conference August 2023
Demystifying Learning Rate Policies for High Accuracy Training of Deep Neural Networks conference December 2019
Machine learning of molecular electronic properties in chemical compound space journal September 2013
The high-throughput highway to computational materials design journal February 2013
Improving deep learning model performance under parametric constraints for materials informatics applications journal June 2023
The Open Quantum Materials Database (OQMD): assessing the accuracy of DFT formation energies journal December 2015
Machine learning in materials informatics: recent applications and prospects journal December 2017
Learning atoms for materials discovery journal June 2018
Matminer: An open source toolkit for materials data mining journal September 2018
Combinatorial screening for new materials in unconstrained composition space with machine learning journal March 2014
MPpredictor: An Artificial Intelligence-Driven Web Tool for Composition-Based Material Property Prediction journal March 2023
Perspective: Materials informatics and big data: Realization of the “fourth paradigm” of science in materials science journal April 2016
Mitigating bias in materials science data journal April 2023
Robust model benchmarking and bias-imbalance in data-driven materials science: a case study on MODNet journal July 2021
Pre-Activation based Representation Learning to Enhance Predictive Analytics on Small Materials Data conference June 2023
AI for Learning Deformation Behavior of a Material: Predicting Stress-Strain Curves 4000x Faster Than Simulations conference June 2023
An AI-driven microstructure optimization framework for elastic properties of titanium beyond cubic crystal systems journal June 2023
Compositionally restricted attention-based network for materials property predictions journal May 2021
Materials science with large-scale data and informatics: Unlocking new opportunities journal May 2016
Representation of compounds for machine-learning prediction of physical properties journal April 2017
Materials property prediction for limited datasets enabled by feature selection and joint learning with MODNet journal June 2021
Crystal Graph Convolutional Neural Networks for an Accurate and Interpretable Prediction of Material Properties journal April 2018
BRNet: Branched Residual Network for Fast and Accurate Predictive Modeling of Materials Properties book January 2022
Learning from the Harvard Clean Energy Project: The Use of Neural Networks to Accelerate Materials Discovery journal September 2015
Quantifying uncertainty in high-throughput density functional theory: A comparison of AFLOW, Materials Project, and OQMD journal May 2023