DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Analyzing machine learning models to accelerate generation of fundamental materials insights

Abstract

Abstract Machine learning for materials science envisions the acceleration of basic science research through automated identification of key data relationships to augment human interpretation and gain scientific understanding. A primary role of scientists is extraction of fundamental knowledge from data, and we demonstrate that this extraction can be accelerated using neural networks via analysis of the trained data model itself rather than its application as a prediction tool. Convolutional neural networks excel at modeling complex data relationships in multi-dimensional parameter spaces, such as that mapped by a combinatorial materials science experiment. Measuring a performance metric in a given materials space provides direct information about (locally) optimal materials but not the underlying materials science that gives rise to the variation in performance. By building a model that predicts performance (in this case photoelectrochemical power generation of a solar fuels photoanode) from materials parameters (in this case composition and Raman signal), subsequent analysis of gradients in the trained model reveals key data relationships that are not readily identified by human inspection or traditional statistical analyses. Human interpretation of these key relationships produces the desired fundamental understanding, demonstrating a framework in which machine learning accelerates data interpretation by leveraging the expertize of themore » human scientist. We also demonstrate the use of neural network gradient analysis to automate prediction of the directions in parameter space, such as the addition of specific alloying elements, that may increase performance by moving beyond the confines of existing data.« less

Authors:
ORCiD logo; ; ; ; ; ORCiD logo
Publication Date:
Research Org.:
California Institute of Technology (CalTech), Pasadena, CA (United States). Joint Center for Artificial Photosynthesis (JCAP)
Sponsoring Org.:
USDOE Office of Science (SC), Basic Energy Sciences (BES)
OSTI Identifier:
1619633
Alternate Identifier(s):
OSTI ID: 1575062
Grant/Contract Number:  
SC0004993
Resource Type:
Published Article
Journal Name:
npj Computational Materials
Additional Journal Information:
Journal Name: npj Computational Materials Journal Volume: 5 Journal Issue: 1; Journal ID: ISSN 2057-3960
Publisher:
Nature Publishing Group
Country of Publication:
United Kingdom
Language:
English
Subject:
36 MATERIALS SCIENCE

Citation Formats

Umehara, Mitsutaro, Stein, Helge S., Guevarra, Dan, Newhouse, Paul F., Boyd, David A., and Gregoire, John M. Analyzing machine learning models to accelerate generation of fundamental materials insights. United Kingdom: N. p., 2019. Web. doi:10.1038/s41524-019-0172-5.
Umehara, Mitsutaro, Stein, Helge S., Guevarra, Dan, Newhouse, Paul F., Boyd, David A., & Gregoire, John M. Analyzing machine learning models to accelerate generation of fundamental materials insights. United Kingdom. https://doi.org/10.1038/s41524-019-0172-5
Umehara, Mitsutaro, Stein, Helge S., Guevarra, Dan, Newhouse, Paul F., Boyd, David A., and Gregoire, John M. Fri . "Analyzing machine learning models to accelerate generation of fundamental materials insights". United Kingdom. https://doi.org/10.1038/s41524-019-0172-5.
@article{osti_1619633,
title = {Analyzing machine learning models to accelerate generation of fundamental materials insights},
author = {Umehara, Mitsutaro and Stein, Helge S. and Guevarra, Dan and Newhouse, Paul F. and Boyd, David A. and Gregoire, John M.},
abstractNote = {Abstract Machine learning for materials science envisions the acceleration of basic science research through automated identification of key data relationships to augment human interpretation and gain scientific understanding. A primary role of scientists is extraction of fundamental knowledge from data, and we demonstrate that this extraction can be accelerated using neural networks via analysis of the trained data model itself rather than its application as a prediction tool. Convolutional neural networks excel at modeling complex data relationships in multi-dimensional parameter spaces, such as that mapped by a combinatorial materials science experiment. Measuring a performance metric in a given materials space provides direct information about (locally) optimal materials but not the underlying materials science that gives rise to the variation in performance. By building a model that predicts performance (in this case photoelectrochemical power generation of a solar fuels photoanode) from materials parameters (in this case composition and Raman signal), subsequent analysis of gradients in the trained model reveals key data relationships that are not readily identified by human inspection or traditional statistical analyses. Human interpretation of these key relationships produces the desired fundamental understanding, demonstrating a framework in which machine learning accelerates data interpretation by leveraging the expertize of the human scientist. We also demonstrate the use of neural network gradient analysis to automate prediction of the directions in parameter space, such as the addition of specific alloying elements, that may increase performance by moving beyond the confines of existing data.},
doi = {10.1038/s41524-019-0172-5},
journal = {npj Computational Materials},
number = 1,
volume = 5,
place = {United Kingdom},
year = {Fri Mar 08 00:00:00 EST 2019},
month = {Fri Mar 08 00:00:00 EST 2019}
}

Journal Article:
Free Publicly Available Full Text
Publisher's Version of Record
https://doi.org/10.1038/s41524-019-0172-5

Citation Metrics:
Cited by: 46 works
Citation information provided by
Web of Science

Save / Share:

Works referenced in this record:

A simple constrained machine learning model for predicting high-pressure-hydrogen-compressor materials
journal, January 2018

  • Hattrick-Simpers, Jason R.; Choudhary, Kamal; Corgnale, Claudio
  • Molecular Systems Design & Engineering, Vol. 3, Issue 3
  • DOI: 10.1039/C8ME00005K

Learning atoms for materials discovery
journal, June 2018

  • Zhou, Quan; Tang, Peizhe; Liu, Shenxiu
  • Proceedings of the National Academy of Sciences, Vol. 115, Issue 28
  • DOI: 10.1073/pnas.1801181115

Effects of molybdenum substitution on the photocatalytic behavior of BiVO4
journal, January 2008

  • Yao, Weifeng; Iwai, Hideo; Ye, Jinhua
  • Dalton Transactions, Issue 11
  • DOI: 10.1039/b713338c

Systematic behaviour in trivalent lanthanide charge transfer energies
journal, November 2003


Transparent, explainable, and accountable AI for robotics
journal, May 2017


Machine learning modeling of superconducting critical temperature
journal, June 2018


High-throughput determination of structural phase diagram and constituent phases using GRENDEL
journal, October 2015


Accelerated search for materials with targeted properties by adaptive design
journal, April 2016

  • Xue, Dezhen; Balachandran, Prasanna V.; Hogden, John
  • Nature Communications, Vol. 7, Issue 1
  • DOI: 10.1038/ncomms11241

A Universal 3D Voxel Descriptor for Solid-State Material Informatics with Deep Convolutional Neural Networks
journal, December 2017


Combinatorial Materials Sciences: Experimental Strategies for Accelerated Knowledge Discovery
journal, August 2008


Perspective: Composition–structure–property mapping in high-throughput experiments: Turning data into knowledge
journal, May 2016

  • Hattrick-Simpers, Jason R.; Gregoire, John M.; Kusne, A. Gilad
  • APL Materials, Vol. 4, Issue 5
  • DOI: 10.1063/1.4950995

Solar fuels photoanode materials discovery by integrating high-throughput theory and experiment
journal, March 2017

  • Yan, Qimin; Yu, Jie; Suram, Santosh K.
  • Proceedings of the National Academy of Sciences, Vol. 114, Issue 12
  • DOI: 10.1073/pnas.1619940114

Building data-driven models with microstructural images: Generalization and interpretability
journal, December 2017


Mastering the game of Go with deep neural networks and tree search
journal, January 2016

  • Silver, David; Huang, Aja; Maddison, Chris J.
  • Nature, Vol. 529, Issue 7587
  • DOI: 10.1038/nature16961

Data Mined Ionic Substitutions for the Discovery of New Compounds
journal, January 2011

  • Hautier, Geoffroy; Fischer, Chris; Ehrlacher, Virginie
  • Inorganic Chemistry, Vol. 50, Issue 2
  • DOI: 10.1021/ic102031h

Unraveling compositional effects on the light-induced oxygen evolution in Bi(V–Mo–X)O 4 material libraries
journal, January 2017

  • Gutkowski, R.; Khare, C.; Conzuelo, F.
  • Energy & Environmental Science, Vol. 10, Issue 5
  • DOI: 10.1039/C7EE00287D

Structural, electronic and optical features of molybdenum-doped bismuth vanadium oxide
journal, March 2015


Learning surface molecular structures via machine vision
journal, August 2017

  • Ziatdinov, Maxim; Maksov, Artem; Kalinin, Sergei V.
  • npj Computational Materials, Vol. 3, Issue 1
  • DOI: 10.1038/s41524-017-0038-7

Vanadium(V) environments in bismuth vanadates: A structural investigation using Raman spectroscopy and solid state 51V NMR
journal, February 1991

  • Hardcastle, Franklin D.; Wachs, Israel E.; Eckert, Hellmut
  • Journal of Solid State Chemistry, Vol. 90, Issue 2
  • DOI: 10.1016/0022-4596(91)90135-5

Towards fully autonomous driving: Systems and algorithms
conference, June 2011

  • Levinson, Jesse; Askeland, Jake; Becker, Jan
  • 2011 IEEE Intelligent Vehicles Symposium (IV)
  • DOI: 10.1109/IVS.2011.5940562

Microstructure recognition using convolutional neural networks for prediction of ionic conductivity in ceramics
journal, December 2017


Low-level interpretability and high-level interpretability: a unified view of data-driven interpretable fuzzy system modelling
journal, December 2008


Transferability in Machine Learning for Electronic Structure via the Molecular Orbital Basis
journal, July 2018

  • Welborn, Matthew; Cheng, Lixue; Miller, Thomas F.
  • Journal of Chemical Theory and Computation, Vol. 14, Issue 9
  • DOI: 10.1021/acs.jctc.8b00636

The 2019 materials by design roadmap
journal, October 2018

  • Alberi, Kirstin; Nardelli, Marco Buongiorno; Zakutayev, Andriy
  • Journal of Physics D: Applied Physics, Vol. 52, Issue 1
  • DOI: 10.1088/1361-6463/aad926

Discovery of a useful thin-film dielectric using a composition-spread approach
journal, March 1998

  • van Dover, R. B.; Schneemeyer, L. F.; Fleming, R. M.
  • Nature, Vol. 392, Issue 6672
  • DOI: 10.1038/32381

Autonomy in materials research: a case study in carbon nanotube growth
journal, October 2016


Applications of high throughput (combinatorial) methodologies to electronic, magnetic, optical, and energy-related materials
journal, June 2013

  • Green, Martin L.; Takeuchi, Ichiro; Hattrick-Simpers, Jason R.
  • Journal of Applied Physics, Vol. 113, Issue 23
  • DOI: 10.1063/1.4803530

Automated Phase Mapping with AgileFD and its Application to Light Absorber Discovery in the V–Mn–Nb Oxide System
journal, December 2016


A general-purpose machine learning framework for predicting properties of inorganic materials
journal, August 2016


Multi-modal optimization of bismuth vanadate photoanodes via combinatorial alloying and hydrogen processing
journal, January 2019

  • Newhouse, P. F.; Guevarra, D.; Umehara, M.
  • Chemical Communications, Vol. 55, Issue 4
  • DOI: 10.1039/C8CC07156J

Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups
journal, November 2012


Solving the quantum many-body problem with artificial neural networks
journal, February 2017


Transparency by Design: Closing the Gap Between Performance and Interpretability in Visual Reasoning
conference, June 2018

  • Mascharka, David; Tran, Philip; Soklaski, Ryan
  • 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
  • DOI: 10.1109/CVPR.2018.00519

Synthesis and characterisation of bismuth(III) vanadate
journal, June 2005


Combinatorial alloying improves bismuth vanadate photoanodes via reduced monoclinic distortion
journal, January 2018

  • Newhouse, P. F.; Guevarra, D.; Umehara, M.
  • Energy & Environmental Science, Vol. 11, Issue 9
  • DOI: 10.1039/C8EE00179K