skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: A Statistical Learning Framework for Materials Science: Application to Elastic Moduli of k-nary Inorganic Polycrystalline Compounds

Abstract

Materials scientists increasingly employ machine or statistical learning (SL) techniques to accelerate materials discovery and design. Such pursuits benefit from pooling training data across, and thus being able to generalize predictions over, k-nary compounds of diverse chemistries and structures. This work presents a SL framework that addresses challenges in materials science applications, where datasets are diverse but of modest size, and extreme values are often of interest. Our advances include the application of power or Hölder means to construct descriptors that generalize over chemistry and crystal structure, and the incorporation of multivariate local regression within a gradient boosting framework. The approach is demonstrated by developing SL models to predict bulk and shear moduli (K and G, respectively) for polycrystalline inorganic compounds, using 1,940 compounds from a growing database of calculated elastic moduli for metals, semiconductors and insulators. The usefulness of the models is illustrated by screening for superhard materials.

Authors:
 [1];  [2];  [3];  [2];  [4];  [2];  [4];  [3]
  1. Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Dept. of Materials Science and Engineering
  2. Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Energy Technologies Area
  3. Univ. of California, San Diego, CA (United States). Computational and Applied Statistics Laboratory
  4. Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Materials Sciences Division
Publication Date:
Research Org.:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Sponsoring Org.:
USDOE Office of Science (SC), Basic Energy Sciences (BES)
OSTI Identifier:
1377526
Grant/Contract Number:  
AC02-05CH11231; EDCBEE
Resource Type:
Journal Article: Accepted Manuscript
Journal Name:
Scientific Reports
Additional Journal Information:
Journal Volume: 6; Journal Issue: 1; Journal ID: ISSN 2045-2322
Publisher:
Nature Publishing Group
Country of Publication:
United States
Language:
English
Subject:
36 MATERIALS SCIENCE

Citation Formats

de Jong, Maarten, Chen, Wei, Notestine, Randy, Persson, Kristin, Ceder, Gerbrand, Jain, Anubhav, Asta, Mark, and Gamst, Anthony. A Statistical Learning Framework for Materials Science: Application to Elastic Moduli of k-nary Inorganic Polycrystalline Compounds. United States: N. p., 2016. Web. doi:10.1038/srep34256.
de Jong, Maarten, Chen, Wei, Notestine, Randy, Persson, Kristin, Ceder, Gerbrand, Jain, Anubhav, Asta, Mark, & Gamst, Anthony. A Statistical Learning Framework for Materials Science: Application to Elastic Moduli of k-nary Inorganic Polycrystalline Compounds. United States. https://doi.org/10.1038/srep34256
de Jong, Maarten, Chen, Wei, Notestine, Randy, Persson, Kristin, Ceder, Gerbrand, Jain, Anubhav, Asta, Mark, and Gamst, Anthony. 2016. "A Statistical Learning Framework for Materials Science: Application to Elastic Moduli of k-nary Inorganic Polycrystalline Compounds". United States. https://doi.org/10.1038/srep34256. https://www.osti.gov/servlets/purl/1377526.
@article{osti_1377526,
title = {A Statistical Learning Framework for Materials Science: Application to Elastic Moduli of k-nary Inorganic Polycrystalline Compounds},
author = {de Jong, Maarten and Chen, Wei and Notestine, Randy and Persson, Kristin and Ceder, Gerbrand and Jain, Anubhav and Asta, Mark and Gamst, Anthony},
abstractNote = {Materials scientists increasingly employ machine or statistical learning (SL) techniques to accelerate materials discovery and design. Such pursuits benefit from pooling training data across, and thus being able to generalize predictions over, k-nary compounds of diverse chemistries and structures. This work presents a SL framework that addresses challenges in materials science applications, where datasets are diverse but of modest size, and extreme values are often of interest. Our advances include the application of power or Hölder means to construct descriptors that generalize over chemistry and crystal structure, and the incorporation of multivariate local regression within a gradient boosting framework. The approach is demonstrated by developing SL models to predict bulk and shear moduli (K and G, respectively) for polycrystalline inorganic compounds, using 1,940 compounds from a growing database of calculated elastic moduli for metals, semiconductors and insulators. The usefulness of the models is illustrated by screening for superhard materials.},
doi = {10.1038/srep34256},
url = {https://www.osti.gov/biblio/1377526}, journal = {Scientific Reports},
issn = {2045-2322},
number = 1,
volume = 6,
place = {United States},
year = {Mon Oct 03 00:00:00 EDT 2016},
month = {Mon Oct 03 00:00:00 EDT 2016}
}

Journal Article:
Free Publicly Available Full Text
Publisher's Version of Record

Citation Metrics:
Cited by: 157 works
Citation information provided by
Web of Science

Save / Share:

Works referenced in this record:

Poisson's ratio and modern materials
journal, October 2011


Consistent Nonparametric Regression
journal, July 1977


Crystal structure representations for machine learning models of formation energies
text, January 2015


Computational predictions of energy materials using density functional theory
journal, January 2016


The bulk modulus-volume relationship for oxide compounds and related geophysical problems
journal, August 1965


The search for novel, superhard materials
journal, September 1999


Dissolving the Periodic Table in Cubic Zirconia: Data Mining to Discover Chemical Trends
journal, March 2014


AiiDA: Automated Interactive Infrastructure and Database for Computational Science
text, January 2015


A database to enable discovery and design of piezoelectric materials
journal, September 2015


The Elastic Behaviour of a Crystalline Aggregate
journal, May 1952


The high-throughput highway to computational materials design
journal, February 2013


Lower limit to the thermal conductivity of disordered crystals
journal, September 1992


Predicting Crystal Structures with Data Mining of Quantum Calculations
journal, September 2003


AFLOWLIB.ORG: A distributed materials properties repository from high-throughput ab initio calculations
journal, June 2012


Property prediction of crystalline solids from composition and crystal structure
journal, April 2016


Big Data of Materials Science - Critical Role of the Descriptor
text, January 2014


machine.
journal, October 2001


Fast and Accurate Modeling of Molecular Atomization Energies with Machine Learning
journal, January 2012


Synthesis, Crystal Structure, and Properties of Two Modifications of MgB12C2
journal, April 2007


Brief report: The bulk modulus-volume relationship for oxides
journal, June 1970


XCII. Relations between the elastic moduli and the plastic properties of polycrystalline pure metals
journal, August 1954


Extra-electron induced covalent strengthening and generalization of intrinsic ductile-to-brittle criterion
journal, October 2012


Effective ionic charge and bulk modulus scaling in rocksalt-structured rare-earth compounds
journal, September 1982


Crystal Structure Representations for Machine Learning Models of Formation Energies
preprint, January 2015


Quantum chemistry structures and properties of 134 kilo molecules
text, January 2014


Big Data of Materials Science: Critical Role of the Descriptor
journal, March 2015


Synthesis and crystal structure of Mg2B24C, a new boron-rich boride related to “tetragonal boron I”
journal, July 2006


A family of ductile intermetallic compounds
journal, August 2003


Charting the complete elastic properties of inorganic crystalline compounds
dataset, January 2022


Error Estimates for Solid-State Density-Functional Theory Predictions: An Overview by Means of the Ground-State Elemental Crystals
journal, October 2013


Materials Cartography: Representing and Mining Materials Space Using Structural and Electronic Fingerprints
journal, January 2015


Factors associated with sufficient knowledge of antibiotics and antimicrobial resistance in the Japanese general population
journal, February 2020


Predicting crystal structure by merging data mining with quantum mechanics
journal, July 2006


Python Materials Genomics (pymatgen): A robust, open-source python library for materials analysis
journal, February 2013


High-throughput and data mining with ab initio methods
journal, December 2004


Finding Density Functionals with Machine Learning
journal, June 2012


Least angle regression
journal, April 2004


Charting the complete elastic properties of inorganic crystalline compounds
journal, March 2015


Complex thermoelectric materials
journal, February 2008


Understanding the hcp anisotropy in Cd and Zn: the role of electron correlation in determining the potential energy surface
journal, January 2010


Fast and Accurate Modeling of Molecular Atomization Energies with Machine Learning
text, January 2011


New compound in the system Sc–Cr–B
journal, February 2004


AFLOW: An automatic framework for high-throughput materials discovery
text, January 2013


AiiDA: automated interactive infrastructure and database for computational science
journal, January 2016


The high-throughput highway to computational materials design
journal, February 2013


Adaptive Strategies for Materials Design using Uncertainties
journal, January 2016


Molecular Dynamics with On-the-Fly Machine Learning of Quantum-Mechanical Forces
journal, March 2015


Materials selection guidelines for low thermal conductivity thermal barrier coatings
journal, January 2003


Analytic relation between bulk moduli and lattice constants
journal, June 1987


The search for novel, superhard materials
journal, September 1999


A database to enable discovery and design of piezoelectric materials
journal, September 2015


Machine Learning in Materials Science
book, January 2016


Bulk modulus for polar covalent crystals
journal, October 2013


Robust Locally Weighted Regression and Smoothing Scatterplots
journal, December 1979


A database to enable discovery and design of piezoelectric materials
dataset, January 2015


Synthesis and Crystal Structure of Mg2B24C, a New Boron-Rich Boride Related to “Tetragonal Boron I”.
journal, September 2006


A proposed rigorous definition of coordination number
journal, September 1979


Theory of bulk moduli of hard solids
journal, November 1988


Engineering materials
book, January 2002


Hardness of T -carbon: Density functional theory calculations
journal, September 2011


Fast and accurate modeling of molecular atomization energies with machine learning
text, January 2012


Crystal structure representations for machine learning models of formation energies
journal, April 2015


Virtual Screening and QSAR Formulations for Crystal Chemistry
journal, February 2005


Commentary: The Materials Project: A materials genome approach to accelerating materials innovation
journal, July 2013


AiiDA: automated interactive infrastructure and database for computational science
journal, January 2016


Lattice structure of mercury: Influence of electronic correlation
journal, September 2006


Synthesis, Crystal Structure, and Properties of Two Modifications of MgB12C2.
journal, July 2007


Assessment and Validation of Machine Learning Methods for Predicting Molecular Atomization Energies
journal, July 2013


AFLOW: An automatic framework for high-throughput materials discovery
journal, June 2012


AFLOWLIB.ORG: A distributed materials properties repository from high-throughput ab initio calculations
journal, June 2012


Calculation of bulk moduli of diamond and zinc-blende solids
journal, December 1985


Complex thermoelectric materials
book, October 2010


Modeling hardness of polycrystalline materials and bulk metallic glasses
journal, September 2011


Adaptive Strategies for Materials Design using Uncertainties
journal, January 2016


Van der Waals contribution to the binding energy of noble metals
journal, October 1977


Complex thermoelectric materials
journal, February 2008


Quantum chemistry structures and properties of 134 kilo molecules
journal, August 2014


Quantum chemistry structures and properties of 134 kilo molecules
text, January 2014


Elementary Electronic Structure
book, June 1999


Combinatorial screening for new materials in unconstrained composition space with machine learning
journal, March 2014


Works referencing / citing this record:

Active learning in materials science with emphasis on adaptive sampling using uncertainties for targeted design
journal, February 2019


Empirical modeling of dopability in diamond-like semiconductors
journal, December 2018


Local electronic descriptors for solute-defect interactions in bcc refractory metals
journal, October 2019


Machine Learning Approaches for Thermoelectric Materials Research
journal, November 2019


Universal fragment descriptors for predicting properties of inorganic crystals
journal, June 2017


Modelling of framework materials at multiple scales: current practices and open questions
journal, May 2019

  • Fraux, Guillaume; Chibani, Siwar; Coudert, François-Xavier
  • Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, Vol. 377, Issue 2149
  • https://doi.org/10.1098/rsta.2018.0220

Universal fragment descriptors for predicting properties of inorganic crystals
text, January 2017


Machine learning for composite materials
journal, March 2019


Computational prediction of new auxetic materials
journal, August 2017


Recent advances and applications of machine learning in solid-state materials science
journal, August 2019


Machine learning properties of binary wurtzite superlattices
journal, January 2018


A general representation scheme for crystalline solids based on Voronoi-tessellation real feature values and atomic property data
journal, March 2018


Materials informatics
journal, April 2009


Materials informatics
journal, January 2018


Local electronic descriptors for solute-defect interactions in bcc refractory metals
journal, October 2019


Uncovering structure-property relationships of materials by subgroup discovery
journal, January 2017


A general representation scheme for crystalline solids based on Voronoi-tessellation real feature values and atomic property data
journal, March 2018


Prediction of the Composition and Hardness of High-Entropy Alloys by Machine Learning
journal, July 2019


Data-driven design of inorganic materials with the Automatic Flow Framework for Materials Discovery
journal, September 2018


ElemNet: Deep Learning the Chemistry of Materials From Only Elemental Composition
journal, December 2018


Nanoinformatics, and the big challenges for the science of small things
journal, January 2019


Materials Informatics
journal, April 2009


Genarris: Random generation of molecular crystal structures and fast screening with a Harris approximation
journal, June 2018


Thermal expansion of quaternary nitride coatings
journal, March 2018


Rocketsled: a software library for optimizing high-throughput computational searches
journal, April 2019


Exploring effective charge in electromigration using machine learning
text, January 2019


Exploring effective charge in electromigration using machine learning
journal, May 2019


A strategy to apply machine learning to small datasets in materials science
journal, May 2018


ElemNet: Deep Learning the Chemistry of Materials From Only Elemental Composition
journal, December 2018