skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Can machine learning identify the next high-temperature superconductor? Examining extrapolation performance for materials discovery

Journal Article · · Molecular Systems Design & Engineering
DOI:https://doi.org/10.1039/C8ME00012C· OSTI ID:1464980

Traditional machine learning (ML) metrics overestimate model performance for materials discovery. We introduce (1) leave-one-cluster-out cross-validation (LOCO CV) and (2) a simple nearest-neighbor benchmark to show that model performance in discovery applications strongly depends on the problem, data sampling, and extrapolation. Our results suggest that ML-guided iterative experimentation may outperform standard high-throughput screening for discovering breakthrough materials like high-Tc superconductors with ML.

Research Organization:
Argonne National Lab. (ANL), Argonne, IL (United States)
Sponsoring Organization:
USDOE; National Institute of Standards and Technology (NIST)
Grant/Contract Number:
AC02-06CH11357
OSTI ID:
1464980
Alternate ID(s):
OSTI ID: 1484501
Journal Information:
Molecular Systems Design & Engineering, Journal Name: Molecular Systems Design & Engineering Vol. 3 Journal Issue: 5; ISSN 2058-9689
Publisher:
Royal Society of Chemistry (RSC)Copyright Statement
Country of Publication:
United Kingdom
Language:
English
Citation Metrics:
Cited by: 90 works
Citation information provided by
Web of Science

References (27)

COMBO: An efficient Bayesian optimization library for materials science journal June 2016
Accelerated discovery of metallic glasses through iteration of machine learning and high-throughput experiments journal April 2018
Random Forests and Adaptive Nearest Neighbors journal June 2006
High-Dimensional Materials and Process Optimization Using Data-Driven Experimental Design with Well-Calibrated Uncertainty Estimates journal July 2017
Accelerated search for materials with targeted properties by adaptive design journal April 2016
How Chemical Composition Alone Can Predict Vibrational Free Energies and Entropies of Solids journal July 2017
Silhouettes: A graphical aid to the interpretation and validation of cluster analysis journal November 1987
Deep Neural Nets as a Method for Quantitative Structure–Activity Relationships journal February 2015
On-the-fly machine-learning for high-throughput experiments: search for rare-earth-free permanent magnets journal September 2014
Prediction Errors of Molecular Machine Learning Models Lower than Hybrid DFT Error journal October 2017
Less is more: Sampling chemical space with active learning journal June 2018
Exploration of data science techniques to predict fatigue strength of steel from composition and processing parameters journal April 2014
Extracting knowledge from molecular mechanics simulations of grain boundaries using machine learning journal July 2017
High-Throughput Machine-Learning-Driven Synthesis of Full-Heusler Compounds journal October 2016
Cross-validation strategies for data with temporal, spatial, hierarchical, or phylogenetic structure journal March 2017
Materials science with large-scale data and informatics: Unlocking new opportunities journal May 2016
Ridge Regression: Biased Estimation for Nonorthogonal Problems journal February 1970
Atomistic calculations and materials informatics: A review journal June 2017
Accelerating Chemical Discovery with Machine Learning: Simulated Evolution of Spin Crossover Complexes with an Artificial Neural Network journal February 2018
Formation enthalpies for transition metal alloys using machine learning journal June 2017
Design of a nickel-base superalloy using a neural network journal October 2017
A general-purpose machine learning framework for predicting properties of inorganic materials journal August 2016
Discovery of Intermetallic Compounds from Traditional to Machine-Learning Approaches journal December 2017
Prediction of polymer properties using infinite chain descriptors (ICD) and machine learning: Toward optimized dielectric polymeric materials journal August 2016
Combinatorial screening for new materials in unconstrained composition space with machine learning journal March 2014
Prediction model of band gap for inorganic compounds by combination of density functional theory calculations and machine learning techniques journal March 2016
3D printing of high-strength aluminium alloys journal September 2017

Cited By (6)

Recent advances and applications of machine learning in solid-state materials science journal August 2019
Materials science in the artificial intelligence age: high-throughput library generation, machine learning, and a pathway from correlations to the underpinning physics journal July 2019
Data‐Driven Materials Science: Status, Challenges, and Perspectives journal September 2019
Concurrent Optimization of Organic Donor–Acceptor Pairs through Machine Learning journal September 2019
A Critical Review of Machine Learning of Energy Materials journal January 2020
Predicting structure/property relationships in multi-dimensional nanoparticle data using t-distributed stochastic neighbour embedding and machine learning journal January 2019

Figures / Tables (5)


Similar Records

Machine Learning for the Discovery, Design, and Engineering of Materials
Journal Article · Fri Jun 10 00:00:00 EDT 2022 · Annual Review of Chemical and Biomolecular Engineering · OSTI ID:1464980

Machine Learning Applied to Zeolite Synthesis: The Missing Link for Realizing High-Throughput Discovery
Journal Article · Wed Sep 25 00:00:00 EDT 2019 · Accounts of Chemical Research · OSTI ID:1464980

Multifidelity Information Fusion with Machine Learning: A Case Study of Dopant Formation Energies in Hafnia
Journal Article · Tue Apr 16 00:00:00 EDT 2019 · ACS Applied Materials and Interfaces · OSTI ID:1464980