Research data infrastructure for high-throughput experimental materials science
Abstract
The High-Throughput Experimental Materials Database (HTEM-DB, htem.nrel.gov) is a repository of inorganic thin-film materials data collected during combinatorial experiments at the National Renewable Energy Laboratory (NREL). This data asset is enabled by NREL's Research Data Infrastructure (RDI), a set of custom data tools that collect, process, and store experimental data and metadata. Here, we describe the experimental data flow from the RDI to the HTEM-DB to illustrate the strategies and best practices currently used for materials data at NREL. Integration of the data tools with experimental instruments establishes a data communication pipeline between experimental researchers and data scientists. This work motivates the creation of similar workflows at other institutions to aggregate valuable data and increase their usefulness for future machine learning studies. In turn, such data-driven studies can greatly accelerate the pace of discovery and design in the materials science domain.
- Authors:
- Publication Date:
- Research Org.:
- National Renewable Energy Lab. (NREL), Golden, CO (United States)
- Sponsoring Org.:
- USDOE Office of Energy Efficiency and Renewable Energy (EERE); USDOE National Renewable Energy Laboratory (NREL), Laboratory Directed Research and Development (LDRD) Program
- OSTI Identifier:
- 1829939
- Alternate Identifier(s):
- OSTI ID: 1832858
- Report Number(s):
- NREL/JA-5K00-80850
Journal ID: ISSN 2666-3899; S266638992100235X; 100373; PII: S266638992100235X
- Grant/Contract Number:
- AC36-08GO28308
- Resource Type:
- Published Article
- Journal Name:
- Patterns
- Additional Journal Information:
- Journal Name: Patterns Journal Volume: 2 Journal Issue: 12; Journal ID: ISSN 2666-3899
- Publisher:
- Elsevier
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 36 MATERIALS SCIENCE; data; experimental; high-throughput; materials; metadata; workflow
Citation Formats
Talley, Kevin R., White, Robert, Wunder, Nick, Eash, Matthew, Schwarting, Marcus, Evenson, Dave, Perkins, John D., Tumas, William, Munch, Kristin, Phillips, Caleb, and Zakutayev, Andriy. Research data infrastructure for high-throughput experimental materials science. United States: N. p., 2021.
Web. doi:10.1016/j.patter.2021.100373.
Talley, Kevin R., White, Robert, Wunder, Nick, Eash, Matthew, Schwarting, Marcus, Evenson, Dave, Perkins, John D., Tumas, William, Munch, Kristin, Phillips, Caleb, & Zakutayev, Andriy. Research data infrastructure for high-throughput experimental materials science. United States. https://doi.org/10.1016/j.patter.2021.100373
Talley, Kevin R., White, Robert, Wunder, Nick, Eash, Matthew, Schwarting, Marcus, Evenson, Dave, Perkins, John D., Tumas, William, Munch, Kristin, Phillips, Caleb, and Zakutayev, Andriy. Wed .
"Research data infrastructure for high-throughput experimental materials science". United States. https://doi.org/10.1016/j.patter.2021.100373.
@article{osti_1829939,
title = {Research data infrastructure for high-throughput experimental materials science},
author = {Talley, Kevin R. and White, Robert and Wunder, Nick and Eash, Matthew and Schwarting, Marcus and Evenson, Dave and Perkins, John D. and Tumas, William and Munch, Kristin and Phillips, Caleb and Zakutayev, Andriy},
abstractNote = {The High-Throughput Experimental Materials Database (HTEM-DB, htem.nrel.gov) is a repository of inorganic thin-film materials data collected during combinatorial experiments at the National Renewable Energy Laboratory (NREL). This data asset is enabled by NREL's Research Data Infrastructure (RDI), a set of custom data tools that collect, process, and store experimental data and metadata. Here, we describe the experimental data flow from the RDI to the HTEM-DB to illustrate the strategies and best practices currently used for materials data at NREL. Integration of the data tools with experimental instruments establishes a data communication pipeline between experimental researchers and data scientists. This work motivates the creation of similar workflows at other institutions to aggregate valuable data and increase their usefulness for future machine learning studies. In turn, such data-driven studies can greatly accelerate the pace of discovery and design in the materials science domain.},
doi = {10.1016/j.patter.2021.100373},
journal = {Patterns},
number = 12,
volume = 2,
place = {United States},
year = {Wed Dec 01 00:00:00 EST 2021},
month = {Wed Dec 01 00:00:00 EST 2021}
}
https://doi.org/10.1016/j.patter.2021.100373
Works referenced in this record:
Chemistry of Electrolyte Reduction on Lithium Silicide
journal, May 2019
- Xu, Yun; Wood, Kevin; Coyle, Jaclyn
- The Journal of Physical Chemistry C, Vol. 123, Issue 21
An open experimental database for exploring inorganic materials
journal, April 2018
- Zakutayev, Andriy; Wunder, Nick; Schwarting, Marcus
- Scientific Data, Vol. 5, Issue 1
Epitaxial growth of rock salt MgZrN 2 semiconductors on MgO and GaN
journal, March 2020
- Bauers, Sage R.; Mangum, John; Harvey, Steven P.
- Applied Physics Letters, Vol. 116, Issue 10
To Data Management and Beyond… for Photovoltaic Applications
journal, January 2003
- Nelson, Brent; Friedman, Dan; Geisz, John
- MRS Proceedings, Vol. 804
An Inter-Laboratory Study of Zn–Sn–Ti–O Thin Films using High-Throughput Experimental Methods
journal, March 2019
- Hattrick-Simpers, Jason R.; Zakutayev, Andriy; Barron, Sara C.
- ACS Combinatorial Science, Vol. 21, Issue 5
Towards a gold standard: regarding quality in public domain chemistry databases and approaches to improving the situation
journal, July 2012
- Williams, Antony J.; Ekins, Sean; Tkachenko, Valery
- Drug Discovery Today, Vol. 17, Issue 13-14
Commentary: The Materials Project: A materials genome approach to accelerating materials innovation
journal, July 2013
- Jain, Anubhav; Ong, Shyue Ping; Hautier, Geoffroy
- APL Materials, Vol. 1, Issue 1
Negative-pressure polymorphs made by heterostructural alloying
journal, April 2018
- Siol, Sebastian; Holder, Aaron; Steffes, James
- Science Advances, Vol. 4, Issue 4
An Open Combinatorial Diffraction Dataset Including Consensus Human and Machine Learning Labels with Quantified Uncertainty for Training New Machine Learning Models
journal, June 2021
- Hattrick-Simpers, Jason R.; DeCost, Brian; Kusne, A. Gilad
- Integrating Materials and Manufacturing Innovation, Vol. 10, Issue 2
Automated algorithms for band gap analysis from optical absorption spectra
journal, December 2017
- Schwarting, Marcus; Siol, Sebastian; Talley, Kevin
- Materials Discovery, Vol. 10
The Computational 2D Materials Database: high-throughput modeling and discovery of atomically thin crystals
journal, September 2018
- Haastrup, Sten; Strange, Mikkel; Pandey, Mohnish
- 2D Materials, Vol. 5, Issue 4
Synthesis, optical imaging, and absorption spectroscopy data for 179072 metal oxides
journal, March 2019
- Stein, Helge S.; Soedarmadji, Edwin; Newhouse, Paul F.
- Scientific Data, Vol. 6, Issue 1
Materials Design and Discovery with High-Throughput Density Functional Theory: The Open Quantum Materials Database (OQMD)
journal, September 2013
- Saal, James E.; Kirklin, Scott; Aykol, Muratahan
- JOM, Vol. 65, Issue 11
Fast-Track to Research Data Management in Experimental Material Science–Setting the Ground for Research Group Level Materials Digitalization
journal, June 2020
- Banko, Lars; Ludwig, Alfred
- ACS Combinatorial Science, Vol. 22, Issue 8
Trade-Offs in Thin Film Solar Cells with Layered Chalcostibite Photovoltaic Absorbers
journal, January 2017
- Welch, Adam W.; Baranowski, Lauryn L.; Peng, Haowei
- Advanced Energy Materials, Vol. 7, Issue 11
COMBIgor: Data-Analysis Package for Combinatorial Materials Science
journal, May 2019
- Talley, Kevin R.; Bauers, Sage R.; Melamed, Celeste L.
- ACS Combinatorial Science, Vol. 21, Issue 7
AFLOW: An automatic framework for high-throughput materials discovery
journal, June 2012
- Curtarolo, Stefano; Setyawan, Wahyu; Hart, Gus L. W.
- Computational Materials Science, Vol. 58
Synthesis and Characterization of (Sn,Zn)O Alloys
journal, October 2016
- Bikowski, Andre; Holder, Aaron; Peng, Haowei
- Chemistry of Materials, Vol. 28, Issue 21
Synthesis of Tunable SnS-TaS 2 Nanoscale Superlattices
journal, September 2020
- Roberts, Dennice M.; Bardgett, Dylan; Gorman, Brian P.
- Nano Letters, Vol. 20, Issue 10
The Inorganic Crystal Structure Database (ICSD)—Present and Future
journal, January 2004
- Hellenbrandt, Mariette
- Crystallography Reviews, Vol. 10, Issue 1
Theoretical Prediction and Experimental Realization of New Stable Inorganic Materials Using the Inverse Design Approach
journal, June 2013
- Zakutayev, Andriy; Zhang, Xiuwen; Nagaraja, Arpun
- Journal of the American Chemical Society, Vol. 135, Issue 27
The Experiment Data Depot: A Web-Based Software Tool for Biological Experimental Data Storage, Sharing, and Visualization
journal, September 2017
- Morrell, William C.; Birkel, Garrett W.; Forrer, Mark
- ACS Synthetic Biology, Vol. 6, Issue 12
Correcting density functional theory for accurate predictions of compound enthalpies of formation: Fitted elemental-phase reference energies
journal, March 2012
- Stevanović, Vladan; Lany, Stephan; Zhang, Xiuwen
- Physical Review B, Vol. 85, Issue 11
User applications driven by the community contribution framework MPContribs in the Materials Project: MPCONTRIBS-DRIVEN USER APPLICATIONS
journal, October 2015
- Huck, P.; Gunter, D.; Cholia, S.
- Concurrency and Computation: Practice and Experience, Vol. 28, Issue 7
Double-Site Substitution of Ce into (Ba, Sr)MnO 3 Perovskites for Solar Thermochemical Hydrogen Production
journal, August 2021
- Heo, Su Jeong; Sanders, Michael; O’Hayre, Ryan
- ACS Energy Letters, Vol. 6, Issue 9
Implications of heterostructural alloying for enhanced piezoelectric performance of (Al,Sc)N
journal, June 2018
- Talley, Kevin R.; Millican, Samantha L.; Mangum, John
- Physical Review Materials, Vol. 2, Issue 6