DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Auto-generating databases of Yield Strength and Grain Size using ChemDataExtractor

Journal Article · · Scientific Data

Abstract The emerging field of material-based data science requires information-rich databases to generate useful results which are currently sparse in the stress engineering domain. To this end, this study uses the’materials-aware’ text-mining toolkit, ChemDataExtractor, to auto-generate databases of yield-strength and grain-size values by extracting such information from the literature. The precision of the extracted data is 83.0% for yield strength and 78.8% for grain size. The automatically-extracted data were organised into four databases: a Yield Strength, Grain Size, Engineering-Ready Yield Strength and Combined database. For further validation of the databases, the Combined database was used to plot the Hall-Petch relationship for, the alloy, AZ31, and similar results to the literature were found, demonstrating how one can make use of these automatically-extracted datasets.

Research Organization:
Argonne National Laboratory (ANL), Argonne, IL (United States). Argonne Leadership Computing Facility (ALCF)
Sponsoring Organization:
USDOE; USDOE Office of Science (SC)
Grant/Contract Number:
AC02-06CH11357
OSTI ID:
1871768
Journal Information:
Scientific Data, Journal Name: Scientific Data Journal Issue: 1 Vol. 9; ISSN 2052-4463
Publisher:
Nature Publishing GroupCopyright Statement
Country of Publication:
United Kingdom
Language:
English

References (15)

Tailoring Microstructure and Properties of Fine Grained Magnesium Alloys by Severe Plastic Deformation journal November 2017
Materials Data Infrastructure: A Case Study of the Citrination Platform to Examine Data Import, Storage, and Access journal June 2016
The Materials Data Facility: Data Services to Advance Materials Science Research journal July 2016
The Materials Genome Initiative, the interplay of experiment, theory and computation journal April 2014
Information Retrieval and Text Mining Technologies for Chemistry journal May 2017
ChemDataExtractor 2.0: Autopopulated Ontologies for Materials Science journal September 2021
ChemDataExtractor: A Toolkit for Automated Extraction of Chemical Information from the Scientific Literature journal October 2016
Text-mined dataset of inorganic materials synthesis recipes journal October 2019
Commentary: The Materials Project: A materials genome approach to accelerating materials innovation journal July 2013
Perspective: Materials informatics and big data: Realization of the “fourth paradigm” of science in materials science journal April 2016
The Deformation and Ageing of Mild Steel: III Discussion of Results journal September 1951
Chemical named entities recognition: a review on approaches and applications journal April 2014
Effect of severe plastic deformation on tensile and fatigue properties of fine-grained magnesium alloy ZK60 journal July 2017
Big data are shaping the future of materials science journal August 2013
Autogenerated databases of yield strength and grain size using ChemDataExtractor dataset January 2022