A thermoelectric materials database auto-generated from the scientific literature using ChemDataExtractor
- Univ. of Cambridge (United Kingdom)
- Univ. of Cambridge (United Kingdom); Science and Technology Facilities Council (STFC), Oxford (United Kingdom). Rutherford Appleton Lab., ISIS Neutron Source
An auto-generated thermoelectric-materials database is presented, containing 22,805 data records, automatically generated from the scientific literature, spanning 10,641 unique extracted chemical names. Each record contains a chemical entity and one of the seminal thermoelectric properties: thermoelectric figure of merit, ZT; thermal conductivity, κ; Seebeck coefficient, S; electrical conductivity, σ; power factor, PF; each linked to their corresponding recorded temperature, T. The database was auto-generated using the automatic sentence-parsing capabilities of the chemistry-aware, natural language processing toolkit, ChemDataExtractor 2.0, adapted for application in the thermoelectric-materials domain, following a rule-based sentence-simplification step. Data were mined from the text of 60,843 scientific papers that were sourced from three scientific publishers: Elsevier, the Royal Society of Chemistry, and Springer. To the best of our knowledge, this is the first automatically-generated database of thermoelectric materials and their properties from existing literature. The database was evaluated to have a precision of 82.25% and has been made publicly available to facilitate the application of data science in the thermoelectric-materials domain, for analysis, design, and prediction.
- Research Organization:
- Argonne National Laboratory (ANL), Argonne, IL (United States). Argonne Leadership Computing Facility (ALCF)
- Sponsoring Organization:
- USDOE Office of Science (SC), Basic Energy Sciences (BES). Scientific User Facilities (SUF)
- Grant/Contract Number:
- AC02-06CH11357
- OSTI ID:
- 2423219
- Journal Information:
- Scientific Data, Journal Name: Scientific Data Journal Issue: 1 Vol. 9; ISSN 2052-4463
- Publisher:
- Nature Publishing GroupCopyright Statement
- Country of Publication:
- United States
- Language:
- English
Similar Records
A database of thermally activated delayed fluorescent molecules auto-generated from scientific literature with ChemDataExtractor
A Database of Stress-Strain Properties Auto-generated from the Scientific Literature using ChemDataExtractor
A database of battery materials auto-generated using ChemDataExtractor
Journal Article
·
Tue Jan 16 19:00:00 EST 2024
· Scientific Data
·
OSTI ID:2469489
A Database of Stress-Strain Properties Auto-generated from the Scientific Literature using ChemDataExtractor
Journal Article
·
Fri Nov 22 19:00:00 EST 2024
· Scientific Data
·
OSTI ID:2478567
A database of battery materials auto-generated using ChemDataExtractor
Journal Article
·
Wed Aug 05 20:00:00 EDT 2020
· Scientific Data
·
OSTI ID:1816494