Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Comparative dataset of experimental and computational attributes of UV/vis absorption spectra

Journal Article · · Scientific Data
 [1];  [2];  [2];  [2];  [3]
  1. Univ. of Cambridge (United Kingdom); Science and Technology Facilities Council (STFC), Didcot (United Kingdom). Rutherford Appleton Lab. (RAL); DOE/OSTI
  2. Argonne National Lab. (ANL), Lemont, IL (United States)
  3. Univ. of Cambridge (United Kingdom); Science and Technology Facilities Council (STFC), Didcot (United Kingdom). Rutherford Appleton Lab., ISIS Neutron Source; Argonne National Lab. (ANL), Lemont, IL (United States)
The ability to auto-generate databases of optical properties holds great prospects in data-driven materials discovery for optoelectronic applications. We present a cognate set of experimental and computational data that describes key features of optical absorption spectra. This includes an auto-generated database of 18,309 records of experimentally determined UV/vis absorption maxima, λmax, and associated extinction coefficients, ϵ, where present. This database was produced using the text-mining toolkit, ChemDataExtractor, on 402,034 scientific documents. High-throughput electronic-structure calculations using fast (simplified Tamm-Dancoff approach) and traditional (time-dependent) density functional theory were executed to predict λmax and oscillation strengths, f (related to ϵ) for a subset of validated compounds. Paired quantities of these computational and experimental data show strong correlations in λmax, f and ϵ, laying the path for reliable in silico calculations of additional optical properties. The total dataset of 8,488 unique compounds and a subset of 5,380 compounds with experimental and computational data, are available in MongoDB, CSV and JSON formats. These can be queried using Python, R, Java, and MATLAB, for data-driven optoelectronic materials discovery.
Research Organization:
Argonne National Laboratory (ANL), Argonne, IL (United States)
Sponsoring Organization:
STFC Rutherford Appleton Laboratory (RAL); Tessella; USDOE Office of Science (SC), Basic Energy Sciences (BES)
Grant/Contract Number:
AC02-06CH11357
OSTI ID:
1607383
Journal Information:
Scientific Data, Journal Name: Scientific Data Journal Issue: 1 Vol. 6; ISSN 2052-4463
Publisher:
Nature Publishing GroupCopyright Statement
Country of Publication:
United States
Language:
English

References (34)

Design-to-Device Approach Affords Panchromatic Co-Sensitized Solar Cells journal December 2018
Optimization of parameters for semiempirical methods VI: more modifications to the NDDO approximations and re-optimization of parameters journal November 2012
NWChem: A comprehensive and scalable open-source solution for large scale molecular simulations journal September 2010
SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules journal February 1988
Chemical Name to Structure: OPSIN, an Open Source Solution journal February 2011
Freely Available Conformer Generation Methods: How Good Are They? journal April 2012
Solvent Effects on the UV–vis Absorption and Emission of Optoelectronic Coumarins: a Comparison of Three Empirical Solvatochromic Models journal July 2013
Unprecedented generation of 3D heterostructures by mechanochemical disassembly and re-ordering of incommensurate metal chalcogenides journal June 2020
The Harvard organic photovoltaic dataset journal September 2016
Accelerated computational discovery of high-performance materials for organic photovoltaics by means of cheminformatics journal January 2011
A simplified Tamm-Dancoff density functional approach for the electronic excitation spectra of very large molecules journal June 2013
Consistent structures and interactions by density functional theory with small atomic orbital basis sets journal August 2015
ChemicalTagger: A tool for semantic text-mining in chemistry journal May 2011
The dye-sensitized solar cell database journal April 2018
Auto-generated materials database of Curie and Néel temperatures via semi-supervised relationship extraction. journalarticle January 2018
NIST Chemistry WebBook, NIST Standard Reference Database 69 dataset January 1997
Comparative dataset of experimental and computational attributes of UV/vis absorption spectra dataset January 2019
PM3 geometry optimization and CNDO/S-CI computation of UV/Vis spectra of large organic structures: Program description and application to poly(triacetylene) hexamer and taxotere journal March 1999
The ORCA program system: The ORCA program system journal June 2011
ChemDataExtractor: A Toolkit for Automated Extraction of Chemical Information from the Scientific Literature journal October 2016
Machine-learned and codified synthesis parameters of oxide materials journal September 2017
Auto-generated materials database of Curie and Néel temperatures via semi-supervised relationship extraction journal June 2018
Weaver's historic accessible collection of synthetic dyes: a cheminformatics analysis journal January 2017
Commentary: The Materials Project: A materials genome approach to accelerating materials innovation journal July 2013
Inhomogeneous Electron Gas journal November 1964
Self-Consistent Equations Including Exchange and Correlation Effects journal November 1965
Open Babel: An open chemical toolbox journal October 2011
CRC Standard Probability and Statistics Tables and Formulae book December 1999
NWChem: A comprehensive and scalable open-source solution for large scale molecular simulations dataset January 2019
NIST Chemistry WebBook, NIST Standard Reference Database 69 dataset January 1997
Metadata record for: Comparative dataset of experimental and computational attributes of UV/vis absorption spectra dataset January 2020
Metadata record for: Comparative dataset of experimental and computational attributes of UV/vis absorption spectra dataset January 2020
Comparative dataset of experimental and computational attributes of UV/vis absorption spectra dataset January 2019
Auto-generated materials database of Curie and Néel temperatures via semi-supervised relationship extraction [Supplemental Data] dataset June 2018


Similar Records

Auto-generated materials database of Curie and Néel temperatures via semi-supervised relationship extraction
Journal Article · Mon Jun 18 20:00:00 EDT 2018 · Scientific Data · OSTI ID:1460724

Perovskite- and Dye-Sensitized Solar-Cell Device Databases Auto-generated Using ChemDataExtractor
Journal Article · Thu Jun 16 20:00:00 EDT 2022 · Scientific Data · OSTI ID:2469490

A database of refractive indices and dielectric constants auto-generated using ChemDataExtractor
Journal Article · Mon May 02 20:00:00 EDT 2022 · Scientific Data · OSTI ID:1982166