HT Model Dataset
Abstract
This website contains the dataset that was used for writing the manuscript "HT Model: Using the Molecular Transformer for predicting hydrotreating reactions" (PNNL-SA-186589) The dataset includes a collection of hydrotreating reactions compiled from 41 peer-reviewed literature sources. These sources contain experimental data related to hydrotreating reactions. These reactions involve the reaction of chemical compounds with hydrogen gas in the presence of a catalyst to remove heteroatoms or to convert specific functional groups. The dataset contains reactions both with and without reaction conditions. Reaction conditions refer to the specific parameters under which the reaction takes place, such as temperature and pressure. For each reaction, the dataset includes both SMILES and SELFIES representations. SMILES (Simplified Molecular Input Line Entry System) and SELFIES (SELF-referencIng Embedded Strings) are two popular notations used to represent chemical structures in a compact and standardized format. The dataset was created with the aim of training a predictive model, specifically using the Molecular Transformer architecture.
- Authors:
-
- PNNL
- Publication Date:
- DOE Contract Number:
- AC05-76RL01830
- Research Org.:
- Pacific Northwest National Laboratory 2
- Sponsoring Org.:
- DOE
- Subject:
- bioprocessing, hydrotreating, deep learning algorithms, reaction network
- OSTI Identifier:
- 1994205
- DOI:
- https://doi.org/10.25584/1994205
Citation Formats
Eswaran, Sudha Cheranma Devi, Marrlett, Laura N, Harper, Juliette A, Anderson, Mariella S, Shapiro, Eric M, Kozyra, Patrick M, Starks, Alexzabria A, Olarte, Mariefel V, and Rallo Moya, Roberto J. HT Model Dataset. United States: N. p., 2023.
Web. doi:10.25584/1994205.
Eswaran, Sudha Cheranma Devi, Marrlett, Laura N, Harper, Juliette A, Anderson, Mariella S, Shapiro, Eric M, Kozyra, Patrick M, Starks, Alexzabria A, Olarte, Mariefel V, & Rallo Moya, Roberto J. HT Model Dataset. United States. doi:https://doi.org/10.25584/1994205
Eswaran, Sudha Cheranma Devi, Marrlett, Laura N, Harper, Juliette A, Anderson, Mariella S, Shapiro, Eric M, Kozyra, Patrick M, Starks, Alexzabria A, Olarte, Mariefel V, and Rallo Moya, Roberto J. 2023.
"HT Model Dataset". United States. doi:https://doi.org/10.25584/1994205. https://www.osti.gov/servlets/purl/1994205. Pub date:Mon Aug 07 04:00:00 UTC 2023
@article{osti_1994205,
title = {HT Model Dataset},
author = {Eswaran, Sudha Cheranma Devi and Marrlett, Laura N and Harper, Juliette A and Anderson, Mariella S and Shapiro, Eric M and Kozyra, Patrick M and Starks, Alexzabria A and Olarte, Mariefel V and Rallo Moya, Roberto J},
abstractNote = {This website contains the dataset that was used for writing the manuscript "HT Model: Using the Molecular Transformer for predicting hydrotreating reactions" (PNNL-SA-186589) The dataset includes a collection of hydrotreating reactions compiled from 41 peer-reviewed literature sources. These sources contain experimental data related to hydrotreating reactions. These reactions involve the reaction of chemical compounds with hydrogen gas in the presence of a catalyst to remove heteroatoms or to convert specific functional groups. The dataset contains reactions both with and without reaction conditions. Reaction conditions refer to the specific parameters under which the reaction takes place, such as temperature and pressure. For each reaction, the dataset includes both SMILES and SELFIES representations. SMILES (Simplified Molecular Input Line Entry System) and SELFIES (SELF-referencIng Embedded Strings) are two popular notations used to represent chemical structures in a compact and standardized format. The dataset was created with the aim of training a predictive model, specifically using the Molecular Transformer architecture.},
doi = {10.25584/1994205},
journal = {},
number = ,
volume = ,
place = {United States},
year = {Mon Aug 07 04:00:00 UTC 2023},
month = {Mon Aug 07 04:00:00 UTC 2023}
}
