Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

SARS-CoV2 Docking Dataset for MLMol Language Model (50M)

Dataset ·
This is a processed molecular dataset from this https://doi.ccs.ornl.gov/ui/doi/348 adding up to 50M molecules for the training and 486K molecules for the validation. Instructions on how to use/run/train this dataset can be found here: https://code.ornl.gov/candle/mlmol
Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States). Oak Ridge Leadership Computing Facility (OLCF); Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
Office of Science (SC)
DOE Contract Number:
AC05-00OR22725
OSTI ID:
1868526
Country of Publication:
United States
Language:
English

Similar Records

SARS-CoV2 Docking MLMol Benchmark Dataset
Dataset · Fri May 20 00:00:00 EDT 2022 · OSTI ID:1868058

SARS-CoV2 Docking Dataset
Dataset · Thu May 27 00:00:00 EDT 2021 · OSTI ID:1783186

Related Subjects