Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

High-throughput virtual laboratory for drug discovery using massive datasets

Journal Article · · International Journal of High Performance Computing Applications
 [1];  [1];  [1];  [2];  [2];  [3];  [3];  [4];  [5];  [6];  [6];  [3]
  1. National Center for Computational Sciences, Oak Ridge National Laboratory, Oak Ridge, TN, USA
  2. NVIDIA Corporation, Santa Clara, CA, USA
  3. Computer Science and Mathematics Division, Oak Ridge National Laboratory, Oak Ridge, TN, USA
  4. Jubilee Development, Cambridge, MA, USA
  5. Scripps Research, San Diego, CA, USA
  6. Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, USA

Time-to-solution for structure-based screening of massive chemical databases for COVID-19 drug discovery has been decreased by an order of magnitude, and a virtual laboratory has been deployed at scale on up to 27,612 GPUs on the Summit supercomputer, allowing an average molecular docking of 19,028 compounds per second. Over one billion compounds were docked to two SARS-CoV-2 protein structures with full optimization of ligand position and 20 poses per docking, each in under 24 hours. GPU acceleration and high-throughput optimizations of the docking program produced 350× mean speedup over the CPU version (50× speedup per node). GPU acceleration of both feature calculation for machine-learning based scoring and distributed database queries reduced processing of the 2.4 TB output by orders of magnitude. The resulting 50× speedup for the full pipeline reduces an initial 43 day runtime to 21 hours per protein for providing high-scoring compounds to experimental collaborators for validation assays.

Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE; USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
Grant/Contract Number:
AC05-00OR22725
OSTI ID:
1772255
Alternate ID(s):
OSTI ID: 1811403
Journal Information:
International Journal of High Performance Computing Applications, Journal Name: International Journal of High Performance Computing Applications Journal Issue: 5 Vol. 35; ISSN 1094-3420
Publisher:
SAGE PublicationsCopyright Statement
Country of Publication:
United States
Language:
English

References (33)

FireWorks: a dynamic workflow system designed for high-throughput applications: FireWorks: A Dynamic Workflow System Designed for High-Throughput Applications journal May 2015
AutoDock4 and AutoDockTools4: Automated docking with selective receptor flexibility journal December 2009
AutoDock Vina: Improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading journal January 2009
DOCK 6: Impact of new features and current docking performance journal April 2015
Rapid Identification of Potential Inhibitors of SARS‐CoV‐2 Main Protease by Deep Docking of 1.3 Billion Compounds journal March 2020
From machine learning to deep learning: Advances in scoring functions for protein–ligand docking journal June 2019
D3R Grand Challenge 4: prospective pose prediction of BACE1 ligands with AutoDock-GPU journal November 2019
Selecting machine-learning scoring functions for structure-based virtual screening journal December 2019
Improved Method of Structure-Based Virtual Screening via Interaction-Energy-Based Learning journal October 2018
Deep Docking: A Deep Learning Platform for Augmentation of Structure Based Drug Discovery journal May 2020
Rational design of quinazoline-based irreversible inhibitors of human erythrocyte purine nucleoside phosphorylase journal August 1991
FRED Pose Prediction and Virtual Screening Accuracy journal February 2011
Accelerating Molecular Docking Calculations Using Graphics Processing Units journal March 2011
Stochastic Voyages into Uncharted Chemical Space Produce a Representative Library of All Possible Drug-Like Compounds journal May 2013
Using shape complementarity as an initial screen in designing ligands for a receptor binding site of known three-dimensional structure journal April 1988
Glide:  A New Approach for Rapid, Accurate Docking and Scoring. 1. Method and Assessment of Docking Accuracy journal March 2004
Rational design of potent sialidase-based inhibitors of influenza virus replication journal June 1993
Structural plasticity of SARS-CoV-2 3CL Mpro active site cavity revealed by room temperature X-ray crystallography journal June 2020
Ultra-large library docking for discovering new chemotypes journal February 2019
An open-source drug discovery platform enables ultra-large virtual screens journal March 2020
Protein-Ligand Blind Docking Using QuickVina-W With Inter-Process Spatio-Temporal Integration journal November 2017
Performance of machine-learning scoring functions in structure-based virtual screening journal April 2017
How to Discover Antiviral Drugs Quickly journal June 2020
Machine learning classification can reduce false positives in structure-based virtual screening journal July 2020
Fast, accurate, and reliable molecular docking with QuickVina 2 journal February 2015
GPU-Accelerated Drug Discovery with Docking on the Summit Supercomputer: Porting, Optimization, and Application to COVID-19 Research
  • LeGrand, Scott; Scheinberg, Aaron; Tillack, Andreas F.
  • BCB '20: 11th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics, Proceedings of the 11th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics https://doi.org/10.1145/3388440.3412472
conference September 2020
Empirical evaluation across multiple GPU-accelerated DBMSes
  • Chu, Hawon; Kim, Seounghyun; Lee, Joo-Young
  • SIGMOD/PODS '20: International Conference on Management of Data, Proceedings of the 16th International Workshop on Data Management on New Hardware https://doi.org/10.1145/3399666.3399907
conference June 2020
High performance in silico virtual drug screening on many-core processors journal April 2014
Multilevel Parallelization of AutoDock 4.2 journal April 2011
Open Drug Discovery Toolkit (ODDT): a new open-source player in the drug discovery field journal June 2015
rDock: A Fast, Versatile and Open Source Program for Docking Ligands to Proteins and Nucleic Acids journal April 2014
Ligand Pose and Orientational Sampling in Molecular Docking journal October 2013
GeauxDock: Accelerating Structure-Based Virtual Screening with Heterogeneous Computing journal July 2016

Similar Records

Supercomputer-Based Ensemble Docking Drug Discovery Pipeline with Application to Covid-19
Journal Article · Tue Dec 15 23:00:00 EST 2020 · Journal of Chemical Information and Modeling · OSTI ID:1778008

Supercomputer-Based Ensemble Docking Drug Discovery Pipeline with Application to Covid-19
Journal Article · Tue Dec 15 19:00:00 EST 2020 · Journal of Chemical Information and Modeling · OSTI ID:1755144