skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Supercomputer-Based Ensemble Docking Drug Discovery Pipeline with Application to Covid-19

Journal Article · · Journal of Chemical Information and Modeling
ORCiD logo [1]; ORCiD logo [2];  [3]; ORCiD logo [4]; ORCiD logo [3]; ORCiD logo [3];  [4];  [5]; ORCiD logo [3]; ORCiD logo [2];  [3]; ORCiD logo [6];  [2];  [7]; ORCiD logo [8]; ORCiD logo [3]; ORCiD logo [1];  [9]; ORCiD logo [3]; ORCiD logo [2] more »; ORCiD logo [3]; ORCiD logo [3];  [10]; ORCiD logo [3];  [10]; ORCiD logo [2];  [3];  [5]; ORCiD logo [2];  [1]; ORCiD logo [2];  [10];  [5]; ORCiD logo [11]; ORCiD logo [3];  [8];  [12]; ORCiD logo [3]; ORCiD logo [2]; ORCiD logo [2]; ORCiD logo [2];  [5];  [3]; ORCiD logo [3];  [8]; ORCiD logo [3];  [2]; ORCiD logo [3];  [5];  [13]; ORCiD logo [14] « less
  1. Georgia Institute of Technology, Atlanta, GA (United States)
  2. Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States); Univ. of Tennessee, Knoxville, TN (United States)
  3. Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)
  4. Univ. of Alabama, Huntsville, AL (United States)
  5. Brookhaven National Lab. (BNL), Upton, NY (United States)
  6. Univ. of L'Aquila (Italy)
  7. Univ. of Kentucky, Lexington, KY (United States)
  8. The Scripps Research Inst., La Jolla, CA (United States)
  9. Amazon Web Services, Seattle, WA (United States)
  10. NVIDIA Corp., Santa Clara, CA (United States)
  11. Argonne National Lab. (ANL), Lemont, IL (United States)
  12. Jubilee Development, Cambridge, MA (United States)
  13. City Univ. of New York (CUNY), NY (United States)
  14. CNR Inst. of Nanoscience, Modena (Italy)

In this work, we present a supercomputer-driven pipeline for in silico drug discovery using enhanced sampling molecular dynamics (MD) and ensemble docking. Ensemble docking makes use of MD results by docking compound databases into representative protein binding-site conformations, thus taking into account the dynamic properties of the binding sites. We also describe preliminary results obtained for 24 systems involving eight proteins of the proteome of SARS-CoV-2. The MD involves temperature replica exchange enhanced sampling, making use of massively parallel supercomputing to quickly sample the configurational space of protein drug targets. Using the Summit supercomputer at the Oak Ridge National Laboratory, more than 1 ms of enhanced sampling MD can be generated per day. We have ensemble docked repurposing databases to 10 configurations of each of the 24 SARS-CoV-2 systems using AutoDock Vina. Comparison to experiment demonstrates remarkably high hit rates for the top scoring tranches of compounds identified by our ensemble approach. We also demonstrate that, using Autodock-GPU on Summit, it is possible to perform exhaustive docking of one billion compounds in under 24 h. Finally, we discuss preliminary results and planned improvements to the pipeline, including the use of quantum mechanical (QM), machine learning, and artificial intelligence (AI) methods to cluster MD trajectories and rescore docking poses.

Research Organization:
Argonne National Lab. (ANL), Argonne, IL (United States); Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States). Oak Ridge Leadership Computing Facility (OLCF); Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). National Energy Research Scientific Computing Center (NERSC)
Sponsoring Organization:
USDOE Office of Science (SC); National Institutes of Health (NIH); National Science Foundation (NSF)
DOE Contract Number:
AC02-06CH11357; AC05-00OR22725; AC02-05CH11231; NIH R01-AI148740; 2017219379
OSTI ID:
1778008
Journal Information:
Journal of Chemical Information and Modeling, Vol. 60, Issue 12; ISSN 1549-9596
Publisher:
American Chemical Society
Country of Publication:
United States
Language:
English

References (133)

Constant pressure molecular dynamics for molecular systems journal December 1983
Structure based virtual screening identifies small molecule effectors for the sialoglycan binding protein Hsa journal October 2020
TopoGromacs: Automated Topology Conversion from CHARMM to GROMACS within VMD journal May 2016
Research and Development on Therapeutic Agents and Vaccines for COVID-19 and Related Human Coronavirus Diseases journal March 2020
Parallel tempering algorithm for conformational studies of biological molecules journal December 1997
The polarizable continuum model (PCM) interfaced with the fragment molecular orbital method (FMO) journal January 2006
Evidence for Osteocalcin Binding and Activation of GPRC6A in β-Cells journal May 2016
Modeling Structural Coordination and Ligand Binding in Zinc Proteins with a Polarizable Potential journal March 2012
GROMACS 4:  Algorithms for Highly Efficient, Load-Balanced, and Scalable Molecular Simulation journal February 2008
Identification and Structure–Activity Relationships of Novel Compounds that Potentiate the Activities of Antibiotics in Escherichia coli journal July 2017
Ensemble Docking in Drug Discovery: How Many Protein Configurations from Molecular Dynamics Simulations are Needed To Reproduce Known Ligand Binding? journal January 2019
Biosynthesis, Purification, and Substrate Specificity of Severe Acute Respiratory Syndrome Coronavirus 3C-like Proteinase journal October 2003
Comparative Assessment of Scoring Functions on a Diverse Test Set journal April 2009
DeepDriveMD: Deep-Learning Driven Adaptive Molecular Simulations for Protein Folding conference November 2019
The PDBbind Database:  Collection of Binding Affinities for Protein−Ligand Complexes with Known Three-Dimensional Structures journal June 2004
Structure of Main Protease from Human Coronavirus NL63: Insights for Wide Spectrum Anti-Coronavirus Drug Design journal March 2016
Implications of protein flexibility for drug discovery journal July 2003
A semiempirical free energy force field with charge-based desolvation journal January 2007
Structural basis for inhibition of the RNA-dependent RNA polymerase from SARS-CoV-2 by remdesivir journal May 2020
GROMACS: High performance molecular simulations through multi-level parallelism from laptops to supercomputers journal September 2015
Comparative Assessment of Scoring Functions on an Updated Benchmark: 1. Compilation of the Test Set journal May 2014
Coronavirus non-structural protein 16: Evasion, attenuation, and possible treatments journal December 2014
Deep clustering of protein folding simulations journal December 2018
Replica-exchange molecular dynamics method for protein folding journal November 1999
Grid-Based Hydrogen Bond Potentials with Improved Directionality journal April 2004
Coronaviruses — drug discovery and therapeutic options journal February 2016
How to Discover Antiviral Drugs Quickly journal June 2020
The SARS coronavirus nucleocapsid protein – Forms and functions journal March 2014
Dexamethasone in the management of covid -19 journal July 2020
Crystal structures of SARS-CoV-2 ADP-ribose phosphatase: from the apo form to ligand complexes journal July 2020
CysxHisy–Zn2+ interactions: Possibilities and limitations of a simple pairwise force field journal March 2006
Structure of Mpro from SARS-CoV-2 and discovery of its inhibitors journal April 2020
Recent developments in the general atomic and molecular electronic structure system journal April 2020
Evidence for Substrate Binding-Induced Zwitterion Formation in the Catalytic Cys-His Dyad of the SARS-CoV Main Protease journal September 2014
Revealing the molecular mechanisms of proteolysis of SARS-CoV-2 M pro by QM/MM computational methods journal January 2020
Comparative Assessment of Scoring Functions on an Updated Benchmark: 2. Evaluation Methods and General Results journal May 2014
Computer simulations of alkali-acetate solutions: Accuracy of the forcefields in difference concentrations journal November 2017
CHARMM-GUI: A web-based graphical user interface for CHARMM journal March 2008
Canonical sampling through velocity rescaling journal January 2007
Rational design of potent sialidase-based inhibitors of influenza virus replication journal June 1993
SARS-CoV-2 Vaccines: Status Report journal April 2020
Polarizable Force Fields for Biomolecular Simulations: Recent Advances and Applications journal May 2019
Structural plasticity of SARS-CoV-2 3CL Mpro active site cavity revealed by room temperature X-ray crystallography journal June 2020
Method for Including the Dynamic Fluctuations of a Protein in Computer-Aided Drug Design journal December 1999
Molecular dynamics with coupling to an external bath journal October 1984
The fragment molecular orbital method: theoretical development, implementation in GAMESS, and applications: Fragment molecular orbital method in GAMESS journal June 2017
Cavity size in reaction field theory journal December 1998
RNA-virus proteases counteracting host innate immunity journal September 2017
Reducing safety-related drug attrition: the use of in vitro pharmacological profiling journal November 2012
Polymorphic transitions in single crystals: A new molecular dynamics method journal December 1981
Crystal Structure of the SARS-CoV-2 Non-structural Protein 9, Nsp9 journal July 2020
A temperature predictor for parallel tempering simulations journal January 2008
Biogenesis and Dynamics of the Coronavirus Replicative Structures journal November 2012
Enhanced sampling in molecular dynamics journal August 2019
Mechanisms of pH Regulation in the Regulated Secretory Pathway journal August 2001
Structural basis of RNA recognition by the SARS-CoV-2 nucleocapsid phosphoprotein journal December 2020
VinaMPI: Facilitating multiple receptor high-throughput virtual docking on high-performance computers: Software News And Updates journal June 2013
Reviving Antibiotics: Efflux Pump Inhibitors That Interact with AcrA, a Membrane Fusion Protein of the AcrAB-TolC Multidrug Efflux Pump journal November 2016
Ensemble docking to difficult targets in early-stage drug discovery: Methodology and application to fibroblast growth factor 23 journal November 2017
VMD: Visual molecular dynamics journal February 1996
Evaluation of AutoDock and AutoDock Vina on the CASF-2013 Benchmark journal July 2018
Enhanced sampling techniques in molecular dynamics simulations of biological systems journal May 2015
Potential therapeutic targets and promising drugs for combating SARS‐CoV‐2 journal June 2020
A Mathematical Theory of Communication journal July 1948
Ensemble Docking in Drug Discovery journal May 2018
Supercomputing Pipelines Search for Therapeutics Against COVID-19 journal January 2020
Rapid Identification of Potential Inhibitors of SARS‐CoV‐2 Main Protease by Deep Docking of 1.3 Billion Compounds journal March 2020
Parameters of Monovalent Ions in the AMBER-99 Forcefield:  Assessment of Inaccuracies and Proposed Improvements journal October 2007
KFC2: A knowledge-based hot spot prediction method based on interface solvation, atomic density, and plasticity features: KFC2: A Knowledge-Based Hot Spot Prediction Method journal July 2011
Polycystin-1 interacts with TAZ to stimulate osteoblastogenesis and inhibit adipogenesis journal November 2017
Virus factories, double membrane vesicles and viroplasm generated in animal cells journal November 2011
Optimization of parameters for molecular dynamics simulation using smooth particle-mesh Ewald in GROMACS 4.5 journal April 2011
Molecular dynamics analysis of the binding of human interleukin‐6 with interleukin‐6 α‐receptor journal September 2020
A Multibasic Cleavage Site in the Spike Protein of SARS-CoV-2 Is Essential for Infection of Human Lung Cells journal May 2020
Structure of coronavirus main proteinase reveals combination of a chymotrypsin fold with an extra alpha-helical domain journal July 2002
SuperDrug: a conformational drug database journal February 2005
Broad spectrum antiviral remdesivir inhibits human endemic and zoonotic deltacoronaviruses with a highly divergent RNA dependent RNA polymerase journal September 2019
Benchmarking Sets for Molecular Docking journal November 2006
Nsp3 of coronaviruses: Structures and functions of a large multi-domain protein journal January 2018
Prophylactic and therapeutic remdesivir (GS-5734) treatment in the rhesus macaque model of MERS-CoV infection journal February 2020
GROMACS: Fast, flexible, and free journal January 2005
Molecular dynamics study of human carbonic anhydrase II in complex with Zn2+ and acetazolamide on the basis of all-atom force field simulations journal July 2016
Mechanism of glucocerebrosidase activation and dysfunction in Gaucher disease unraveled by molecular dynamics and deep learning journal February 2019
Coronavirus Susceptibility to the Antiviral Remdesivir (GS-5734) Is Mediated by the Viral Polymerase and the Proofreading Exoribonuclease journal March 2018
P-LINCS:  A Parallel Linear Constraint Solver for Molecular Simulation journal December 2007
The anti-influenza virus drug, arbidol is an efficient inhibitor of SARS-CoV-2 in vitro journal May 2020
Cytoplasmic Viral Replication Complexes journal July 2010
The crystal structures of severe acute respiratory syndrome virus main protease and its complex with an inhibitor journal October 2003
The Probabilistic Relevance Framework: BM25 and Beyond journal January 2009
Improving AutoDock Vina Using Random Forest: The Growing Accuracy of Binding Affinity Prediction by the Effective Exploitation of Larger Data Sets journal February 2015
Viracept (Nelfinavir Mesylate, AG1343):  A Potent, Orally Bioavailable Inhibitor of HIV-1 Protease journal November 1997
SWEETLEAD: an In Silico Database of Approved Drugs, Regulated Chemicals, and Herbal Isolates for Computer-Aided Drug Discovery journal November 2013
Directory of Useful Decoys, Enhanced (DUD-E): Better Ligands and Decoys for Better Benchmarking journal July 2012
Improved Parametrization of Li + , Na + , K + , and Mg 2+ Ions for All-Atom Molecular Dynamics Simulations of Nucleic Acid Systems journal December 2011
iScreen: world’s first cloud-computing web server for virtual screening and de novo drug design based on TCM database@Taiwan journal June 2011
Biochemical and Structural Insights into the Mechanisms of SARS Coronavirus RNA Ribose 2′-O-Methylation by nsp16/nsp10 Protein Complex journal October 2011
Parallelization of the MAFFT multiple sequence alignment program journal April 2010
CHARMM36m: an improved force field for folded and intrinsically disordered proteins journal November 2016
Parallel tempering: Theory, applications, and new perspectives journal January 2005
How Efficient Is Replica Exchange Molecular Dynamics? An Analytic Approach journal February 2008
Therapy for Early COVID-19: A Critical Need journal December 2020
Comparison of affinity ranking using AutoDock-GPU and MM-GBSA scores for BACE-1 inhibitors in the D3R Grand Challenge 4 journal November 2019
Remdesivir for the Treatment of Covid-19 — Final Report journal November 2020
Using AutoDock for Ligand‐Receptor Docking journal December 2008
Characterization of influenza virus variants induced by treatment with the endonuclease inhibitor baloxavir marboxil journal June 2018
Identification of Binding Sites for Efflux Pump Inhibitors of the AcrAB-TolC Component AcrA journal February 2019
Crystal structure of SARS-CoV-2 main protease provides a basis for design of improved α-ketoamide inhibitors journal March 2020
An open-source drug discovery platform enables ultra-large virtual screens journal March 2020
The SARS-coronavirus papain-like protease: Structure, function and inhibition by designed antiviral compounds journal March 2015
Evolution of In Silico Strategies for Protein-Protein Interaction Drug Discovery journal August 2018
Multidimensional replica-exchange method for free-energy calculations journal October 2000
A computationally identified compound antagonizes excess FGF-23 signaling in renal tubules and a mouse model of hypophosphatemia journal November 2016
Best bang for your buck: GPU nodes for GROMACS biomolecular simulations journal August 2015
Neutralizing Antibodies against SARS-CoV-2 and Other Human Coronaviruses journal May 2020
Computationally identified novel agonists for GPRC6A journal April 2018
Structure of the SARS-CoV-2 spike receptor-binding domain bound to the ACE2 receptor journal March 2020
Crystal structure of Nsp15 endoribonuclease NendoU from SARS‐CoV ‐2 journal May 2020
Structural and Functional Evidence for Testosterone Activation of GPRC6A in Peripheral Tissues journal December 2015
Comparative therapeutic efficacy of remdesivir and combination lopinavir, ritonavir, and interferon beta against MERS-CoV journal January 2020
Ensemble-based docking: From hit discovery to metabolism and toxicity predictions journal October 2016
Rapid calculation of RMSDs using a quaternion-based characteristic polynomial journal June 2005
Structure-based design of broadly protective group a streptococcal M protein-based vaccines journal January 2017
PDB2PQR: expanding and upgrading automated preparation of biomolecular structures for molecular simulations journal May 2007
Prediction of peptide binding to MHC using machine learning with sequence and structure-based feature sets journal April 2020
Genome Composition and Divergence of the Novel Coronavirus (2019-nCoV) Originating in China journal March 2020
Crystal structure of SARS-CoV-2 nucleocapsid protein RNA binding domain reveals potential unique drug targeting sites journal April 2020
FAST: FAST Analysis of Sequences Toolbox journal May 2015
Density-Functional Tight-Binding Combined with the Fragment Molecular Orbital Method journal October 2014
The PDBbind Database:  Methodologies and Updates journal June 2005
SuperDRUG2: a one stop resource for approved/marketed drugs journal November 2017
A SARS-CoV-2 protein interaction map reveals targets for drug repurposing journal April 2020
Evaluating the Energy Efficiency of OpenCL-accelerated AutoDock Molecular Docking conference March 2020
GPU-Accelerated Drug Discovery with Docking on the Summit Supercomputer: Porting, Optimization, and Application to COVID-19 Research
  • LeGrand, Scott; Scheinberg, Aaron; Tillack, Andreas F.
  • BCB '20: 11th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics, Proceedings of the 11th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics https://doi.org/10.1145/3388440.3412472
conference September 2020

Similar Records

Supercomputer-Based Ensemble Docking Drug Discovery Pipeline with Application to Covid-19
Journal Article · Wed Dec 16 00:00:00 EST 2020 · Journal of Chemical Information and Modeling · OSTI ID:1778008

SARS-CoV2 billion-compound docking
Journal Article · Tue Mar 28 00:00:00 EDT 2023 · Scientific Data · OSTI ID:1778008

SARS-CoV2 Docking Dataset
Dataset · Thu May 27 00:00:00 EDT 2021 · OSTI ID:1778008