DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Protein-ligand binding affinity prediction using multi-instance learning with docking structures

Journal Article · · Frontiers in Pharmacology

Recent advances in 3D structure-based deep learning approaches demonstrate improved accuracy in predicting protein-ligand binding affinity in drug discovery. These methods complement physics-based computational modeling such as molecular docking for virtual high-throughput screening. Despite recent advances and improved predictive performance, most methods in this category primarily rely on utilizing co-crystal complex structures and experimentally measured binding affinities as both input and output data for model training. Nevertheless, co-crystal complex structures are not readily available and the inaccurate predicted structures from molecular docking can degrade the accuracy of the machine learning methods. We introduce a novel structure-based inference method utilizing multiple molecular docking poses for each complex entity. Our proposed method employs multi-instance learning with an attention network to predict binding affinity from a collection of docking poses. We validate our method using multiple datasets, including PDBbind and compounds targeting the main protease of SARS-CoV-2. The results demonstrate that our method leveraging docking poses is competitive with other state-of-the-art inference models that depend on co-crystal structures. This method offers binding affinity prediction without requiring co-crystal structures, thereby increasing its applicability to protein targets lacking such data.

Research Organization:
Lawrence Livermore National Laboratory (LLNL), Livermore, CA (United States)
Sponsoring Organization:
Defense Threat Reduction Agency (DTRA); USDOE National Nuclear Security Administration (NNSA)
Grant/Contract Number:
AC52-07NA27344
OSTI ID:
2516789
Report Number(s):
LLNL--JRNL-2000673
Journal Information:
Frontiers in Pharmacology, Journal Name: Frontiers in Pharmacology Vol. 15; ISSN 1663-9812
Publisher:
Frontiers Research FoundationCopyright Statement
Country of Publication:
United States
Language:
English

References (38)

AutoDock Vina: Improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading journal January 2009
Multiple instance learning: A survey of problem characteristics and applications journal May 2018
Improved Protein–Ligand Binding Affinity Prediction with Structure-Based Deep Fusion Inference journal March 2021
AutoDock Vina 1.2.0: New Docking Methods, Expanded Force Field, and Python Bindings journal July 2021
Pose Classification Using Three-Dimensional Atomic Structure-Based Neural Networks Applied to Ion Channel–Ligand Docking journal April 2022
HAC-Net: A Hybrid Attention-Based Convolutional Neural Network for Highly Accurate Protein–Ligand Binding Affinity Prediction journal March 2023
A Small Step Toward Generalizability: Training a Machine Learning Scoring Function for Structure-Based Virtual Screening journal May 2023
graphLambda: Fusion Graph Neural Networks for Binding Affinity Prediction journal February 2024
Guided Docking as a Data Generation Approach Facilitates Structure-Based Machine Learning on Kinases journal May 2024
Protein–Ligand Scoring with Convolutional Neural Networks journal April 2017
K DEEP : Protein–Ligand Absolute Binding Affinity Prediction via 3D-Convolutional Neural Networks journal January 2018
Comparative Assessment of Scoring Functions: The CASF-2016 Update journal November 2018
AMPL: A Data-Driven Modeling Pipeline for Drug Discovery journal April 2020
InteractionGraphNet: A Novel and Efficient Deep Graph Representation Learning Framework for Accurate Protein–Ligand Interaction Predictions journal December 2021
PotentialNet for Molecular Property Prediction journal November 2018
SS-GNN: A Simple-Structured Graph Neural Network for Affinity Prediction journal June 2023
Extended-Connectivity Fingerprints journal April 2010
Toward Fully Automated High Performance Computing Drug Discovery: A Massively Parallel Virtual Screening Pipeline for Docking and Molecular Mechanics/Generalized Born Surface Area Rescoring to Improve Enrichment journal December 2013
Glide:  A New Approach for Rapid, Accurate Docking and Scoring. 1. Method and Assessment of Docking Accuracy journal March 2004
Glide:  A New Approach for Rapid, Accurate Docking and Scoring. 2. Enrichment Factors in Database Screening journal March 2004
A geometric deep learning approach to predict binding conformations of bioactive molecules journal December 2021
Discovery of SARS-CoV-2 main protease inhibitors using a synthesis-directed de novo design model journal January 2021
Pretrained transformer models for predicting the withdrawal of drugs from the market journal August 2023
DeepSite: protein-binding site predictor using 3D-convolutional neural networks journal May 2017
Development and evaluation of a deep learning model for protein–ligand binding affinity prediction journal May 2018
Protein Data Bank: the single global archive for 3D macromolecular structure data journal October 2018
Open Babel: An open chemical toolbox journal October 2011
Open Drug Discovery Toolkit (ODDT): a new open-source player in the drug discovery field journal June 2015
Mordred: a molecular descriptor calculator journal February 2018
GNINA 1.0: molecular docking with deep learning journal June 2021
Effective Approaches to Attention-based Neural Machine Translation conference January 2015
A Comprehensive Review on Multiple Instance Learning journal October 2023
Enhancing Docking Accuracy with PECAN2, a 3D Atomic Neural Network Trained without Co-Complex Crystal Structures journal March 2024
Attention-based Deep Multiple Instance Learning preprint January 2018
Step Change Improvement in ADMET Prediction with PotentialNet Deep Featurization preprint January 2019
Self-Supervised Graph Transformer on Large-Scale Molecular Data preprint January 2020
ChemBERTa: Large-Scale Self-Supervised Pretraining for Molecular Property Prediction preprint January 2020
DeepBindRG: a deep learning based method for estimating effective protein–ligand affinity journal January 2019

Similar Records

Related Subjects