DOE PAGES, U.S. Department of Energy
Office of Scientific and Technical Information

Title: CoarsenConf: Equivariant Coarsening with Aggregated Attention for Molecular Conformer Generation

Journal Article · Journal of Chemical Information and Modeling
[1]; [2]
  1. University of California, Berkeley, CA (United States); NVIDIA, Santa Clara, CA (United States)
  2. University of California, Berkeley, CA (United States)

Molecular conformer generation (MCG) is an important task in cheminformatics and drug discovery. The ability to efficiently generate low-energy 3D structures can avoid expensive quantum mechanical simulations, leading to accelerated virtual screenings and enhanced structural exploration. Several generative models have been developed for MCG, but many struggle to consistently produce high-quality conformers for meaningful downstream applications. To address this issue, we introduce CoarsenConf, which coarse-grains molecular graphs based on torsional angles and integrates them into an SE(3)-equivariant hierarchical variational autoencoder. Through equivariant coarse-graining, we aggregate the fine-grained atomic coordinates of subgraphs connected via rotatable bonds, creating a variable-length coarse-grained latent representation. Our model uses a novel aggregated attention mechanism to restore fine-grained coordinates from the coarse-grained latent representation, enabling efficient generation of accurate conformers. Furthermore, we evaluate the chemical and biochemical quality of our generated conformers on multiple downstream applications, including property prediction and large-scale oracle-based protein docking. Overall, CoarsenConf generates more accurate conformer ensembles than prior generative models.
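The coarse-graining step described in the abstract can be illustrated with a minimal sketch. It is not the CoarsenConf implementation: the function name `coarse_grain`, its arguments, and the use of a plain centroid as the aggregation are all assumptions made for illustration (the abstract does not state how the model aggregates coordinates). The sketch cuts the molecular graph at its rotatable bonds, treats each remaining connected subgraph as one coarse-grained bead, and averages that subgraph's atomic coordinates. A centroid rotates and translates together with the atoms, and the number of beads depends on the number of rotatable bonds, which gives a variable-length representation.

```python
from collections import defaultdict


def coarse_grain(coords, bonds, rotatable_bonds):
    """Hypothetical helper: split a molecule at rotatable bonds, one bead per fragment.

    coords: list of (x, y, z) tuples, one per atom.
    bonds: list of (i, j) atom-index pairs.
    rotatable_bonds: subset of bonds to cut (assumed to be given).
    Returns (fragments, beads): atom indices per fragment and fragment centroids.
    """
    rotatable = {frozenset(b) for b in rotatable_bonds}

    # Build the adjacency list while skipping the rotatable bonds.
    adjacency = defaultdict(list)
    for i, j in bonds:
        if frozenset((i, j)) not in rotatable:
            adjacency[i].append(j)
            adjacency[j].append(i)

    # Connected components of the cut graph are the rigid fragments.
    fragments, seen = [], set()
    for start in range(len(coords)):
        if start in seen:
            continue
        seen.add(start)
        stack, fragment = [start], []
        while stack:
            atom = stack.pop()
            fragment.append(atom)
            for neighbor in adjacency[atom]:
                if neighbor not in seen:
                    seen.add(neighbor)
                    stack.append(neighbor)
        fragments.append(sorted(fragment))

    # Assumption: aggregate each fragment by its centroid (mean position).
    beads = [
        tuple(sum(coords[a][k] for a in frag) / len(frag) for k in range(3))
        for frag in fragments
    ]
    return fragments, beads


# Toy four-atom chain with one rotatable central bond (illustrative data only).
coords = [(0, 0, 0), (1, 0, 0), (2, 0, 0), (3, 0, 0)]
bonds = [(0, 1), (1, 2), (2, 3)]
fragments, beads = coarse_grain(coords, bonds, rotatable_bonds=[(1, 2)])
print(fragments)  # [[0, 1], [2, 3]]
print(beads)      # [(0.5, 0.0, 0.0), (2.5, 0.0, 0.0)]
```

In practice the rotatable bonds would come from a cheminformatics toolkit rather than being listed by hand. Restoring fine-grained coordinates from the beads is what the abstract attributes to the aggregated attention mechanism; the sketch does not cover that step.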

Research Organization:
University of California, Berkeley, CA (United States)
Sponsoring Organization:
USDOE; USDOE Office of Science (SC), Basic Energy Sciences (BES)
Grant/Contract Number:
AC02-05CH11231
OSTI ID:
2481610
Journal Information:
Journal of Chemical Information and Modeling, Vol. 65, Issue 1; ISSN 1549-9596
Publisher:
American Chemical Society
Country of Publication:
United States
Language:
English
