DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: ZMPY3D: accelerating protein structure volume analysis through vectorized 3D Zernike moments and Python-based GPU integration

Journal Article · · Bioinformatics Advances

Abstract Motivation Volumetric 3D object analyses are being applied in research fields such as structural bioinformatics, biophysics, and structural biology, with potential integration of artificial intelligence/machine learning (AI/ML) techniques. One such method, 3D Zernike moments, has proven valuable in analyzing protein structures (e.g., protein fold classification, protein–protein interaction analysis, and molecular dynamics simulations). Their compactness and efficiency make them amenable to large-scale analyses. Established methods for deriving 3D Zernike moments, however, can be inefficient, particularly when higher order terms are required, hindering broader applications. As the volume of experimental and computationally-predicted protein structure information continues to increase, structural biology has become a “big data” science requiring more efficient analysis tools. Results This application note presents a Python-based software package, ZMPY3D, to accelerate computation of 3D Zernike moments by vectorizing the mathematical formulae and using graphical processing units (GPUs). The package offers popular GPU-supported libraries such as CuPy and TensorFlow together with NumPy implementations, aiming to improve computational efficiency, adaptability, and flexibility in future algorithm development. The ZMPY3D package can be installed via PyPI, and the source code is available from GitHub. Volumetric-based protein 3D structural similarity scores and transform matrix of superposition functionalities have both been implemented, creating a powerful computational tool that will allow the research community to amalgamate 3D Zernike moments with existing AI/ML tools, to advance research and education in protein structure bioinformatics. Availability and implementation ZMPY3D, implemented in Python, is available on GitHub (https://github.com/tawssie/ZMPY3D) and PyPI, released under the GPL License.

Sponsoring Organization:
USDOE
Grant/Contract Number:
SC0019749
OSTI ID:
2426412
Journal Information:
Bioinformatics Advances, Journal Name: Bioinformatics Advances Journal Issue: 1 Vol. 4; ISSN 2635-0041
Publisher:
Oxford University PressCopyright Statement
Country of Publication:
United Kingdom
Language:
English

References (20)

Efficient spherical harmonic transforms aimed at pseudospectral numerical simulations: EFFICIENT SPHERICAL HARMONIC TRANSFORM journal March 2013
Capturing the geometry, function, and evolution of enzymes with 3D templates journal June 2022
Complete Moment Invariants and Pose Determination for Orthogonal Transformations of 3D Objects book January 1996
Binding site identification of G protein-coupled receptors through a 3D Zernike polynomials-based method: application to C. elegans olfactory receptors journal January 2022
Deep learning for reconstructing protein structures from cryo-EM density maps: Recent advances and future directions journal April 2023
Quantitative Characterization of Binding Pockets and Binding Complementarity by Means of Zernike Descriptors journal February 2020
Array programming with NumPy journal September 2020
Highly accurate protein structure prediction with AlphaFold journal July 2021
Protein secondary structure detection in intermediate-resolution cryo-EM maps using deep learning journal July 2019
Real-time structure search and structure classification for AlphaFold protein models journal April 2022
Zernike polynomials and their applications journal November 2022
ZEAL: protein structure alignment based on shape similarity journal March 2021
Antibody interface prediction with 3D Zernike descriptors and SVM journal November 2018
RCSB Protein Data Bank (RCSB.org): delivery of experimentally-determined PDB structures alongside one million computed structure models of proteins from artificial intelligence/machine learning journal November 2022
Accurate prediction of protein structures and interactions using a three-track neural network journal July 2021
An Algorithm for Fast Computation of 3D Zernike Moments for Volumetric Images journal January 2012
Protein-protein docking using region-based 3D Zernike descriptors journal January 2009
Exploring the potential of 3D Zernike descriptors and SVM for protein–protein interface prediction journal February 2018
Real time structural search of the Protein Data Bank journal July 2020
Multiple Subunit Fitting into a Low-Resolution Density Map of a Macromolecular Complex Using a Gaussian Mixture Model journal November 2008

Related Subjects