DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: pKPDB: a protein data bank extension database of p Ka and pI theoretical values

Journal Article · · Bioinformatics

Abstract Summary pKa values of ionizable residues and isoelectric points of proteins provide valuable local and global insights about their structure and function. These properties can be estimated with reasonably good accuracy using Poisson–Boltzmann and Monte Carlo calculations at a considerable computational cost (from some minutes to several hours). pKPDB is a database of over 12 M theoretical pKa values calculated over 120k protein structures deposited in the Protein Data Bank. By providing precomputed pKa and pI values, users can retrieve results instantaneously for their protein(s) of interest while also saving countless hours and resources that would be spent on repeated calculations. Furthermore, there is an ever-growing imbalance between experimental pKa and pI values and the number of resolved structures. This database will complement the experimental and computational data already available and can also provide crucial information regarding buried residues that are under-represented in experimental measurements. Availability and implementation Gzipped csv files containing p Ka and isoelectric point values can be downloaded from https://pypka.org/pKPDB. To query a single PDB code please use the PypKa free server at https://pypka.org. The pKPDB source code can be found at https://github.com/mms-fcul/pKPDB. Supplementary information Supplementary data are available at Bioinformatics online.

Sponsoring Organization:
USDOE Office of Nuclear Energy (NE), Nuclear Fuel Cycle and Supply Chain
OSTI ID:
1837314
Journal Information:
Bioinformatics, Journal Name: Bioinformatics Journal Issue: 1 Vol. 38; ISSN 1367-4803
Publisher:
Oxford University PressCopyright Statement
Country of Publication:
United Kingdom
Language:
English

References (14)

Reduced surface: An efficient way to compute molecular surfaces journal March 1996
An amino acid has two sides: A new 2D measure provides a different view of solvent exposure journal February 2005
Are Acidic and Basic Groups in Buried Proteins Predicted to be Ionized? journal May 2005
Cysteine Function Governs Its Conservation and Degeneration and Restricts Its Utilization on Protein Surfaces journal December 2010
RCSB Protein Data Bank: Architectural Advances Towards Integrated Searching and Efficient Access to Macromolecular Structure Data from the PDB Archive journal November 2020
PypKa: A Flexible Python Module for Poisson–Boltzmann-Based p K a Calculations journal August 2020
MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets journal October 2017
Biopython: freely available Python tools for computational molecular biology and bioinformatics journal March 2009
PIP-DB: the Protein Isoelectric Point database journal September 2014
PKAD: a database of experimentally measured pKa values of ionizable groups in proteins journal January 2019
The Protein Data Bank journal January 2000
Proteome-pI: proteome isoelectric point database journal October 2016
pK values of the ionizable groups of proteins journal May 2006
Electrostatic Energy and Macromolecular Function journal June 1991