Title: An affinity-structure database of helix-turn-helix: DNA complexes with a universal coordinate system

Journal Article · BMC Bioinformatics

Molecular interactions between proteins and DNA molecules underlie many cellular processes, including transcriptional regulation, chromosome replication, and nucleosome positioning. Computational analyses of protein-DNA interactions rely on experimental data characterizing known protein-DNA interactions structurally and biochemically. While many databases exist that contain either structural or biochemical data, few integrate these two data sources in a unified fashion. Such integration is becoming increasingly critical with the rapid growth of structural and biochemical data, and the emergence of algorithms that rely on the synthesis of multiple data types to derive computational models of molecular interactions. We have developed an integrated affinity-structure database in which the experimental and quantitative DNA binding affinities of helix-turn-helix proteins are mapped onto the crystal structures of the corresponding protein-DNA complexes. This database provides access to: (i) protein-DNA structures, (ii) quantitative summaries of protein-DNA binding affinities using position weight matrices, and (iii) raw experimental data of protein-DNA binding instances. Critically, this database establishes a correspondence between experimental structural data and quantitative binding affinity data at the single basepair level. Furthermore, we present a novel alignment algorithm that structurally aligns the protein-DNA complexes in the database and creates a unified residue-level coordinate system for comparing the physico-chemical environments at the interface between complexes. Using this unified coordinate system, we compute the statistics of atomic interactions at the protein-DNA interface of helix-turn-helix proteins. We provide an interactive website for visualizing, querying, and analyzing this database, and a downloadable version to facilitate programmatic analysis.
Lastly, this database will facilitate the analysis of protein-DNA interactions and the development of programmatic computational methods that capitalize on integration of structural and biochemical datasets. The database can be accessed at http://ProteinDNA.hms.harvard.edu.

Research Organization:
Stanford Univ., CA (United States)
Sponsoring Organization:
USDOE Office of Science (SC)
Grant/Contract Number:
FG02-05ER64136
OSTI ID:
1241035
Journal Information:
BMC Bioinformatics, Vol. 16, Issue 1; ISSN 1471-2105
Publisher:
BioMed Central
Country of Publication:
United States
Language:
English
Citation Metrics:
Cited by: 3 works
Citation information provided by
Web of Science

