An affinity-structure database of helix-turn-helix: DNA complexes with a universal coordinate system
Abstract
Molecular interactions between proteins and DNA molecules underlie many cellular processes, including transcriptional regulation, chromosome replication, and nucleosome positioning. Computational analyses of protein-DNA interactions rely on experimental data characterizing known protein-DNA interactions structurally and biochemically. While many databases exist that contain either structural or biochemical data, few integrate these two data sources in a unified fashion. Such integration is becoming increasingly critical with the rapid growth of structural and biochemical data, and the emergence of algorithms that rely on the synthesis of multiple data types to derive computational models of molecular interactions. We have developed an integrated affinity-structure database in which the experimental and quantitative DNA binding affinities of helix-turn-helix proteins are mapped onto the crystal structures of the corresponding protein-DNA complexes. This database provides access to: (i) protein-DNA structures, (ii) quantitative summaries of protein-DNA binding affinities using position weight matrices, and (iii) raw experimental data of protein-DNA binding instances. Critically, this database establishes a correspondence between experimental structural data and quantitative binding affinity data at the single basepair level. Furthermore, we present a novel alignment algorithm that structurally aligns the protein-DNA complexes in the database and creates a unified residue-level coordinate system for comparing the physico-chemical environments at the interface between complexes. Using this unified coordinate system, we compute the statistics of atomic interactions at the protein-DNA interface of helix-turn-helix proteins. We provide an interactive website for visualization, querying, and analyzing this database, and a downloadable version to facilitate programmatic analysis. Lastly, this database will facilitate the analysis of protein-DNA interactions and the development of programmatic computational methods that capitalize on integration of structural and biochemical datasets. The database can be accessed at http://ProteinDNA.hms.harvard.edu.
- Authors:
- AlQuraishi, Mohammed; Tang, Shengdong; Xia, Xide
- Harvard Medical School, Boston, MA (United States)
- Publication Date:
- November 2015
- Research Org.:
- Stanford Univ., CA (United States)
- Sponsoring Org.:
- USDOE Office of Science (SC)
- OSTI Identifier:
- 1241035
- Grant/Contract Number:
- FG02-05ER64136
- Resource Type:
- Accepted Manuscript
- Journal Name:
- BMC Bioinformatics
- Additional Journal Information:
- Journal Volume: 16; Journal Issue: 1; Journal ID: ISSN 1471-2105
- Publisher:
- BioMed Central
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 59 BASIC BIOLOGICAL SCIENCES; protein-DNA; database; helix-turn-helix; transcription factors; structure; PWM
Citation Formats
AlQuraishi, Mohammed, Tang, Shengdong, and Xia, Xide. An affinity-structure database of helix-turn-helix: DNA complexes with a universal coordinate system. United States: N. p., 2015. Web. doi:10.1186/s12859-015-0819-2.
AlQuraishi, Mohammed, Tang, Shengdong, & Xia, Xide. An affinity-structure database of helix-turn-helix: DNA complexes with a universal coordinate system. United States. https://doi.org/10.1186/s12859-015-0819-2
AlQuraishi, Mohammed, Tang, Shengdong, and Xia, Xide. 2015. "An affinity-structure database of helix-turn-helix: DNA complexes with a universal coordinate system". United States. https://doi.org/10.1186/s12859-015-0819-2. https://www.osti.gov/servlets/purl/1241035.
@article{osti_1241035,
title = {An affinity-structure database of helix-turn-helix: DNA complexes with a universal coordinate system},
author = {AlQuraishi, Mohammed and Tang, Shengdong and Xia, Xide},
abstractNote = {Molecular interactions between proteins and DNA molecules underlie many cellular processes, including transcriptional regulation, chromosome replication, and nucleosome positioning. Computational analyses of protein-DNA interactions rely on experimental data characterizing known protein-DNA interactions structurally and biochemically. While many databases exist that contain either structural or biochemical data, few integrate these two data sources in a unified fashion. Such integration is becoming increasingly critical with the rapid growth of structural and biochemical data, and the emergence of algorithms that rely on the synthesis of multiple data types to derive computational models of molecular interactions. We have developed an integrated affinity-structure database in which the experimental and quantitative DNA binding affinities of helix-turn-helix proteins are mapped onto the crystal structures of the corresponding protein-DNA complexes. This database provides access to: (i) protein-DNA structures, (ii) quantitative summaries of protein-DNA binding affinities using position weight matrices, and (iii) raw experimental data of protein-DNA binding instances. Critically, this database establishes a correspondence between experimental structural data and quantitative binding affinity data at the single basepair level. Furthermore, we present a novel alignment algorithm that structurally aligns the protein-DNA complexes in the database and creates a unified residue-level coordinate system for comparing the physico-chemical environments at the interface between complexes. Using this unified coordinate system, we compute the statistics of atomic interactions at the protein-DNA interface of helix-turn-helix proteins. We provide an interactive website for visualization, querying, and analyzing this database, and a downloadable version to facilitate programmatic analysis. 
Lastly, this database will facilitate the analysis of protein-DNA interactions and the development of programmatic computational methods that capitalize on integration of structural and biochemical datasets. The database can be accessed at http://ProteinDNA.hms.harvard.edu.},
doi = {10.1186/s12859-015-0819-2},
journal = {BMC Bioinformatics},
number = 1,
volume = 16,
place = {United States},
year = {2015},
month = {11}
}