DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Automated side-chain model building and sequence assignment by template matching

Abstract

An algorithm is described for automated building of side chains in an electron-density map once a main-chain model is built and for alignment of the protein sequence to the map. The procedure is based on a comparison of electron density at the expected side-chain positions with electron-density templates. The templates are constructed from average amino-acid side-chain densities in 574 refined protein structures. For each contiguous segment of main chain, a matrix with entries corresponding to an estimate of the probability that each of the 20 amino acids is located at each position of the main-chain model is obtained. The probability that this segment corresponds to each possible alignment with the sequence of the protein is estimated using a Bayesian approach and high-confidence matches are kept. Once side-chain identities are determined, the most probable rotamer for each side chain is built into the model. The automated procedure has been implemented in the RESOLVE software. Combined with automated main-chain model building, the procedure produces a preliminary model suitable for refinement and extension by an experienced crystallographer.

Authors:
ORCiD logo [1]
  1. Los Alamos National Lab. (LANL), Los Alamos, NM (United States)
Publication Date:
Research Org.:
Los Alamos National Lab. (LANL), Los Alamos, NM (United States)
Sponsoring Org.:
USDOE
OSTI Identifier:
810424
Report Number(s):
LA-UR-02-3903
Journal ID: ISSN 0907-4449; ABCRE6
Grant/Contract Number:  
W-7405-ENG-36
Resource Type:
Accepted Manuscript
Journal Name:
Acta Crystallographica. Section D: Biological Crystallography
Additional Journal Information:
Journal Volume: 59; Journal Issue: 1; Journal ID: ISSN 0907-4449
Publisher:
International Union of Crystallography
Country of Publication:
United States
Language:
English
Subject:
59 BASIC BIOLOGICAL SCIENCES

Citation Formats

Terwilliger, Thomas C. Automated side-chain model building and sequence assignment by template matching. United States: N. p., 2002. Web. doi:10.1107/S0907444902018048.
Terwilliger, Thomas C. Automated side-chain model building and sequence assignment by template matching. United States. https://doi.org/10.1107/S0907444902018048
Terwilliger, Thomas C. Fri . "Automated side-chain model building and sequence assignment by template matching". United States. https://doi.org/10.1107/S0907444902018048. https://www.osti.gov/servlets/purl/810424.
@article{osti_810424,
title = {Automated side-chain model building and sequence assignment by template matching},
author = {Terwilliger, Thomas C.},
abstractNote = {An algorithm is described for automated building of side chains in an electron-density map once a main-chain model is built and for alignment of the protein sequence to the map. The procedure is based on a comparison of electron density at the expected side-chain positions with electron-density templates. The templates are constructed from average amino-acid side-chain densities in 574 refined protein structures. For each contiguous segment of main chain, a matrix with entries corresponding to an estimate of the probability that each of the 20 amino acids is located at each position of the main-chain model is obtained. The probability that this segment corresponds to each possible alignment with the sequence of the protein is estimated using a Bayesian approach and high-confidence matches are kept. Once side-chain identities are determined, the most probable rotamer for each side chain is built into the model. The automated procedure has been implemented in the RESOLVE software. Combined with automated main-chain model building, the procedure produces a preliminary model suitable for refinement and extension by an experienced crystallographer.},
doi = {10.1107/S0907444902018048},
journal = {Acta Crystallographica. Section D: Biological Crystallography},
number = 1,
volume = 59,
place = {United States},
year = {Fri Dec 20 00:00:00 EST 2002},
month = {Fri Dec 20 00:00:00 EST 2002}
}

Journal Article:
Free Publicly Available Full Text
Publisher's Version of Record

Citation Metrics:
Cited by: 106 works
Citation information provided by
Web of Science

Figures / Tables:

Figure 1 Figure 1: Fraction of correct amino-acid side-chain assignments as a function of the probability estimated from (1). For each residue in the main-chain models for the eight structures listed in Table 1, the relative probabilities for each of the possible side chains were obtained using (1). The correct side chainsmore » were identified as the nearest amino acid in the refined model of each structure. The fraction of correct amino-acid side-chain assignments is tabulated as a function of the probability estimates.« less

Save / Share:

Works referencing / citing this record:

Functional significance of four successive glycine residues in the pyrophosphate binding loop of fungal 6-oxopurine phosphoribosyltransferases
journal, June 2012

  • Moynié, Lucile; Giraud, Marie-France; Breton, Annick
  • Protein Science, Vol. 21, Issue 8
  • DOI: 10.1002/pro.2098

Crystal structure of VC1805, a conserved hypothetical protein from a Vibrio cholerae pathogenicity island, reveals homology to human p32: Crystal Structure of VC1805
journal, February 2008

  • Sheikh, Md. Arif; Potter, Jane A.; Johnson, Kenneth A.
  • Proteins: Structure, Function, and Bioinformatics, Vol. 71, Issue 3
  • DOI: 10.1002/prot.21993

Structural insights into dehydratase substrate selection for the borrelidin and fluvirucin polyketide synthases
journal, May 2019

  • Barajas, Jesus F.; McAndrew, Ryan P.; Thompson, Mitchell G.
  • Journal of Industrial Microbiology & Biotechnology, Vol. 46, Issue 8
  • DOI: 10.1007/s10295-019-02189-z

Crystal structures of the human adiponectin receptors
journal, April 2015

  • Tanabe, Hiroaki; Fujii, Yoshifumi; Okada-Iwabu, Miki
  • Nature, Vol. 520, Issue 7547
  • DOI: 10.1038/nature14301

The ligand-binding face of the semaphorins revealed by the high-resolution crystal structure of SEMA4D
journal, September 2003

  • Love, Christopher A.; Harlos, Karl; Mavaddat, Nasim
  • Nature Structural & Molecular Biology, Vol. 10, Issue 10
  • DOI: 10.1038/nsb977

Structural insights into hedgehog ligand sequestration by the human hedgehog-interacting protein HHIP
journal, June 2009

  • Bishop, Benjamin; Aricescu, A. Radu; Harlos, Karl
  • Nature Structural & Molecular Biology, Vol. 16, Issue 7
  • DOI: 10.1038/nsmb.1607

A fully automatic method yielding initial models from high-resolution cryo-electron microscopy maps
journal, October 2018


Structure of Foot-and-Mouth Disease Virus RNA-dependent RNA Polymerase and Its Complex with a Template-Primer RNA
journal, August 2004

  • Ferrer-Orta, Cristina; Arias, Armando; Perez-Luque, Rosa
  • Journal of Biological Chemistry, Vol. 279, Issue 45
  • DOI: 10.1074/jbc.m405465200

Structure of the cytoplasmic domain of Yersinia pestis YscD, an essential component of the type III secretion system
journal, February 2012

  • Lountos, George T.; Tropea, Joseph E.; Waugh, David S.
  • Acta Crystallographica Section D Biological Crystallography, Vol. 68, Issue 3
  • DOI: 10.1107/s0907444911054308

Sequence assignment for low-resolution modelling of protein crystal structures
journal, July 2019

  • Chojnowski, Grzegorz; Pereira, Joana; Lamzin, Victor S.
  • Acta Crystallographica Section D Structural Biology, Vol. 75, Issue 8
  • DOI: 10.1107/s2059798319009392

Functional Analysis of preA in Aeromonas veronii TH0426 Reveals a Key Role in the Regulation of Virulence and Resistance to Oxidative Stress
journal, December 2019

  • Yang, Bintong; Song, Haichao; An, Dingjie
  • International Journal of Molecular Sciences, Vol. 21, Issue 1
  • DOI: 10.3390/ijms21010098

PHENIX: a comprehensive Python-based system for macromolecular structure solution.
text, January 2010

  • Adams, Paul D.; Afonine, Pavel V.; Bunkóczi, Gábor
  • Apollo - University of Cambridge Repository
  • DOI: 10.17863/cam.45787

Figures/Tables have been extracted from DOE-funded journal article accepted manuscripts.