Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

A Maximum Entropy Formalism for Disentangling Chains of Correlated Sequence Positions

Technical Report ·
DOI:https://doi.org/10.2172/763147· OSTI ID:763147
Covariation analysis of sets of aligned sequences of protein molecules is successful in certain instances in elucidating certain structural and functional links, but in general, pairs of sites displaying highly covarying mutations in protein sequences do not necessarily correspond to sites that are spatially close in the protein structure. In contrast, covariation analysis of sets of aligned sequences for RNA molecules is relatively successful in elucidating RNA secondary structure, as well as some aspects of tertiary structure. The goals of this paper are to (1) present the problem, (2) develop the mathematical formalism for solving the problem, and (3) validate the resulting algorithms on simulated data. Extensive application to biological sequences will be presented elsewhere.
Research Organization:
Los Alamos National Lab., NM (US)
Sponsoring Organization:
USDOE Office of Energy Research (ER) (US)
DOE Contract Number:
W-7405-ENG-36
OSTI ID:
763147
Report Number(s):
LA-UR-98-1094
Country of Publication:
United States
Language:
English

Similar Records

Correlated mutations in protein sequences: Phylogenetic and structural effects
Technical Report · Mon Nov 30 23:00:00 EST 1998 · OSTI ID:296863

Comparative analysis of ribonuclease P RNA using gene sequences from natural microbial populations reveals tertiary structural elements
Journal Article · Mon Apr 01 23:00:00 EST 1996 · Proceedings of the National Academy of Sciences of the United States of America · OSTI ID:258596

Maximum entropy weighting of aligned sequences of proteins or DNA
Technical Report · Sat Dec 30 23:00:00 EST 1995 · OSTI ID:401848