On the computational complexity of sequence design problems

Hart, W E

Conference · Sun Nov 30 23:00:00 EST 1997

OSTI ID:549004

Sandia National Labs., Albuquerque, NM (United States)

Inverse protein folding concerns the identification of an amino acid sequence that folds to a given structure. Sequence design problems attempt to avoid the apparent difficulty of inverse protein folding by defining an energy that can be minimized to find protein-like sequences. We evaluate the practical relevance of two sequence design problems by analyzing their computational complexity. We show that the canonical method of sequence design is intractable and describe approximation algorithms for this problem. We also describe an efficient algorithm that exactly solves the grand canonical method. Our analysis shows how sequence design problems can fail to reduce the difficulty of the inverse protein folding problem and highlights the need to analyze these problems to evaluate their practical relevance. 10 refs., 8 figs.

OSTI does not have a digital full text copy available.

Research Organization: Association for Computing Machinery, New York, NY (United States); Sloan (Alfred P.) Foundation, New York, NY (United States)

OSTI ID: 549004

Report Number(s): CONF-970137--

Country of Publication: United States

Language: English

Similar Records

On the computational complexity of sequence design problems

Technical Report · Mon Dec 30 23:00:00 EST 1996 · OSTI ID:425316

Native sequence determines sidechain packing in a protein, but does optimal sidechain packing determine the native sequence?

Conference · Mon Dec 30 23:00:00 EST 1996 · OSTI ID:549253

An efficient bit string implementation of a database cross-field association system (with an application to protein sequence patterns)

Conference · Sat Aug 01 00:00:00 EDT 1992 · OSTI ID:10162003

Related Subjects

55 BIOLOGY AND MEDICINE, BASIC STUDIES
99 GENERAL AND MISCELLANEOUS
ALGORITHMS
AMINO ACID SEQUENCE
COMPUTER CODES
DIFFERENTIAL EQUATIONS
DNA SEQUENCING
EFFICIENCY
ELECTRONIC STRUCTURE
ENERGY LEVELS
MATHEMATICAL MODELS
MOLECULAR BIOLOGY
PROTEIN STRUCTURE
PROTEINS
STRUCTURE-ACTIVITY RELATIONSHIPS
