skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Specific modelling of regulatory units in DNA sequences

Conference ·
OSTI ID:549249
;  [1]
  1. GSF - National Research Center for Environment and Health, Neuherberg (Germany)

Transcriptional control regions are usually composed of a complex arrangement of individual transcriptional elements like protein binding sites. This modular structure allows generation of enormous functional diversity of regulatory regions with a limited set of individual elements. We implemented simple formal representations of these general features of regulatory regions into an algorithm capable of developing complex models reflecting both the element composition and the functional organization of individual elements. Our method (ModelGenerator) requires a training set of at least 10 sequences containing the regulatory regions to be modelled and a very simple initial model which may consist of just two characteristic transcription factor binding sites. We show the capability of our algorithm to expand the initial model solely by comparative sequence analysis leading to complex, biologically meaningful models. A second program (ModelInspector) is capable to scan new sequence data for matches to models defined by ModelGenerator. We show two models for retroviral transcriptional control regions to be highly specific. A search against GenBank using one of the models is shown to be free of false negatives and to produce less than 2 false positives/million nucleotides. Thus, our algorithms appear to be useful tools for the analysis of extremely long genomic sequences which are now becoming available as results of various genome sequencing projects. 16 refs., 3 figs., 2 tabs.

OSTI ID:
549249
Report Number(s):
CONF-970132-; CNN: Grant BI04-CT95-0226; TRN: 97:005592-0019
Resource Relation:
Conference: Pacific symposium on biocomputing `97, Kapalua, HI (United States), 6-9 Jan 1997; Other Information: PBD: 1996; Related Information: Is Part Of Pacific symposium on biocomputing `97: Proceedings; Altman, R.B. [ed.] [Stanford Univ., CA (United States). Section on Medical Informatics]; Dunker, A.K. [ed.] [Washington State Univ., Pullman, WA (United States). Dept. of Biochemistry and Biophysics]; Hunter, L. [ed.] [National Insts. of Health, Bethesda, MD (United States). National Library of Medicine]; Klein, T.E. [ed.] [California Univ., San Francisco, CA (United States). Dept. of Pharmaceutical Chemistry]; PB: 508 p.
Country of Publication:
United States
Language:
English