A neural network system for prediction of RNA polymerase II promoters
One of the most difficult problems in the analysis of eucaryotic genes is the detection of RNA polymerase II promoter regions. Although promoter regions vary in the primary DNA sequence, a basic group of core promoter elements has been suggested in the literature. Many human promoter sequences contain a TATAA sequence element at approximately 30 bases upstream of the cap site (transcription start site). Other elements are the GC box which binds SPA and upregulates transcription, the CAAT box, and the ATG initiator codon. To characterize promoters, we constructed frequency matrices for each element using experimentally mapped human promoter regions. Additionally, we constructed histograms for the distances separating the various elements. We then used a neural network to combine these informational elements. The output of the neural network is then processed using a set of expert rules which depend on GRAIL`s ability to find exons in anonymous DNA. This improves the selectivity of promoter detection and reduces the false positive rate.
- Research Organization:
- Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)
- Sponsoring Organization:
- USDOE, Washington, DC (United States)
- DOE Contract Number:
- AC05-84OR21400
- OSTI ID:
- 172082
- Report Number(s):
- CONF-9404262-2; ON: DE96003776; TRN: 96:000957
- Resource Relation:
- Conference: 1. world congress on computational medicine, public health and biotechnology, Austin, TX (United States), 24-28 Apr 1994; Other Information: PBD: 1994
- Country of Publication:
- United States
- Language:
- English
Similar Records
Structure of the gene for the catalytic subunit of human DNA polymerase {delta} (POLD1)
Structure and organization of the human neuronatin gene