Atypical regions in large genomic DNA sequences
- Lawrence Berkeley Lab., CA (United States)
- Univ. of California, Berkeley, CA (United States)
Large genomic DNA sequences contain regions with distinctive patterns of sequence organization. The authors describe a method using logarithms of probabilities based on seventh-order Markov chains to rapidly identify genomic sequences that do not resemble models of genome organization built from compilations of octanucleotide usage. Data bases have been constructed from Escherichia coli and Saccharomyces cerevisiae DNA sequences of >1000 nt and human sequences of >10,000 nt. Atypical genes and clusters of genes have been located in bacteriophage, yeast, and primate DNA sequences. The authors consider criteria for statistical significance of the results, offer possible explanations for the observed variation in genome organization, and give additional applications of these methods in DNA sequence analysis.
- Sponsoring Organization:
- USDOE
- DOE Contract Number:
- AC03-76SF00098
- OSTI ID:
- 86536
- Journal Information:
- Proceedings of the National Academy of Sciences of the United States of America, Vol. 91, Issue 15; Other Information: PBD: 19 Jul 1994
- Country of Publication:
- United States
- Language:
- English
Similar Records
DNA sequence of a mutation in the leader region of the yeast iso-1-cytochrome c mRNA
Contamination of cDNA libraries and expressed sequence-tags databases