Identification of genes in anonymous DNA sequences. Annual performance report, February 1, 1991--January 31, 1992
The objective of this project is the development of practical software to automate the identification of genes in anonymous DNA sequences from the human, and other higher eukaryotic genomes. A software system for automated sequence analysis, gm (gene modeler) has been designed, implemented, tested, and distributed to several dozen laboratories worldwide. A significantly faster, more robust, and more flexible version of this software, gm 2.0 has now been completed, and is being tested by operational use to analyze human cosmid sequence data. A range of efforts to further understand the features of eukaryoyic gene sequences are also underway. This progress report also contains papers coming out of the project including the following: gm: a Tool for Exploratory Analysis of DNA Sequence Data; The Human THE-LTR(O) and MstII Interspersed Repeats are subfamilies of a single widely distruted highly variable repeat family; Information contents and dinucleotide compostions of plant intron sequences vary with evolutionary origin; Splicing signals in Drosophila: intron size, information content, and consensus sequences; Integration of automated sequence analysis into mapping and sequencing projects; Software for the C. elegans genome project.
- Research Organization:
- New Mexico State Univ., Las Cruces, NM (United States). Computing Research Lab.
- Sponsoring Organization:
- USDOE, Washington, DC (United States)
- DOE Contract Number:
- FG04-89ER60865
- OSTI ID:
- 243485
- Report Number(s):
- DOE/ER/60865-3; ON: DE96011538; TRN: 96:003547
- Resource Relation:
- Other Information: PBD: [1996]
- Country of Publication:
- United States
- Language:
- English
Similar Records
Identification of genes in anonymous DNA sequences. Final report: Report period, 15 April 1993--15 April 1994
The human myelin oligodendrocyte glycoprotein (MOG) gene: Complete nucleotide sequence and structural characterization