Automated insertion of sequences into a ribosomal RNA alignment: An application of computational linguistics in molecular biology
- Case Western Reserve Univ., Cleveland, OH (United States)
This thesis involved the construction of (1) a grammar that incorporates knowledge on base invariancy and secondary structure in a molecule and (2) a parser engine that uses the grammar to position bases into the structural subunits of the molecule. These concepts were combined with a novel pinning technique to form a tool that semi-automates insertion of a new species into the alignment for the 16S rRNA molecule (a component of the ribosome) maintained by Dr. Carl Woese`s group at the University of Illinois at Urbana. The tool was tested on species extracted from the alignment and on a group of entirely new species. The results were very encouraging, and the tool should be substantial aid to the curators of the 16S alignment. The construction of the grammar was itself automated, allowing application of the tool to alignments for other molecules. The logic programming language Prolog was used to construct all programs involved. The computational linguistics approach used here was found to be a useful way to attach the problem of insertion into an alignment.
- Research Organization:
- Argonne National Lab. (ANL), Argonne, IL (United States)
- Sponsoring Organization:
- USDOE Office of Science (SC)
- DOE Contract Number:
- W-31109-ENG-38
- OSTI ID:
- 10108317
- Report Number(s):
- ANL-91/29; ON: DE92004954
- Resource Relation:
- Other Information: DN: Thesis submitted to Case Western Reserve University, Cleveland, OH; TH: Thesis (M.S.); PBD: Nov 1991
- Country of Publication:
- United States
- Language:
- English
Similar Records
An automated procedure for covariation-based detection of RNA structure
Evolution of protein-coupled RNA dynamics during hierarchical assembly of ribosomal complexes