skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Automated insertion of sequences into a ribosomal RNA alignment: An application of computational linguistics in molecular biology

Thesis/Dissertation ·
DOI:https://doi.org/10.2172/10108317· OSTI ID:10108317
 [1]
  1. Case Western Reserve Univ., Cleveland, OH (United States)

This thesis involved the construction of (1) a grammar that incorporates knowledge on base invariancy and secondary structure in a molecule and (2) a parser engine that uses the grammar to position bases into the structural subunits of the molecule. These concepts were combined with a novel pinning technique to form a tool that semi-automates insertion of a new species into the alignment for the 16S rRNA molecule (a component of the ribosome) maintained by Dr. Carl Woese`s group at the University of Illinois at Urbana. The tool was tested on species extracted from the alignment and on a group of entirely new species. The results were very encouraging, and the tool should be substantial aid to the curators of the 16S alignment. The construction of the grammar was itself automated, allowing application of the tool to alignments for other molecules. The logic programming language Prolog was used to construct all programs involved. The computational linguistics approach used here was found to be a useful way to attach the problem of insertion into an alignment.

Research Organization:
Argonne National Lab. (ANL), Argonne, IL (United States)
Sponsoring Organization:
USDOE Office of Science (SC)
DOE Contract Number:
W-31109-ENG-38
OSTI ID:
10108317
Report Number(s):
ANL-91/29; ON: DE92004954
Resource Relation:
Other Information: DN: Thesis submitted to Case Western Reserve University, Cleveland, OH; TH: Thesis (M.S.); PBD: Nov 1991
Country of Publication:
United States
Language:
English