Automated insertion of sequences into a ribosomal RNA alignment: An application of computational linguistics in molecular biology
Abstract
This thesis involved the construction of (1) a grammar that incorporates knowledge on base invariancy and secondary structure in a molecule and (2) a parser engine that uses the grammar to position bases into the structural subunits of the molecule. These concepts were combined with a novel pinning technique to form a tool that semi-automates insertion of a new species into the alignment for the 16S rRNA molecule (a component of the ribosome) maintained by Dr. Carl Woese`s group at the University of Illinois at Urbana. The tool was tested on species extracted from the alignment and on a group of entirely new species. The results were very encouraging, and the tool should be substantial aid to the curators of the 16S alignment. The construction of the grammar was itself automated, allowing application of the tool to alignments for other molecules. The logic programming language Prolog was used to construct all programs involved. The computational linguistics approach used here was found to be a useful way to attach the problem of insertion into an alignment.
- Authors:
-
- Case Western Reserve Univ., Cleveland, OH (United States)
- Publication Date:
- Research Org.:
- Argonne National Lab. (ANL), Argonne, IL (United States)
- Sponsoring Org.:
- USDOE Office of Science (SC)
- OSTI Identifier:
- 10108317
- Report Number(s):
- ANL-91/29
ON: DE92004954
- DOE Contract Number:
- W-31109-ENG-38
- Resource Type:
- Thesis/Dissertation
- Resource Relation:
- Other Information: DN: Thesis submitted to Case Western Reserve University, Cleveland, OH; TH: Thesis (M.S.); PBD: Nov 1991
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 59 BASIC BIOLOGICAL SCIENCES; 97 MATHEMATICS AND COMPUTING; RNA; TEXT EDITORS; AUTOMATION; MOLECULAR BIOLOGY; PROLOG; COMPUTER CODES; 550200; 990200; BIOCHEMISTRY; MATHEMATICS AND COMPUTERS
Citation Formats
Taylor, Ronald C. Automated insertion of sequences into a ribosomal RNA alignment: An application of computational linguistics in molecular biology. United States: N. p., 1991.
Web. doi:10.2172/10108317.
Taylor, Ronald C. Automated insertion of sequences into a ribosomal RNA alignment: An application of computational linguistics in molecular biology. United States. https://doi.org/10.2172/10108317
Taylor, Ronald C. 1991.
"Automated insertion of sequences into a ribosomal RNA alignment: An application of computational linguistics in molecular biology". United States. https://doi.org/10.2172/10108317. https://www.osti.gov/servlets/purl/10108317.
@article{osti_10108317,
title = {Automated insertion of sequences into a ribosomal RNA alignment: An application of computational linguistics in molecular biology},
author = {Taylor, Ronald C.},
abstractNote = {This thesis involved the construction of (1) a grammar that incorporates knowledge on base invariancy and secondary structure in a molecule and (2) a parser engine that uses the grammar to position bases into the structural subunits of the molecule. These concepts were combined with a novel pinning technique to form a tool that semi-automates insertion of a new species into the alignment for the 16S rRNA molecule (a component of the ribosome) maintained by Dr. Carl Woese`s group at the University of Illinois at Urbana. The tool was tested on species extracted from the alignment and on a group of entirely new species. The results were very encouraging, and the tool should be substantial aid to the curators of the 16S alignment. The construction of the grammar was itself automated, allowing application of the tool to alignments for other molecules. The logic programming language Prolog was used to construct all programs involved. The computational linguistics approach used here was found to be a useful way to attach the problem of insertion into an alignment.},
doi = {10.2172/10108317},
url = {https://www.osti.gov/biblio/10108317},
journal = {},
number = ,
volume = ,
place = {United States},
year = {Fri Nov 01 00:00:00 EST 1991},
month = {Fri Nov 01 00:00:00 EST 1991}
}