skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: HYTREM; A hybrid text-retrieval machine for large databases

Journal Article · · IEEE (Institute of Electrical and Electronics Engineers) Transactions on Computers; (USA)
DOI:https://doi.org/10.1109/12.46285· OSTI ID:7129348
 [1];  [2]
  1. Dept. of Computer and Information Science, Ohio State Univ., Columbus, OH (US)
  2. Computer Systems Research Institute, Univ. of Toronto, Toronto, Ontario (CA)

This paper describes the design of a text-retrieval machine, called HYTREM (hybrid text-retrieval machine), for the support of large unformatted text databases. A signature file is used as an access method to reduce the amount of data that need to be searched directly. HYTREM consists of two major subsystems: a signature processor and a text processor. The signature processor is based on a word-parallel, bit-serial (WPBS) organization which is faster, more efficient, and more flexible than a word-serial, bit-parallel (WSBP) organization proposed in the literature. The text processor, called ALTEP (associative linear text processor), is a linear array of logic cells capable of matching regular expressions at a much higher speed than that of previous designs. Since both the signature processor and ALTEP are highly parallel processors, a high-speed multiple-response resolver (MRR) is provided to facilitate data transfer between the processors and the controllers over a single common bus. Issues about the design of a cost-effective mass-storage system (MSS) are discussed. The performance and implementation issues of HYTREM are discussed.

OSTI ID:
7129348
Journal Information:
IEEE (Institute of Electrical and Electronics Engineers) Transactions on Computers; (USA), Vol. 39:1; ISSN 0018-9340
Country of Publication:
United States
Language:
English