HYTREM; A hybrid text-retrieval machine for large databases
- Dept. of Computer and Information Science, Ohio State Univ., Columbus, OH (US)
- Computer Systems Research Institute, Univ. of Toronto, Toronto, Ontario (CA)
This paper describes the design of a text-retrieval machine, called HYTREM (hybrid text-retrieval machine), for the support of large unformatted text databases. A signature file is used as an access method to reduce the amount of data that need to be searched directly. HYTREM consists of two major subsystems: a signature processor and a text processor. The signature processor is based on a word-parallel, bit-serial (WPBS) organization which is faster, more efficient, and more flexible than a word-serial, bit-parallel (WSBP) organization proposed in the literature. The text processor, called ALTEP (associative linear text processor), is a linear array of logic cells capable of matching regular expressions at a much higher speed than that of previous designs. Since both the signature processor and ALTEP are highly parallel processors, a high-speed multiple-response resolver (MRR) is provided to facilitate data transfer between the processors and the controllers over a single common bus. Issues about the design of a cost-effective mass-storage system (MSS) are discussed. The performance and implementation issues of HYTREM are discussed.
- OSTI ID:
- 7129348
- Journal Information:
- IEEE (Institute of Electrical and Electronics Engineers) Transactions on Computers; (USA), Vol. 39:1; ISSN 0018-9340
- Country of Publication:
- United States
- Language:
- English
Similar Records
Machine Learning for Identifying Relevance to Biosurveillance in Multilingual Text
Implementation issues for algorithmic VLSI processor arrays