On the Integration of Structured Data and Text: A Review of the SIRE Architecture

Summary: (An Invited Overview)
O. Frieder, A. Chowdhury, D. Grossman, & M.C. McCabe
Information Retrieval Laboratory
Illinois Institute of Technology
1.0 Introduction
Over the past decade, members of the Information Retrieval Lab have designed, developed, and
deployed a variety of information retrieval systems. A central theme for all of our systems was the
integration of structured data and text. One of our more recent efforts, SIRE, a Scalable Information
Retrieval Engine [Grossman97, Grossman98, Lundquist99], is the focus of this paper. For completeness,
we review some of the functionality of SIRE, although it is described in greater detail in other forums. We
describe the architecture of the prototype developed for the National Institutes of Health (NIH) National
Center for Complementary and Alternative Medicine (NCCAM) [Frieder00] by some of the members of
the laboratory. The version deployed at NCCAM is a more industrialized version of this prototype.
The mainstream approach in the development of information retrieval systems uses a customized
inverted index to represent the text. SIRE, on the other hand, is a relational information retrieval approach
and uses relations to model an inverted index. Storing the full text in a relational environment integrates
the search of unstructured data with the traditional structured data search of Relational Database
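
To illustrate how relations can model an inverted index, the following is a minimal sketch using Python's built-in sqlite3 module. It is not SIRE's actual schema or query set: the table names (doc, posting), the column names, the whitespace tokenizer, and the term-frequency score are all assumptions made for this example. The point is only that once postings are stored as rows, a keyword search and a structured predicate (here, a publication year) can be answered by one SQL query.

```python
import sqlite3


def build_db(docs):
    """Load (doc_id, year, text) tuples into an in-memory SQLite database.

    Hypothetical schema: `doc` holds the structured attributes and full text,
    and `posting` holds one row per (term, document) pair, which is the
    relational counterpart of an inverted-index posting list.
    """
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE doc (doc_id INTEGER PRIMARY KEY, year INTEGER, body TEXT)"
    )
    conn.execute(
        "CREATE TABLE posting (term TEXT, doc_id INTEGER, tf INTEGER, "
        "PRIMARY KEY (term, doc_id))"
    )
    for doc_id, year, text in docs:
        conn.execute("INSERT INTO doc VALUES (?, ?, ?)", (doc_id, year, text))
        # Count term frequencies with a naive lowercase/whitespace tokenizer.
        counts = {}
        for term in text.lower().split():
            counts[term] = counts.get(term, 0) + 1
        conn.executemany(
            "INSERT INTO posting VALUES (?, ?, ?)",
            [(term, doc_id, tf) for term, tf in counts.items()],
        )
    return conn


def search(conn, terms, min_year):
    """Return (doc_id, score) rows for documents dated min_year or later.

    The score is the summed term frequency of the query terms, an
    illustrative stand-in for a real ranking function.
    """
    terms = [t.lower() for t in terms]
    placeholders = ",".join("?" for _ in terms)
    sql = f"""
        SELECT d.doc_id, SUM(p.tf) AS score
        FROM posting AS p
        JOIN doc AS d ON d.doc_id = p.doc_id
        WHERE p.term IN ({placeholders}) AND d.year >= ?
        GROUP BY d.doc_id
        ORDER BY score DESC, d.doc_id ASC
    """
    return conn.execute(sql, (*terms, min_year)).fetchall()


if __name__ == "__main__":
    sample = [
        (1, 1997, "relational retrieval engine"),
        (2, 1999, "inverted index retrieval retrieval"),
        (3, 2000, "structured data search"),
    ]
    db = build_db(sample)
    print(search(db, ["retrieval", "index"], 1998))
```

In this sketch the text condition (the term lookup in posting) and the structured condition (the year filter on doc) are evaluated by the same join, which is the kind of integration the paragraph above describes.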


Source: Argamon, Shlomo - Department of Computer Science, Illinois Institute of Technology


Collections: Computer Technologies and Information Sciences