Summary: Low-cost Management of Inverted Files
for Online Full-Text Search
Giorgos Margaritis, Stergios V. Anastasiadis
Department of Computer Science
University of Ioannina
Ioannina 45110, GREECE
{gmargari,stergios}@cs.uoi.gr
ABSTRACT
In dynamic environments with frequent content updates, we require
online full-text search that scales to large data collections and
achieves low search latency. Several recent methods that support
fast incremental indexing of documents keep multiple partial index
structures on disk and continuously update them as new documents
are added. However, spreading indexing information across multiple
locations on disk tends to considerably decrease the search
responsiveness of the system. In the present paper, we take a fresh
look at the problem of online full-text search, taking into account
the architectural features of modern systems. Selective Range Flush
is a greedy method that we introduce to manage the index in the
system by using fixed-size blocks to organize the data