| | |
Summary: WebDB 2003
Modeling Query-Based Access to Text Databases
Eugene Agichtein Panagiotis Ipeirotis Luis Gravano
Columbia University
{eugene,pirot,gravano}@cs.columbia.edu
ABSTRACT
Searchable text databases abound on the web. Applications that
require access to such databases often resort to querying to ex-
tract relevant documents because of two main reasons. First, some
text databases on the web are not "crawlable," and hence the only
way to retrieve their documents is via querying. Second, applica-
tions often require only a small fraction of a database's contents, so
retrieving relevant documents via querying is an attractive choice
from an efficiency viewpoint, even for crawlable databases. Of-
ten an application's query-based strategy starts with a small num-
ber of user-provided queries. Then, new queries are extracted in
an application-dependent way from the documents in the initial
query results, and the process iterates. The success of this com-
mon type of strategy relies on retrieved documents "contributing"
new queries. If new documents fail to produce new queries, then
|