Summary: Automatic Web Query Classification Using Labeled
and Unlabeled Training Data
Steven M. Beitzel, Eric C. Jensen,
Ophir Frieder, David Grossman
Information Retrieval Laboratory
Illinois Institute of Technology
David D. Lewis, Abdur Chowdhury
America Online, Inc.
Accurate topical categorization of user queries allows for
increased effectiveness, efficiency, and revenue potential in
general-purpose web search systems. Such categorization
becomes critical if the system is to return results not just from a
general web collection but from topic-specific databases as well.
Maintaining sufficient categorization recall is very difficult because
web queries are typically short and therefore yield few features per query.