Home

About

Advanced Search

Browse by Discipline

Scientific Societies

E-print Alerts

Add E-prints

E-print Network
FAQHELPSITE MAPCONTACT US


  Advanced Search  

 
Using the Past To Score the Present: Extending Term Weighting Models Through Revision History Analysis
 

Summary: Using the Past To Score the Present: Extending Term
Weighting Models Through Revision History Analysis
Ablimit Aji Yu Wang
Emory University
{aaji,yu.wang}@emory.edu
Eugene Agichtein
Emory University
eugene@mathcs.emory.edu
Evgeniy Gabrilovich
Yahoo! Research
gabr@yahoo-inc.com
ABSTRACT
The generative process underlies many information retrieval mod-
els, notably statistical language models. Yet these models only ex-
amine one (current) version of the document, effectively ignoring
the actual document generation process. We posit that a consider-
able amount of information is encoded in the document authoring
process, and this information is complementary to the word oc-
currence statistics upon which most modern retrieval models are
based. We propose a new term weighting model, Revision His-

  

Source: Agichtein, Eugene - Department of Mathematics and Computer Science, Emory University

 

Collections: Computer Technologies and Information Sciences