Home

About

Advanced Search

Browse by Discipline

Scientific Societies

E-print Alerts

Add E-prints

E-print Network
FAQHELPSITE MAPCONTACT US


  Advanced Search  

 
An Evaluation of Linguisticallymotivated Indexing Schemes Avi Arampatzis Th.P. van der Weide C.H.A. Koster P. van Bommel
 

Summary: An Evaluation of Linguistically­motivated Indexing Schemes
Avi Arampatzis Th.P. van der Weide C.H.A. Koster P. van Bommel
Technical Report CSI­R9927, December 1999, Dept. of Information Systems and Information Retrieval,
University of Nijmegen, The Netherlands.
favgerino,tvdw,kees,pvbg@cs.kun.nl
Submitted to BCS­IRSG 2000
December 14, 1999
Abstract
In this article, we describe a number of indexing experiments based on indexing terms other than simple keywords.
These experiments were conducted as one step in validating a linguistically­motivated indexing model. The problem
is important but not new. What is new in this approach is the variety of schemes evaluated. It is important since it
should not only help to overcome the well­known problems of bag­of­words representations, but also the difficulties
raised by non­linguistic text simplification techniques such as stemming, stop­word deletion, and term selection. Our
approach in the selection of terms is based on part­of­speech tagging and shallow parsing. The indexing schemes
evaluated vary from simple keywords to nouns, verbs, adverbs, adjectives, adjacent word­pairs, and head­modifier
pairs. Our findings apply to Information Retrieval and most of related areas.
1 Introduction
The purpose of an automated information seeking system is to process information sources, and provide users with the
information they need. The particular nature of an information seeking process is determined by the characteristics
of information needs and information sources, such as the change rate. For instance, Information Retrieval assumes a

  

Source: Arampatzis, Avi - Department of Electrical and Computer Engineering, Democritus University of Thrace

 

Collections: Computer Technologies and Information Sciences