A Generalized Method for Word Sense Disambiguation based on Wikipedia
 

Chenliang Li, Aixin Sun, and Anwitaman Datta
School of Computer Engineering,
Nanyang Technological University, Singapore
{lich0020|axsun|anwitaman}@ntu.edu.sg
Abstract. In this paper we propose a general framework for word sense
disambiguation using knowledge latent in Wikipedia. Specifically, we
exploit the rich and growing Wikipedia corpus to build a large and
robust knowledge repository consisting of keyphrases and their
associated candidate topics. Keyphrases are mainly derived from
Wikipedia article titles and anchor texts associated with wikilinks.
The disambiguation of a given keyphrase is based on both the
commonness of a candidate topic and the context-dependent relatedness,
where unnecessary (and potentially noisy) context information is
pruned. With extensive experimental evaluations using different
relatedness measures, we show that the proposed technique achieves
disambiguation accuracy comparable to state-of-the-art techniques
while incurring orders of magnitude less computation cost.
Keywords: Word Sense Disambiguation, Wikipedia, Context Pruning
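
The scoring idea described in the abstract (commonness of a candidate topic combined with relatedness to a pruned context) can be illustrated with a minimal sketch. The abstract does not give the actual formula, so everything below is an assumption made for illustration: the linear weighting (alpha), the Jaccard overlap used as a relatedness measure, the pruning threshold (prune_below), and the toy anchor counts and inlink sets. It is not the authors' implementation.

```python
from typing import Dict, List, Optional, Set

# Hypothetical toy data: keyphrase -> {candidate topic: anchor count}.
ANCHOR_COUNTS: Dict[str, Dict[str, int]] = {
    "java": {"Java (programming language)": 60, "Java (island)": 40},
}

# Hypothetical toy data: topic -> set of article ids that link to it.
INLINKS: Dict[str, Set[str]] = {
    "Java (programming language)": {"a1", "a2", "a3"},
    "Java (island)": {"a4", "a5", "a6"},
    "Python (programming language)": {"a1", "a2", "a7"},
    "Indonesia": {"a4", "a5", "a8"},
}


def commonness(keyphrase: str, topic: str) -> float:
    """Fraction of the keyphrase's anchors that point to this topic."""
    counts = ANCHOR_COUNTS[keyphrase]
    return counts[topic] / sum(counts.values())


def relatedness(a: str, b: str) -> float:
    """Assumed measure: Jaccard overlap of the two topics' inlink sets."""
    links_a, links_b = INLINKS[a], INLINKS[b]
    union = links_a | links_b
    return len(links_a & links_b) / len(union) if union else 0.0


def disambiguate(
    keyphrase: str,
    context_topics: List[str],
    alpha: float = 0.5,        # assumed weight on commonness
    prune_below: float = 0.1,  # assumed threshold for dropping noisy context
) -> Optional[str]:
    """Return the candidate topic with the highest combined score."""
    best_topic, best_score = None, -1.0
    for topic in ANCHOR_COUNTS[keyphrase]:
        scores = [relatedness(topic, c) for c in context_topics]
        # Context pruning: ignore context topics that are barely related.
        kept = [s for s in scores if s >= prune_below]
        context_score = sum(kept) / len(kept) if kept else 0.0
        score = alpha * commonness(keyphrase, topic) + (1 - alpha) * context_score
        if score > best_score:
            best_topic, best_score = topic, score
    return best_topic


if __name__ == "__main__":
    print(disambiguate("java", ["Python (programming language)"]))
    print(disambiguate("java", ["Indonesia"]))
```

In this toy setup, the less common sense "Java (island)" is selected when the context contains "Indonesia", because the context relatedness outweighs the commonness prior. With an empty context the sketch falls back to commonness alone.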

  

Source: Sun, Aixin - School of Computer Engineering, Nanyang Technological University

 

Collections: Computer Technologies and Information Sciences