Advanced Search

Browse by Discipline

Scientific Societies

E-print Alerts

Add E-prints

E-print Network

  Advanced Search  

Performance Measurement Framework for Hierarchical Text Classification

Summary: Performance Measurement Framework for Hierarchical Text
Aixin Sun, Ee-Peng Lim, Wee-Keong Ng
Centre for Advanced Information Systems, Nanyang Technological University, Nanyang Avenue, Singapore 639798
Email:sunaixin@pmail.ntu.edu.sg aseplim@ntu.edu.sg
Abstract Hierarchical text classification or simply hi-
erarchical classification refers to assigning a document to
one or more suitable categories from a hierarchical cate-
gory space. In our literature survey, we have found that
the existing hierarchical classification experiments used
a variety of measures to evaluate performance. These
performance measures often assume independence be-
tween categories and do not consider documents mis-
classified into categories that are similar or not far from
the correct categories in the category tree. In this paper,
we therefore propose new performance measures for hier-
archical classification. The proposed performance mea-
sures consist of category similarity measures and distance
based measures that consider the contributions of mis-
classified documents. Our experiments on hierarchical


Source: Aixin, Sun - School of Computer Engineering, Nanyang Technological University


Collections: Computer Technologies and Information Sciences