Hierarchical Text Classification and Evaluation Aixin Sun and Ee-Peng Lim

Summary:
Center for Advanced Information Systems
Nanyang Technological University
Nanyang Avenue, Singapore 639798, Singapore
sunaixin@pmail.ntu.edu.sg aseplim@ntu.edu.sg
Hierarchical classification refers to assigning one or more
suitable categories from a hierarchical category space to a
document. While previous work in hierarchical classification
focused on virtual category trees, where documents are
assigned only to the leaf categories, we propose a top-down
level-based classification method that can classify documents
to both leaf and internal categories. Because the standard
performance measures assume independence between categories,
they do not account for documents incorrectly classified into
categories that are similar to, or not far from, the correct
ones in the category tree. We therefore propose the
Category-Similarity Measures and Distance-Based Measures to
consider the degree of misclassification
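
The summary does not give the definitions of the proposed measures, so the following is only a minimal sketch of the general idea behind a distance-based view of misclassification: an error counts as less severe when the predicted category is close to the correct one in the category tree. The tree, the category names, and the functions path_to_root and tree_distance are hypothetical and are not taken from the paper.

```python
# Hypothetical category tree stored as child -> parent.
# The category names are illustrative assumptions, not from the paper.
PARENT = {
    "sports": "root",
    "science": "root",
    "soccer": "sports",
    "tennis": "sports",
    "physics": "science",
}


def path_to_root(category):
    """Return the categories from `category` up to the root, inclusive."""
    path = [category]
    while path[-1] in PARENT:
        path.append(PARENT[path[-1]])
    return path


def tree_distance(a, b):
    """Count the edges on the shortest path between a and b in the tree."""
    # Map each ancestor of a (including a itself) to its distance from a.
    steps_from_a = {c: i for i, c in enumerate(path_to_root(a))}
    # Walk up from b until the first shared ancestor is reached.
    for steps_from_b, c in enumerate(path_to_root(b)):
        if c in steps_from_a:
            return steps_from_a[c] + steps_from_b
    raise ValueError("categories are not in the same tree")
```

Under this sketch, a document of category "soccer" that is classified as "tennis" lies two edges from the correct category, whereas one classified as "physics" lies four edges away. A measure that treats categories as independent would count both cases as equally wrong.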


Source: Aixin, Sun - School of Computer Engineering, Nanyang Technological University


Collections: Computer Technologies and Information Sciences