York University, Canada
Copyright © 2005, Idea Group Inc., distributing in print or electronic forms without written permission of IGI is prohibited.
Generally speaking, classification is the action of as-
signing an object to a category according to the charac-
teristics of the object. In data mining, classification
refers to the task of analyzing a set of pre-classified data
objects to learn a model (or a function) that can be used
to classify an unseen data object into one of several
predefined classes. A data object, referred to as an
example, is described by a set of attributes or variables.
One of the attributes describes the class that an example
belongs to and is thus called the class attribute or class
variable. Other attributes are often called independent
or predictor attributes (or variables). The set of ex-
amples used to learn the classification model is called
the training data set. Tasks related to classification