Abstract
Classification accuracy plays an important role in the evaluation of single-label and multi-label classification methods. Many different evaluation methods have been proposed to evaluate different kind of classification methods. Even in the same kind of classification methods, there are also many different evaluation methods. In this paper, we seek to present a unified evaluation criterion for the classification accuracy measurement, irrespective of single-label and multi-label classification method. We use neighborhood system theory to design absolute accuracy (AA) and relative accuracy rate (RAR) to evaluate different single-label classification methods, then apply them to evaluate different multi-label classification methods. And finally, some important examples are illustrated to understand the unified evaluation criterion in different classification situations.
Introduction
Classification refers to categorizing samples into different groups under the given labels, which is a very important task in data mining. Many classification methods have been proposed [1, 2]. However, how to choose a classification method which is more accurate than others, when a classification task is given, becomes crucial problem. As we all know, estimating the accuracy of classification method for the given task is important not only to predict its future prediction accuracy but also to choose a method. So far, there are several methods [3, 4], which are holdout, cross-validation, bootstrapping and leave-one-out etc. to be used to estimate the accuracy of classification method. While it is known that no accuracy estimation can be correct all the time [5], they all are prediction methods which used unknown data (without being classified) to identify a suitable classification model that is well suited for the given task. They usually assume that every sample data has only one label, that’s to say, each sample data must be labeled into only one category. Actually, sample data may have more than one label, which is concerned with multi-label classification [6].
In this paper, we focus on the accuracy measurement which is indispensable part for the classification method to evaluate its classification accuracy in the classification procedure. So far, there are many ways to measure classification accuracy [1, 7], different classification methods or different kind of classification methods use different ways to measure its accuracy. There is no unified measurement criterion, when considering all classification situations. So we continue our research based on our previous works [8]. We will take full consideration of different kind of classification methods (one is single-label, and the other is multi-label), and seek for a unified evaluation criterion, called unified evaluation paradigm, for the classification method accuracy with the background of neighborhood system theory.
The structure of the paper is as follows. Section 2 reviews briefly previous related work on the measurement of classification accuracy. Section 3 gives some basic concepts and innovative Theorem of neighborhood system, which will help us to present the unified evaluation criterion. A unified accuracy evaluation paradigm induced by single-label classification is shown in Section 4 and some important examples are illustrated to help understanding this paradigm. Section 5 discusses induced accuracy evaluation paradigm for the multi-label classification and shows its application with an important example. Finally, conclusions are summarized in the end.
Related works
For the measurement of single-label classification accuracy [1], the sensitivity and specificity measures can be used respectively. Sensitivity is also referred to as the true positive (recognition) rate (that is, the proportion of positive samples that are correctly identified), while specificityis the true negative rate(that is, the proportion of negative samples that are correctly identified). Then the accuracy is defined as a function of sensitivity and specificity.
Multi-label classification methods are increasingly required by modern applications, such as protein function classification, music categorization and semantic scene classification etc. For the measurement of multi-label classification accuracy [6, 7], different multi-label classification problems requires different metrics, which is different from those used in traditional single-label classification. So far, several methods, such as Hamming Loss [9], Forgiveness Rate [10], Accuracy-precision-recall [11], have been proposed to evaluate the accuracy of multi-label classification method.
Preliminaries
A neighborhood of A neighborhood system of A neighborhood system of A set
which is the set of subscript number that every element in
For example.
Here we have a new definition called derived covering according to neighborhood system.
Finally, we induced a new Theorem, which can help us to find overlapped elements from the covering
Proof For the construction of
Discussion for different single-label classification conditions
The purpose of this paper is to evaluate the accuracy of classification method distinguished with estimating accuracy in the different classification assumption after we knew classification result and ideal classification result. In order to continue our works, we define absolute accuracy (AA) and relative accuracy rate (RAR) and then evaluate accuracy of different classification method. Therefore, for the single-label classification, there are four kinds of classification results under the two ideal classification assumptions.
Ideal classification result is a partition
Suppose that
Classification result is a partition which is supposed as We have
in which
where
so
Classification result is a covering which is supposed as We have
in which
where
so
Suppose that
Classification result is a partition which is supposed as We have
in which
where
so
Classification result is a covering which is supposed as We have
in which
where
so
and according to Theorem,
We discussed four conditions for the accuracy measurement of single-label classification method. Actually, summarizing the universality from these four conditions, we have a unified form to evaluate classification accuracy according to neighborhood system. So, for the given sample space and ideal classification assumption, a unified evaluation paradigm for the accuracy of single-label classification is given as follows without need to consider difference of the classification results and difference of the ideal classification assumptions.
Suppose that
We have the final
in which
where
We have classification method 1:
and classification method 2:
Here, according to the Remarks 3, classification method 1 is more accurate than method 2 for the given sample space and ideal classification assumption.
Discussion for the accuracy of multi-label classification
We apply induced accuracy evaluation criterion into the conditions of multi-label classification. For the purpose of obvious comparison with induced criterion, we choose accuracy-precision-recall, which is mentioned above for the accuracy evaluation of multi-label classification method.
Measurement of accuracy-precision-recall [11]
Suppose that
Measurement of induced accuracy evaluation criterion
Suppose that
in which
where
Multi-label sample set
Multi-label sample set
Suppose that there are two multi-label classification methods, method 1 and method 2, to classify multi-label sample set listed by Table 1:
According to the measurement of accuracy-precision-recall, method 1:
method 2:
so, for the given multi-label sample set, according to Remarks 4, multi-label classification method 1 is more accurate than method 2. According to the measurement of induced accuracy evaluation criterion, method 1:
method 2:
so, according to Remarks 4, multi-label classification method 1 is also more accurate than method 2 for the given multi-label sample set by the measurement of induced accuracy evaluation criterion.
This paper, using neighborhood system theory, proposed a unified evaluation criterion for the accuracy of classification method, irrespective of single-label and multi-label method. We illustrated this evaluation criterion by some important examples. We hope that it can provide useful and revelatory help to the accuracy estimation of classification, accuracy evaluation of classification and accuracy improvement of classification method etc. in the future, without necessarily to consider the difference of classification method.
