Abstract
In rough fuzzy set, equivalence relation is used for approximating the target concept, which is a fuzzy set. In this paper, we extend the equivalence relation in rough fuzzy set to tolerance relation, and propose an extended model named tolerance rough fuzzy set. Furthermore, the properties of the extended model are investigated, and the proofs of the properties are given in the paper. The advantage of the extended model is that it can directly deal with continuous-valued attributes, and it does not require the process of discretization.
Introduction
Rough Set [1] was proposed by Pawlak in 1982, which uses a pair of operators to approximate target concept X, the lower approximation operator approximates X from its inside, while upper approximation operator approximates X from its outside. The objects dealt with by RS is called information system IS = (U, A), where U is a set of objects named universe, A is a set of attributes. In the framework of classification, the objects dealt with by RS is called decision table DT = (U, A ∪ C), where C is a set of decision attributes. In this paper, C includes only a decision attribute, whose values is in the set d = {1, 2, …, k}. In other words, the objects in U are categorized into k classes: C1, C2, …, C k . In decision table DT, A is called conditional attribute set. In classical RS, R is an equivalent relation, which is equivalent to a subset of A, X is a subset of U. Since the rough set was put forward in 1982, many extended models were presented, these extensions can be roughly classified into two categories: the extensions in certain circumstance and the extensions in uncertainty circumstance. The extensions in certainty circumstance can further be classified into two categories: the improvements of the definitions and the extensions of the equivalent relation, see fig. 1.
In the definition of the lower approximation operator in classical rough set, it is required that the equivalence class belongs to the target concept X completely, in other words, it is required that all elements of the equivalence class belong to the target concept X. While in the definition of the upper approximation operator in classical rough set, if even only one element of the equivalence class belong to the target concept X, then the equivalence class belong to the upper approximation. This limitation severely restricts the applicability of the rough set approach to many problems. In 1993, Ziarko proposed variable precision rough set [2] by introducing multiple inclusion relation into RS to deal with the problems mentioned above. Along with this technical route, in order to deal with the same problems, Yao proposed probabilistic rough set [3] in 2003 by introducing rough membership function into RS, and decision rough set [4] in 1992 by introducing Bayes risk into RS respectively. In this route, the hot research topics are focused on the study of three-way decision, which is based on decision rough set, many researchers pay their attention to this study[5–9].
In the first category, along with the technical route of extension of equivalent relation, Yao and Lin proposed generalized rough set [10] in 1996 by replacing equivalent relation with generalization relation. Skowron proposed tolerance rough set [11] in 2000 by replacing equivalent relation with tolerance relation also named similarity relation. Grecoet al. proposed dominance rough set [12] by replacing equivalent relation with dominance relation. In this category, the hot research topics are focused on tolerance rough set and dominance rough set, especially the former[13–15].
In the second category, Dubois and Prade proposed rough fuzzy set [16] in 1990 by replacing crisp target concept with a fuzzy target concept. They also proposed fuzzy rough set [16] in the same year by replacing equivalent relation with a fuzzy similarity relation and replacing crisp target concept with a fuzzy target concept. In addition, three other kinds of fuzzy rough sets were independently proposed by Kuncheva [17], Nanda [18] and Yao [19]. Rough fuzzy set is a special case of fuzzy rough set [16]. There are three hot topics: attributes reduction [20-24] of fuzzy rough set, the extensions of fuzzy rough set, and the applications of fuzzy rough set. Jensen and Parthaláin applied method of fuzzy-rough sets to feature selection for alleviating computational effort, and proposed two approaches [25]. The first approach calculates nearest neighbours prior to search and then uses only these neighbours for the subsequent fuzzy-rough dependency calculations. The second approach is an attempt to tackle the problem of larger data from the perspective of large dimensionality. Based on extended t-norms with respect to type-2 fuzzy relations, a model of type-2 fuzzy rough sets were proposed by Wang [26]. This model is defined on two finite universes of discourse and is characterized with both constructive and axiomatic approaches. By using the maximal and minimal membership degrees of an object with respect to a fuzzy set based on multi-fuzzy tolerance relations, Feng and Mi proposed two types of variable precision multi-granulation fuzzy rough sets [27]. Based on fuzzy rough sets, Vluymans et al. proposed a fuzzy rough classifier to deal with the problems of class imbalanced multi-instance data [28]. Wu et al. comprehensively investigated the axiomatic characterizations of (S, T)-fuzzy rough approximation operators [29].
All works mentioned above can only deal with the fuzzy conditional attributes, but many data sets in practice are with continuous-valued attributes. It is necessary for fuzzy rough set and rough set to fuzzify conditional attributes. In this paper, we extend the rough fuzzy set by replacing equivalence relation with tolerance relation, and propose an extended model named tolerance rough fuzzy set, denoted by TRFS in short. TRFS can overcome the drawback mentioned before. In addition, the properties of the extended model are investigated, and the proofs of the properties are given in the paper.
The paper is organized as follows. The preliminaries used in this paper are given in Section 2. The proposed extended model of rough fuzzy set are presented in Section 3. The properties of tolerance rough fuzzy set model and their proofs are presented in Section 4. Section 5 concludes the paper.
Preliminaries
In this section, we briefly review the preliminaries in the framework of classification, including rough set, rough fuzzy set, and tolerance rough set.
Rough set
Rough set uses a pair of operators to approximate the target concepts. Let DT = (U, A ∪ C) be a decision table, where U = {x1, x2, …, x
n
}, A = {a1, a2, …, a
d
}, the instances in U are categorized into k classes: C1, C2, …, C
k
, i.e. U/C = C1 ∪ C2 ∪ ⋯ ∪ C
k
. Let x ∈ U and R is an equivalence relation induced by a subset of A, the equivalence class containing x is given by:
Given a decision table, for arbitrary target concept C
i
(1 ≤ i ≤ k), the lower approximation and the upper approximation of C
i
with respect to R are defined by
Given an equivalence relation R, U can be divided into three disjoint regions: POS
R
(U), NEG
R
(U) and BND
R
(U), which are called positive region, negative region and boundary region respectively. The definitions of POS
R
(U), NEG
R
(U) and BND
R
(U) are given as follows.
Rough fuzzy set (RFS) [16] is an extended model of rough set. In RFS, the used knowledge is also equivalence relation, while the target concept is a fuzzy set. Without loss of generality, we also use C i (1 ≤ i ≤ k) as the k fuzzy target concepts, which are k fuzzy sets. The corresponding decision table is called fuzzy decision table, an example of fuzzy decision table with 5 instances is given in Table 1. In this example, there are three fuzzy target concepts: Poor, Good and Excellent, which are three fuzzy sets.
Given a fuzzy decision table, for arbitrary fuzzy target concept C i (1 ≤ i ≤ k), the lower approximation and the upper approximation of C i with respect to equivalence relation R are defined by
According to the fuzzy extension principle, (7) and (8) can be equivalently written as follows.
Tolerance rough set (TRS) [11] is another extension of rough set. TRS extends rough set by replacing an equivalence relation with a similarity relation. The target concept is same as in rough set, which is also a crisp decision class.
Given a decision table DT = (U, A ∪ C), R is a similarity relation defined on U, if and only if R satisfies the following conditions:
(1) Reflexivity, i.e. for each x ∈ U, xRx;
(2) Symmetry, i.e. for each x, y ∈ U, xRy, and yRx.
We can define many similarity relations on U, such as, the definitions given in (11), (12) and (13).
In (11)-(13), a ∈ A, x ∈ U, and a max and a min denote the maximum value and minimum value of a respectively, σ a is variance of attribute a.
For ∀R ⊆ A, we can define the similarity relations induced by subset R as follows.
For each x ∈ U, the τ tolerance class generated by a given similarity relation R is defined as:
Given a decision table and a similarity threshold τ, for arbitrary target concept C
i
(1 ≤ i ≤ k), the tolerance lower approximation and the tolerance upper approximation of C
i
with respect to R are defined by
In this section, we present the proposed extended model named tolerance rough fuzzy set(TRFS), which can deal with the fuzzy decision table with continuous-valued conditional attributes and fuzzy-valued decision attribute, Table 2 is an example of this kind of fuzzy decision table with 6 instances. In this fuzzy decision table, there are three continuous-valued conditional attributes a1, a2, a3 and two fuzzy
decision classes, i.e. two fuzzy target concept C1, C2. In the following, we first present the definitions of tolerance rough fuzzy lower approximation and tolerance rough fuzzy upper approximation in Subsection 3.1, and then present an example in Subsection 3.2 to illustrate the related concepts.
The definitions of tolerance rough fuzzy lower approximation and tolerance rough fuzzy upper approximation
Given a fuzzy decision table DT = (U, A ∪ C), R is a similarity relation defined on U, τ is a similarity threshold, C
i
is the ith fuzzy decision class which is a fuzzy set. The tolerance rough fuzzy lower approximation and tolerance rough fuzzy upper approximation of C
i
with respect to R are defined by
Similarly to (9) and (10), we have the following equivalent definitions:
Regarding the effect of threshold τ on and , we have the following proposition.
In other words, the bigger the τ is, the greater the is, the smaller is.
Given a fuzzy decision Table 2. Let R1 = {a1}, R2 = {a1, a2}, R3 = {a1, a2, a3}, and τ = 0.8. Compute and , i = 1, 2, 3.
(1) Compute . Because
Hence, according to (21), we have
Similarly, we can obtain
(2) Compute . Because
Hence, according to (22), we have
Similarly, we can obtain
(3) Compute . Because
Hence, according to (21), we have
Similarly, we can obtain
(4) Compute . Because
Hence, according to (22), we have
(5) Compute . Because
Hence, according to (21), we have
Similarly, we can obtain
(6) Compute . Because
Similarly, we can obtain
The Properties of tolerance rough fuzzy set model
Given a fuzzy decision table DT = (U, A ∪ C), R is a similarity relation defined on U, τ is a similarity threshold, C i is the ith fuzzy decision class, ∀x, y ∈ U, we have the following conclusions.
Hence, the property (1) is hold.
(2) Because
(3) Because
(4) Because
Given a fuzzy decision table DT = (U, A ∪ C), R is a similarity relation defined on U, τ is a similarity threshold, C i and C j are the ith and jth decision class respectively. The tolerance rough fuzzy lower approximation and tolerance rough fuzzy upper approximation satisfy the following properties.
And
Because
Hence
(2) Because
Therefore
Hence
Because
Therefore
Hence
(3) Because
(4) Because
Hence, the property (4) is hold.
(5) Because
Hence, the property (5) is hold.
(6) Because
Hence, the property (6) is hold.
(7) Because
Hence, we have
Hence, we have
(9) Because
Hence, the property (9) is hold.
(10) Because
Hence, the property (10) is hold.
(11) Because
Hence, the property (11) is hold.
(12) Because
(13) Because
Hence, the property (13) is hold.
(14) Because
Hence, the property (14) is hold.
Conclusions
This paper proposes an extended model of rough fuzzy set. The advantage of the proposed model is that it can directly deal with continuous-valued data sets without discretizing the conditional attributes. Accordingly, there is no information loss in the process of dealing with the fuzzy decision tables. In addition, the properties of the proposed model are investigated in details. The promising future works of this study include (1) the monotonicity of the lower approximation and the upper approximation with respect to similar parameter τ; (2) the numerical characteristics of the proposed TRFS model, including significance of knowledge, dependency of knowledge, rough degree, etc. (3) the attribute reductions and core, and the corresponding computing methods.
Footnotes
Acknowledgments
This research is supported by the national natural science foundation of China (61170040, 71371063), by the natural science foundation of Hebei Province (F2013201110, F2013201220), by the Key Scientific Research Foundation of Education Department of Hebei Province (ZD20131028) and by the Opening Fund of Zhejiang Provincial Top Key Discipline of Computer Science and Technology at Zhejiang Normal University, China.
