Abstract
As a new and meaningful extension of the Pawlak rough set, multi-granulation rough sets (MGRSs) have attracted much attention and fruitful achievements have been reported in different aspects. By combining with fuzzy rough set, the paper introduces multi-granulation fuzzy rough sets in the covering approximation space, namely, covering-based multi-granulation fuzzy rough sets (CMFRS), which form the extension of fuzzy rough sets. We first investigate several important properties of lower and upper approximations of concepts in covering-based multi-granulation fuzzy rough sets and elaborate on the differences between the proposed models and the existing ones in literature. By employing the notions of reduct and exclusion of a covering, the paper studies the necessary and sufficient conditions for two CMFRS to generate identical lower and upper approximations of a target concept in the given covering approximation space. Finally, the relationships between the new models are explored in the paper.
Introduction
As a tool for data analysis and processing, rough set theory has been successfully applied to many fields, such as such as pattern recognition, machine learning, knowledge acquisition and data mining [1–14].
As Pawlak [15] introduced the rough set approximation operators with respect to equivalence relations, this may induced some restrictions in practical applications. To enhance the scope of usage of Pawlak’s rough sets, many meaningful extensions have been proposed, for example, the graded rough sets [16], tolerance or similarity relations based rough sets [18–20], rough fuzzy sets and fuzzy rough sets [21], etc. As one of the extension models, covering-based rough sets, which was first proposed by Zakowski [22], has attracted much attention and induced interesting results. Pomykala [23] introduced several pairs of dual approximation operators of covering-based rough sets. Bonikowski et al. [24] studied the covering-based rough approximation operators based on the mutual correspondence of the concepts of extension and intension. Tsang et al. [25] proposed a new upper approximation operator of covering-based rough sets, which may help to extract more efficient rules. Mordeson [26] applied covering-based rough sets to ideal theory and discussed basic properties of the upper approximation operator and showed how it can be used to give algebraic structural properties of certain subsets. Zhu and Wang [27, 28] proposed three types of covering-based rough sets and introduced the notions of reduct and exclusion of a covering. Grzymala-Busse [63–66] has done a series of studies on the missing attribute values, which also an important aspect of covering-based rough approximation operators. Liu and Sai gave a comparison of two types of rough sets induced by coverings [67]. Qin et al. discussed some properties of five pairs of dual covering approximation operators, and presented conditions with which these covering approximation operators are identical [68]. Zhang and Luo changed five pairs of covering approximation operators into the same pair of relation approximation operators [69]. Restrepo et al. investigated partial order relation for approximation operators in covering based rough sets [70]. Restrepo et al. used the concepts of duality, conjugacy and adjointness to establish relationships between the most commonly used covering approximation operators [71]. Zhu et al [29, 30] discussed some measurements of uncertainty in covering-based rough sets. Ma [31] introduced some types of neighborhood-related covering rough sets. Zhang et al. [32] have shown the closeness of union and intersection operations of rough approximation pairs. As a guideline and directional research of covering-based rough sets, Yao et al. [33] proposed a framework for the study of covering based rough set approximations. They summarized and classified the existing approximation operators into element-based, granule-based and subsystem-based definitions, which enables us to reproduce many existing approximation operators and introduce some new approximationoperators.
From the viewpoint of Granular Computing [34–37], in Pawlak’s rough set model and its various extended rough set models, a target concept is characterized by a pair of approximations under a single granulation, which cannot be applied in some practical multi-granulation circumstances, and therefore, they can be named as the single-granulation rough sets. To enable rough set theory to fit the multi-granulation circumstances, Qian et al. [38, 39] introduced the concept of multi-granulation rough sets (MGRS), where the approximations of a target concept are characterized under multi-equivalence relations. In MGRS, due to the different requirements, a target concept can be described by two different strategies, i.e., “seeking common reserving difference” and “seeking common rejecting difference”. Following these two strategies, the optimistic and pessimistic MGRS approximations have been proposed, respectively. Up to now, many researchers have focused on the development of the MGRS. Those studies can be divided into three categories. The first category is the construction of approximation operators. For instance, Qian et al. [40] extended MGRS to incomplete information systems with respect to multiple tolerance relations. Yang et al. [41] constructed multi-granulation rough sets based on the fuzzy approximation space. Xu et al. [42–44] constructed three kinds of multi-granulation rough sets based on the tolerance, ordered and generalized relations, respectively. Lin et al. [45] discussed two kinds of neighborhood-based multi-granulation rough sets and three kinds of covering based multigranulation rough sets [46]. Liu et al. [47–49] studied the covering-based multi-granulation rough sets by employing the minimal and maximum descriptions of target concepts. Yang et al. [50] discussed the updating multigranulation rough approximations with increasing of granular structures. Xu et al. introduced [51] multi-granulation fuzzy rough sets and Huang et al. [52] proposed intuitionistic fuzzy multigranulation rough sets. Qian et al. [53] developed the multigranulation decision-theoretic rough set and proved that many existing multigranulation rough set models can be derived from the multigranulation decision-theoretic rough set framework. The second category is the topology properties of multi-granulation rough sets. For example, Yang et al. [54] discussed the hierarchical structures of multi-granulation rough sets. She et al. [55] and He et al. [56] investigated the topological and lattice-theoretic properties of multi-granulation rough sets. Lin et al. [57] explored multigranulationtopological rough space and proposed topological granularity and topological entropy to characterize the uncertainty of the multigranulation topological rough space. Geetha et al. [58] discussed the algebraic properties of multigranulation rough set on two universes. The third category is feature selection. For example, Liang et al. [59] introduced an efficient rough feature selection algorithm from a multi-granulation view. Lin et al. [60] discussed feature selection via neighborhood multi-granulation fusion. Liu et al. [61] proposed a rule-extraction framework under multigranulation rough sets.
The main objective of this paper is to establish a CMFRS model by using different approximation strategies according to different applied backgrounds. As it is known that the objective concepts in most existing MGRS models are suppose to be distinct. But in reality, people often lack an explicit relation to granulate the universe due to imprecise human knowledge. That is to say, many real problems involve the situation of vague concepts, such as symbols of diseases, mechanical defects, and so on. Most existing MGRS models based on distinct concepts have a limitation for this issue. Therefore,it is necessary to employ the idea of fuzzy rough sets to the domain. To provide a theoretical framework for dealing with real problems involved vague concepts, in this paper we propose six kinds of covering-based multi-granulation rough set models, in which a concept is described by multiple binary relations instead of a single binary relation. This idea is also accordant with decision making and information fusion in practice. It is may more credible and reasonable to make the decision by combining the diagnoses from all specialists rather than from only one specialist when diagnosing whether or not a patient has contracted some diseases in a medical diagnosis problem. In another aspect, the six types of covering rough sets proposed in this paper are different from the types of covering-based fuzzy rough sets available in the literature, since we discuss the issues of covering-based fuzzy rough sets in the multi-granulation environment, and little attention has been given to the covering-based multi-granulation fuzzy rough sets. Moreover, we use the maximal and minimal descriptors of objects to approximate a target concept, which is different from the current methods. And our models are based on the multi-covering, whereas, the other types of covering-based fuzzy rough sets introduced at various stages are based on only one covering, thus, they could be considered as special cases of our model or our models are extensions of them. Furthermore, from the viewpoint of granular computing, many existing MGRS models can be derived from our new framework of covering-based multi-granulation fuzzy rough sets by setting specific binary relations or granular structures.
The paper is organized as follows. The next section deals with some preliminary concepts and properties regarding the Pawlak’s rough sets, covering-based rough sets, fuzzy rough sets and multi-granulation roughsets. In section 3, we introduce six kinds of constructing methods of rough approximation operations of CMFRS and discuss their basic properties and applications. Firstly, to compare with MGRS, we detail the similarities and different points between MGRS and CMFRS. Secondly, based on the concept of reduction of a covering, we obtain the sufficient conditions for two same type CMFRS to generate identical lower approximation operator or upper approximation operator and find that the necessary conditions may not satisfied. In section 4, we present some results concerning the interrelationship among six kinds of CMFRS proposed in this paper. The conditions under which six kinds of approximation operators are identical are also discussed. The last section concludes the present research and brings some remarks for future works.
Basic concepts and properties
Pawlak’s rough sets
Let U denote a nonempty and finite set called the universe and R ⊆ U × U be an equivalence relation on U. The pair apr = (U, R) is called an approximation space. The equivalence relation R partitions the universe U into disjoint subsets, each subset is called equivalence class or equivalence granule. Given an arbitrary set X ⊆ U, it may not be described precisely by the present information in (U, R). One may characterize X by a pair of lower and upper approximations defined asfollows.
, ∩X ≠ ∅}.
The pair is referred to as rough set approximation of X.
Let ∼X = U - X, we have the following basic properties of Pawlak’s rough sets.
Covering-based rough sets
We review some fundamental concepts about covering-based rough sets such as the minimal descriptor, neighborhoods of an object and approximation operators of a given concept in this subsection.
md C (x) ={K ∈ C x | ∀ S ∈ C x (S ⊆ K ⇒ K = S)},
MD C (x) ={K ∈ C x | ∀ S ∈ C x (S ⊇ K ⇒ K = S)} are called the minimal description and the maximal description of x, respectively, where C x ={K ∈ C|x ∈ K}.
n1 (x) = ∩{K|K ∈ C x }, n2 (x) = ∪{K|K ∈ C x }
n3 (x) = ∩{K|K ∈ md C (x)}, n4 (x) = ∪{K|K ∈ MD C (x)}
There are many kinds of approximation operators have been proposed in covering-based rough sets, but as Yao et al. [33] have pointed out that it seems more reasonable the operators as a pair meet some relationships. Thus, Yao et al. introduced several pairs of approximation operators which satisfy the property of duality.
Fundamentals of MGRS
In recent years, Qian et al. [38, 39] have proposed a new rough set model named multi-granulation rough sets, where a target concept is approximated by multiple binary relations. In this subsection, we will briefly outline two definitions of multi-granulation rough set models, i.e., optimistic and pessimistic multi-granulation rough sets, respectively. The detailed descriptions could be found in [38, 39].
Covering-based Multi-granulation Fuzzy Rough Sets(CMFRS)
In this section, we address the problems of the rough approximations of a fuzzy set based on multiple covering relations, that is, we extend fuzzy rough sets to a multi-granulation covering approximation space. Being different from the definition of covering approximation space introduced by Zakowski [22], we define the pair 〈U,
For the purposes of clearness and easy understanding, we only take the case of two coverings into consideration in this paper. In what follows, F (U) denotes all the fuzzy sets defined on U.
If , then X is called a type-1 covering-based multi-granulation fuzzy rough set with respect to C1, C2, ⋯ , C m , else X is called a fuzzy definable set.
If , then X is called a type-2 covering-based multi-granulation fuzzy rough set with respect to C1, C2, ⋯ , C m , else X is called a fuzzy definable set.
If , then X is called a type-3 covering-based multi-granulation fuzzy rough set with respect to C1, C2, ⋯ , C m , else X is called a fuzzy definable set.
If , then X is called a type-4 covering-based multi-granulation fuzzy rough set with respect to C1, C2, ⋯ , C m , else X is called a fuzzy definable set.
If , then X is called a type-5 covering-based multi-granulation fuzzy rough set with respect to C1, C2, ⋯ , C m , else X is called a fuzzy definable set.
If , then X is called a type-6 covering-based multi-granulation fuzzy rough set with respect to C1, C2, ⋯ , C m , else X is called a fuzzy definable set.
Next, an example is given to show the differences among the CMFRS models defined above.
Suppose is a fuzzy set defined in U. According to the above definitions, we have the following results.
,
;
,
;
,
;
,
;
,
;
,
.
From Example 1, we found that for a given fuzzy subset of U, the six types of approximations of X are different from each other, which means that the above six types of CMFRS models are also different from each other.
Yao [17] and many others [62] emphasized that the duality is one of the most important properties of lower and upper approximations and a number of results on the duality of approximation operators have been reported.
Proposition 1 gives the duality of the six types of approximations defined in the paper.
(1) ,
;
(2) ,
;
(3) ,
;
(4) ,
;
(5) ,
;
(6) ,
.
According to Definition 7, for any x ∈ U
Analogously,
This completes the proof of Proposition 1. □
In the following, we will use symbol ◊ to denote M1, M2, . . . , M6.
If X ⊆ Y, then ; If X ⊆ Y, then ;
;
;
;
.
(1) According to Definition 7, we have . If X ⊆ Y, then X (y) ≤ Y (y), therefore, ;
One can prove the remaining five cases in a similar manner, the proof is omitted here.
(2) It can be proved in a similar way like (1).
This completes the proof of Proposition 2. □
In Proposition 2, (1) and (2) show the monotonicity of covering rough approximations, w.r.t., the variety of target or concept, and each of (3)-(6) expresses the relationship between the rough approximation of X ∩ Y (or X ∪ Y) with the intersection (or union) of the two rough approximations of X and Y under the multi-covering environment.
From Proposition 3, one may see that for any element in the coverings which construct a given CMFRS model, its lower approximation in that model is not larger than itself, if the element is in partitions then its lower approximation in that model is itself.
From Definition 13, if C1 ⪯ C2, then for any x ∈ U, we have md C 1 (x) ⊆ md C 2 (x) and MD C 1 (x) ⊆ MD C 2 (x).
(1) If C1 ⪯ C2 ⪯ ⋯ ⪯ C m , then ;
(2) if C1 ⪯ C2 ⪯ ⋯ ⪯ C m , then .
(1) If C1 ⪯ C2 ⪯ ⋯ ⪯ C m , by Definition 13, we have md C 1 (x) ⊆ md C 2 (x) ⊆ ·· · ⊆ md C m (x).
Then for any X ∈ F (U) and x ∈ U, we have
(2) If C1 ⪯ C2 ⪯ ⋯ ⪯ C m , by Definition 13, we have md C 1 (x) ⊆ md C 2 (x) ⊆ ·· · ⊆ md C m (x).
Then for any X ∈ F (U) and x ∈ U, we have
This completes the proof of Theorem 2. □
A reduct of a covering is a minimal subcovering that retains each object subset’s approximation. The relationship between approximations produced by C and corresponding reduct (C) in same covering approximation space is an interesting topic. Next we shall discuss the relationships of approximations produced by C1, C2, ⋯ C m and their corresponding reduct (C1), reduct (C2), ⋯, reduct (C m ).
reduct (C1) = reduct (C3), reduct (C2) = reduct (C4) and at the same time, exclusion (C1) = exclusion (C3), exclusion (C2) = exclusion (C4); reduct (C1) = reduct (C4), reduct (C2) = reduct (C3) and at the same time, exclusion (C1) = exclusion (C4), exclusion (C2) = exclusion (C3).
According to Theorem 1, for any x ∈ U, C and its corresponding reduct (C) yield the same md (x). If reduct (C1) = reduct (C3) and reduct (C2) = reduct (C4), then C1 and C3 produce the identical md (x), and C2 and C4 share the other md (x). According to Definition 7, 8 and 11, M1C1+C2 and M1C3+C4, M2C1+C2 and M2C3+C4, M5C1+C2 and M5C3+C4 produce the identical lower approximation operators of CMFRS.
For any x ∈ U, C and corresponding exclusion (C) yield the identical MD (x). If exclusion (C1) = exclusion (C3) and exclusion (C2) = exclusion (C4), then C1 and C3 produce the same MD (x), and C2 and C4 share the other MD (x). According to Definition 9, 10 and 12, M3C1+C2 and M3C3+C4, M4C1+C2 and M4C3+C4, M6C1+C2 and M6C3+C4 produce the identical lower approximation operators of CMFRS.
This completes the proof of Theorem 4. □
In fact, the above theorem offers sufficient conditions for which different CMFRS models produce the identical lower approximation operators.
Remark 2 can be illustrated by using the following example.
According to Theorem 1, we have
reduct (C1) = {{x1, x2} , {x2, x3, x4} , {x3, x4}},
reduct (C2) = {{x1, x2, x3} , {x2, x4} , {x1, x2, x4}};
reduct (C3) = {{x1, x2} , {x2, x3, x4}},
reduct (C4) = {{x1, x2, x3} , {x2, x4}}.
Hence,
reduct (C1) ≠ reduct (C3) , reduct (C2) ≠ reduct (C4)
But, we have
,
,
,
.
(1) reduct (C1) = reduct (C3) and reduct (C2) = reduct (C4)
(2) reduct (C1) = reduct (C4) and reduct (C2) = reduct (C3)
According to Theorem 1, for any x ∈ U, C and reduct (C) generate the same md (x). If reduct (C1) = reduct (C3) and reduct (C2) = reduct (C4), then C1 and C3 own the same md (x), and at the same time, C2 and C4 share another md (x). According to Definition 7, 8 and 11, M1C1+C2 and M1C3+C4, M2C1+C2 and M2C3+C4, M5C1+C2 and M5C3+C4 produce the identical covering multi-granulation fuzzy rough lower and upper approximations.
This completes the proof of Theorem 5. □
Example 4 gives a proof to Remark 3.
From Theorem 1, we can calculate that
reduct (C1) = {{x1} , {x2} , {x3}},
reduct (C2) = {{x1, x2} , {x1, x3}},
reduct (C3) = {{x1, x2} , {x2, x3} , {x1, x3}},
reduct (C4) = {{x1, x3} , {x1, x2, x3}}.
Therefore,
reduct (C1) ≠ reduct (C3), reduct (C1) ≠ reduct (C4)
reduct (C2) ≠ reduct (C3), reduct (C2) ≠ reduct (C4)
But, given , , we have
,
;
,
;
,
.
exclusion (C1) = exclusion (C3) and exclusion (C2) = exclusion (C4) ; exclusion (C1) = exclusion (C4) and exclusion (C2) = exclusion (C3).
Theorem 6 gives the sufficient conditions for which two different Type-3 or Type-4 or Type-6 CMFRS models produce the identical upper approximation operators.
Remark 4 can be proved by the following example.
From Definition 15, we can obtain
exclusion (C1) = {{x3, x4} , {x1, x2, x3} , {x1, x2, x4}},
exclusion (C2) = {{x1, x2} , {x3, x4}},
exclusion (C3) = {{x1, x2, x3, x4}},
exclusion (C4) = {{x1, x2, x4} , {x3, x4}}.Therefore,
exclusion (C1) ≠ exclusion (C3) and
exclusion (C2)≠ exclusion (C4) ;
exclusion (C1) ≠ exclusion (C4) and
exclusion (C2) ≠ exclusion (C3). But, let , we have
Relationships Among Types of CMFRS
We investigate the relationships among the six types of covering-based multi-granulation fuzzy rough sets proposed in the previous section. In addition, some interesting properties are also discussed here.
According to the definitions of CMFRS, the following theorem can be obtained.
;
;
;
;
.
Theorem 7 can be easily proved by employing the definitions of CMFRS, we omit them here. For the readers’ convenience, the relationships among the six types of CMFRS models are described by Fig. 1. In Fig. 1, each node denotes an approximation or a concept, and each line connects two approximations, where the lower node is a subset of the upper node. Unfortunately, for any X ⊆ U, the set of all the six types of lower and upper approximations equipped with the binary relation ⊆ of inclusion, can not construct a lattice.
From Fig. 1, the following conclusions are obtained.
First, from Fig. 1, one could find that among those six types of lower approximations of CMFRS, the first type lower approximation is the best one to describe X, since in Fig. 1 the first type lower approximation is closer to X than other types of lower approximations. While for the case of upper approximation, we may not decide which type of upper approximations is the best one to approximate the concept X. Moreover, the fourth type of lower approximation is the worst to describe X, since in Fig. 1 the fourth type of lower approximation has the furthest distance with X with respect to other types of lower approximations. And we can not figure out which is the worst to describe X for the case of upper approximations.
Second, from Fig. 1, one could also find that and are generally not equal, and one does not contain another, that is, neither nor , that is to say, and are independent. So do and , and , and .
Finally, from Fig. 1, one could obtain the following theorem, which contains some important conclusions about the relationships among the six pairs of lower and upper approximations.
,
,
,
,
,
.
□
,
,
,
,
,
.
From Theorem 8 and Corollary 1, one could see that there exist some meaningful relationships among the six pairs of lower and upper approximations. For instance, from (1) of Theorem 8, one could find that for any X ⊆ F (U), the type-5 lower approximation of X is a superset of the union of the type-2 and type-3 lower approximations of X. Specially, from (1) of Corollary 1, if C1, C2, ⋯ , C m are all partitions of U, then the type-5 lower approximation of X is equal to the union of the type-2 and type-3 lower approximations of X. Moreover, from (2)-(6) of Theorem 8 and (2)-(6) of Corollary 1, some similar conclusions could be obtained.
Example 6 shows the above results about the six pairs of lower and upper approximations given in Fig. 1.
,
;
,
;
,
;
,
;
,
;
,
.
Therefore,
,
,
,
,
,
,
,
,
,
,
.
Therefore, and are all not satisfied.
C is also said to be unary if for any x ∈ U,|MD (x) | = 1.
,
;
,
;
,
.
If C1, C2, ⋯ , C
m
are all unary, then for any x ∈ U, we have that ∩{K|K ∈ md
C
i
(x)} = ∪{K|K ∈ md
C
i
(x)}. Therefore, according to Definition 7, 8, 9 and 10, one has
Theorem 9 proposes a sufficient condition for which type-1 and type-2 CMFRS produce the identical approximations. Moreover, Theorem 9 also gives the sufficient condition for which type-3 and type-4 CMFRS and type-5 and type-6 produce identical approximations.
Remark 6 could be illustrated by means ofExample 8.
Suppose that , one has
Obviously, both C1 and C2 are not unary. But, owing to the definitions of CMFRS, we have
Conclusions
We have introduced six types of covering multi-granulation fuzzy rough set models and studied some important properties of these models. We have shown that our CMFRS models are natural extensions of MGRS models in covering approximation space. Moreover, the sufficient conditions for two distinct CMFRS models which produce identical lower and upper approximations of a given concept in a covering approximation space are disclosed. Finally, the relationships among various CMFRS models were discussed in details. The outputs show that for any X ⊆ F (U), the set of all those lower and upper approximations equipped with the binary relation ⊆ of inclusion, can not construct a lattice.
Acknowledgments
This work is supported by China National Natural Science Foundation of Youth Science Foundation under Grant No.: 61305052, 61403329 and the State Scholarship Fund of China (File No. 201409865003).
