Abstract
Vague sets are a further extension of fuzzy sets. In rough set theory, target concept can be characterized by different rough approximation spaces when it is a vague concept. The uncertainty measure of vague sets in rough approximation spaces is an important issue. If the uncertainty measure is not accurate enough, different rough approximation spaces of a vague concept may possess the same result, which makes it impossible to distinguish these approximation spaces for charactering a vague concept strictly. In this paper, this problem will be solved from the perspective of similarity. Firstly, based on the similarity between vague information granules(VIGs), we proposed an uncertainty measure with strong distinguishing ability called rough vague similarity (RVS). Furthermore, by studying the multi-granularity rough approximations of a vague concept, we reveal the change rules of RVS with the changing granularities and conclude that the RVS between any two rough approximation spaces can degenerate to granularity measure and information measure. Finally, a case study and related experiments are listed to verify that RVS possesses a better performance for reflecting differences among rough approximation spaces for describing a vague concept.
Keywords
Introduction
There are couple of effective tools for dealing with uncertain knowledge, i.e., fuzzy sets [44], rough sets [26], probability theory [25], and cloud model [19], etc.. Vague sets [11], as well as intuitionistic fuzzy sets (further generalization of fuzzy sets) [3], are essentially the same. The similarities and differences between vague sets and intuitionistic fuzzy sets are further investigated in literature [13], which concluded that vague sets are more universally applicable than intuitionistic fuzzy sets to some degree. Based on fuzzy sets, vague sets generalize the membership degree from single value to interval value. As a classical soft computing tool, vague sets have more powerful ability to handle uncertain information than fuzzy sets. Currently, vague sets have been applied in various fields, including decision making [1, 14], pattern recognition [35, 46], control and reasoning [5, 49], et al.
Rough sets [26] are effective tool to handle uncertain information by using the given information granulation. In rough set theory, target concepts can be described by upper and lower approximation sets. Generally speaking, the target concepts in classical rough sets are usually accurate and crisp. However, the target concept may be vague or uncertain in many real applications. To solve this problem, there are many research works [28, 50] on the combination between rough sets and vague sets. Especially, the research on uncertainty of vague concepts in rough approximation spaces becomes a basic issue. Current works [4, 48] mainly focus on measuring the uncertainty of vague sets itself, but it is necessary to consider the uncertainty of vague concepts in rough approximation spaces. Zhang [50] constructed the approximation sets of a vague set in rough approximation spaces and proved that average-step-vague set is an optimal step-vague set. Moreover, by defining the concept of interval fuzziness of vague sets, Zhang [52] proposed the average fuzziness of vague sets. By constructing rough approximations of vague sets in fuzzy approximation spaces, Shen [36] presented two types of roughness measure of vague sets. These works indicate that rough sets are suitable mathematical tool for studying vagueness and uncertainty.
Granular computing (GrC) [20, 42] provides a method to simulate the human mechanism to deal with complex problems hierarchically. Different knowledge spaces can be induced by diverse attribute sets, thus the multi-granularity rough approximations of uncertain knowledge can be achieved [9, 40]. The uncertainty of multi-granularity approximation spaces for describing a vague concept is an important issue, and the rules of uncertainty of vague sets in multi-granularity approximation spaces should be in line with human cognitive mechanism. However, when a vague concept is described by different rough approximation spaces respectively, these approximation spaces may possess the same measure result, and it does not mean these approximation spaces are completely equivalent. Moreover, the current similarity measures [4, 39] focus on reflecting the similarity between two vague sets, and the difference among approximation spaces for characterizing a vague concept cannot be reflected. Actually, in some cases, such as attribute evaluation, pattern recognition and knowledge matching, it is necessary to discriminate these rough approximation spaces when describe a vague concept. Take attribute evaluation as an example, if the rough approximation spaces corresponding to different attributes cannot be distinguished, ranking by attribute significance will be not accurate. Therefore, it is necessary to establish an uncertainty measure model with strong distinguishing ability in multi-granularity spaces. This is one of motivations of this paper. In the view of GrC, the information granules will be subdivided into finer granules with granularity being finer gradually. That means a vague concept can be described by rough approximation spaces with different granularities with the increase of attributes. Is there any rules in describing the same vague concept in multi-granularity approximation spaces? From the above discussion, the description for a vague concept in multi-granulation rough approximation spaces still need further research.
The main contributions of this paper are as follows: (1) to reflect the similarity between equivalence classes for characterizing a vague concept, the similarity between vague information granules (VIGs) is defined; (2) based on the similarity between VIGs, rough vague similarity (RVS) is proposed to measure the differences among approximation spaces for characterizing a vague concept; (3) the multi-granularity approximations of a vague set is analyzed from the perspective of similarity, then the change rules of the RVS with the changing granularities are revealed.
The rest of this paper is organized as follows. Many preliminary concepts are briefly recalled in Section 2. In Section 3, the rough vague similarity is proposed and its relative properties are investigated. In Section 4, the multi-granularity rough approximations of a vague concept are discussed in the view of similarity. In Section 5, a case study for relevant illustrations and related experiments are presented. In Section 6, the conclusions are drawn.
Preliminaries
To facilitate the description of this paper, many basic concepts are reviewed briefly in this section.
Let S = (U, C ∪ D) be a decision system, where U = {x1, x2, …, x N } is the universe of discourse, C is the set of condition attributes, and D is the decision attribute. If R ⊆ C, U/R = {[x] R } = {[x] 1, [x] 2, …, [x] l } denotes a partition of U induced by U/R, where [x] R denotes the equivalence class of R. U/R is also called a knowledge space or approximation space.
If
(1) Equality: V1 = V2 iff ∀x ∈ U, t V 1 (x) = t V 2 (x) and f V 1 (x) = f V 2 (x);
(2) Containment: V1 ⊆ V2 iff ∀x ∈ U, t V 1 (x) ≤ t V 2 (x) and f V 1 (x) ≥ f V 2 (x);
(3) Intersection: V3 = V1 ∩ V2 iff ∀x ∈ U, t V 3 (x) = min {t V 1 (x) , t V 2 (x)}, f V 3 (x) = max {f V 1 (x) , f V 2 (x)};
(4) Union: V3 = V1 ∪ V2 iff ∀x ∈ U, t V 3 (x) = max {t V 1 (x) , t V 2 (x)}, f V 3 (x) = min {f V 1 (x) , f V 2 (x)};
(5) Complement:
Where, μ denotes the membership degree of x belongs to the vague concept V.
When U is continuous, the intuitionistic fuzziness of V can be denoted by
When U is discrete, the intuitionistic fuzziness of V can be denoted by UNC V (x) = ∑x∈UH V (x).
To measure the uncertainty of a rough approximation spaces U/R for describing a vague concept V, literature [51] further proposed the average intuitionistic fuzziness as follows:
Where, μ denotes the membership degree of x belongs to the vague concept V.
When U is continuous, the average intuitionistic fuzziness of V can be denoted by
When U is discrete, the average intuitionistic fuzziness of V can be denoted by
According to Definition 4 and formula (2), we have
Since Zadeh proposed fuzzy sets, many researchers pay attentions to similarity measures between fuzzy sets from different viewpoints. As a generalization of fuzzy sets, large of similarity measures [4, 27] are proposed for vague sets, which has been applied in various areas, including data mining [39], approximate reasoning [4], pattern recognition [6], et al. However, when a vague concept is described by different rough approximation spaces respectively, the existed similarity measures failed to reflect the similarity among these approximation spaces. Moreover, different rough approximation spaces of a vague concept may possess the same measure result, and the differences among them for characterizing a vague concept cannot be reflected. To solve the above problems, in this section, we proposed a similarity measure for rough approximations of vague concepts based on the idea of Earth Mover’s Distance [32, 33].
Suppose that U is the universe of discourse, V is a vague set on U and A is a finite set on U. Then, VIG A is a vague set formed by A to characterize V, and VIG A is called a vague information granule (VIG).
(continued)
(1) Positive: s (A, B) ≥0;
(2) Symmetric: s (A, B) = s (B, A);
(3) Triangle inequality: s (A, B) + s (B, C) ≤1 + s (A, C).
(1) Positive: d (A, B) ≥0;
(2) Symmetric: d (A, B) = d (B, A);
(3) Triangle inequality: d (A, B) + d (B, C) ≥ d (A, C).
Proof. Suppose that V is a vague set on U, A, B and C are finite sets on U. VIG A , VIG B and VIG C are vague information granules formed by A, B and C. Obviously, s VIG (· , ·) is positive and symmetric. We have the following assumption a = ∑x∈UHVIG A ∪VIG B (x) - ∑x∈UHVIG A ∩VIG B (x), b = ∑x∈UHVIG B ∪VIG C (x) -∑x∈UHVIG B ∩VIG C (x), and c = ∑x∈UHVIG A ∪VIG C (x) - ∑x∈UHVIG A ∩VIG C (x). Because (A ∪ B - A ∩ B) + (B ∪ C - B ∩ C) ≥ A ∪ C - A ∩ C, then a + b ≥ c.
Obviously,
According to Definition 6, s VIG (· , ·) is a similarity measure between VIGs. We denote d VIG (· , ·) =1 - s VIG (· , ·). From Theorem 1, we have the following Lemma:
∑x∈UHVIG A ∪VIG B (x) = 1.432,∑x∈UHVIG A ∩VIG B (x) = 0.645,UNC V (x) =1.432.
According to Definition 8, we have
As well known, the Earth Mover’s Distance (EMD) [32, 33] is proposed by Rubner to calculate the differences between probability distributions in image retrieval. This prevents quantization errors by minimizing the cost of transformation. Moreover, the EMD achieves a many-to-many matching computation, which is in line with human cognition and applied in various fields, i.e., computer vision [24, 53], image processing [10, 34].
In formula (6), d ij is a distance measure between p i and q j . flow ij is the flow from p i to q j . The purpose of the EMD is to search a matrix F = [flow ij ] to minimize the transformation cost.
(1) d (· , ·) is a distance measure;
(2)
Based on the idea of EMD, a similarity measure for rough approximations of vague sets is presented, which is called rough vague similarity (RVS).
In formula (7), s VIG (· , ·) can be any similarity measure, and it is chosen according to the special problem. Inheriting the merits of the EMD, RVS is more robust to achieves a many-to-many matching calculation, and is more in line with human cognition. Definition 10 is further explained by the following example.
According to Definition 10, we have
Moreover, Example 3 can be further described intuitively by Fig. 1. w p i (i = 1, 2) denotes the weight of equivalent classes p i , which is the number of elements in equivalent classes p i . Similarly, w q j (j = 1, 2, 3) denotes the weight of equivalent classes q j , which is the number of elements in equivalent classes q j . Then, f ij represents the flow of elements from p i to q j or q j to p i .

Schematic diagram of the RVS explained by Example 3.
Suppose that U/R1 = {p1, p2, …, p
l
} is a rough approximation spaces on U, where p
i
= {xi1, xi2, …, xi|p
i
|}. Then p
i
= g
R
1
(xi1) = g
R
1
(xi2) = ⋯ = g
R
1
(xi|p
i
|). The mechanism provides a uniform representation of granular spaces in a knowledge base. For example,
Proof. Suppose that decision system S = (U, C ∪ D), R1, R2 ⊆ C and V is a vague set on U. Let U = {x1, x2, . . . , x N } be a non-empty finite universe, U/R1 = {p1, p2, . . . , p l }, U/R2 = {q1, q2, . . . , q m } and U/R3 = {r1, r2, . . . , r n }. Obviously, it is positive and symmetric. We have
Similarly,
According to Theorem 1, we have
s (g R 1 (x i ) , g R 2 (x i )) + s (g R 2 (x i ) , g R 3 (x i ))
≤1 + s (g R 1 (x i ) , g R 3 (x i )) then,
RVS (U/R1, U/R2)+ RVS (U/R2, U/R3) ≤1 + RVS (U/R1, U/R3). According to Definition 6, RVS (· , ·) is a similarity between rough approximation spaces for charactering a vague concept.□
From the perspective of granular computing, with the granularity being finer gradually, the information granules in approximation space will be subdivided into finer granules. As a result, the hierarchical multi-granularity approximation spaces are formed, which means that a vague concept can be described by multilevel rough approximation spaces in a decision system. In this section, the changing rules will be discussed in multi-granularity rough approximation spaces for charactering a vague concept.
Proof. According to formula (5), we have
Proof. Let U = {x1, x2, . . . , x
N
} be a non-empty finite universe, U/R1 = {p1, p2, . . . , p
l
}, U/R2 = {q1, q2, . . . , q
m
} and U/R3 = {r1, r2, . . . , r
n
}. Because R1 ⊆ R2 ⊆ R3, so
Without loss of generality, let p1 = q1 ∪ q2, p2 = q3, p3 = q4,...,p
l
= q
m
(m = l + 1), q1 = r1∪r2, q2 = r3, q3 = r4,⋯,q
m
= r
n
(n = m + 1), namely, U/R2 = {q1, q2, p2, p3, . . . , p
l
}, U/R3 ={r1, r2, q2, q3, . . . , q
m
}. According to formula (7), we have
According to Lemma 2, because s VIG (p1, q1)>s VIG (p1, r1) and s VIG (p1, q2) > s VIG (p1, r2), we have RVS (U/R1, U/R2) - RVS (U/R1, U/R3) ≥0, namely, RVS (U/R1, U/R2) ≥ RVS (U/R1, U/R3). Similarly, RVS (U/R2, U/R3) ≥ RVS(U/R1, U/R3) holds.□
According to conditions, we have
In a decision system, there are two special cases: the finest approximation space and the coarsest approximation space re denoted by ω and δ respectively. According to Theorem 5, the following corollaries hold:
Proof. Based on the proof of Theorem 4, we have
(continued)
(1) I (U/R1) ≥0, for any R1 ⊆ C;
(2) U/R1 ≺ U/R2 ⇒ I (U/R1) > I (U/R2), for any R1, R2 ⊆ C;
(3) U/R1 = U/R2 ⇒ I (U/R1) = I (U/R2), for any R1, R2 ⊆ C.
Then I is an information measure.
Proof.
(1) From Theorem 3, RVS (U/R1, ω) ≥0.
(2) Assume that R2 ⊆ C, when U/R1 = U/R2, obviously, RVS (U/R1, ω) = RVS (U/R2, ω).
(3) According to Corollary 1, if R1 ⊆ R2, RVS (U/R1, ω) ≤ RVS (U/R2, ω).□
According to Corollary 3 and Theorem 6, for a vague concept in hierarchical multi-granularity approximation spaces, the larger the information difference between rough approximation spaces is, the larger the rough vague similarity between them is.
(1) G (U/R1) ≥0, for any R1 ⊆ C;
(2) U/R1 ≺ U/R2 ⇒ G (U/R1) > G (U/R2), for any R1, R2 ⊆ C;
(3) U/R1 = U/R2 ⇒ G (U/R1) = G (U/R2), for any R1, R2 ⊆ C.
Then G is a granularity measure.
Proof.
(1) From Theorem 3, RVS (U/R1, δ) ≥0.
(2) Assume that R2 ⊆ C, when U/R1 = U/R2, obviously, RVS (U/R1, δ) = RVS (U/R2, δ).
(3) According to Corollary 2, if R1 ⊆ R2, RVS (U/R1, δ) ≥ RVS (U/R2, δ).
Therefore, RVS (U/R1, δ) is a granularity measure.□
According to Corollary 4 and Theorem 7, for a vague concept in hierarchical multi-granularity approximation spaces, the larger the granularity difference between rough approximation spaces is, the larger the rough vague similarity between them is.
Applicable analysis
As mentioned above in Section 1, the rough vague similarity has the following potential applications: (1) In rough set theory, the rough vague similarity is helpful for effectively choosing the suitable rough approximation spaces for describing a vague concept more accurately.(2) The rough vague similarity can be used to construct a heuristic function in attribute reduction. In this section, we only discuss the former, that is, the effectiveness of the proposed rough vague similarity in the view of granular computing.
Case study
As well known, various knowledge spaces are formed by different attribute sets. This means that a vague concept can be described by multi-granularity rough approximation spaces in a decision system. There always exists an optimal knowledge space by attribute reduction [15–17, 37] from the perspective of granular computing. For charactering a vague concept, although it is difficult to acquire the optimal attribute sets in some case, we can search the attribute sets that possess the similar approximation ability of optimal attribute sets by measuring the matching degree between optimal knowledge space and other knowledge spaces. The following example is presented to further illustrate this idea.
Table of student score
Table of student score
The descriptions of datasets
From the perspective of uncertainty, the significance of U/R1 and U/R2 for checking liver cancer can not be distinguished. However, according to rough vague similarity, we have
Therefore, for describing the target concept V, attribute sets R1 is more similar than attribute sets R2 to the optimal attribute sets C. To sum up, although the local hospital does not possess the all detection items for liver cancer for some reasons, we can choose the hospital whose approximate ability is closer to the optimal attribute set by rough vague similarity.
In this subsection, related experiments are conducted to validate the effectiveness of RVS. The experimental environments are Windows 7, Intel Core (TM) I5-4590 CPU (3.30 GHz) and 8GB RAM. The experimental platform is Matlab 2015b. The experimental datasets include one dataset from Greek industrial development bank [12] and three datasets from UCI Repository of machine learning databases [31] are listed in Table 1.
Supposing that GS = {GL1, GL2, GL3, GL4, GL5, GL6, GL7}, and it denotes the seven granularity layers with the granularity being finer. In Fig. 2, the different colors represent the RVS of different rough approximation spaces. Figure 2 shows that each dataset behaves similarly, that is, the larger the granularity difference between two rough approximation spaces, the smaller the RVS between them. Conversely, the smaller the granularity difference between two rough approximation spaces, the larger the RVS between them. Figure 3 further shows the details of RVS in Fig. 2.

The change of RVS among rough approximation spaces with partial order relation.

The matrix value of RVS in Fig. 2.
In Fig. 4, rough vague similarity and average intuitionistic fuzziness are compared for distingushing approximation spaces coming from the same dataset. The pairs of approximation spaces distingushed by rough vague similarity and average intuitionistic fuzziness, respectively. With loss of generality, average intuitionistic fuzziness is only one representative of the fuzziness measure in this experiment. Based on the idea of literature [29], we have the following formula of the identifiable ratio:

Comparison on rough vague similarity and average intuitionistic fuzziness.
In Fig. 4, the blue bar and the yellow bar represent the identifiable ratio of rough vague similarity and average intuitionistic fuzziness, respectively. Obviously, the identifiable ratio of rough vague similarity is greater than that of average intuitionistic fuzziness. Compared with average intuitionistic fuzziness, the rough vague similarity possesses better performance for reflecting differences among rough approximation spaces when they describe a vague concept at the same. That is, as long as the descriptive ability of two rough approximation spaces for describing a vague concept are not the same as each other, they can be distinguished by rough vague similarity.
Multi-granularity thinking of human intelligence is an effective method to establish a new model for data processing. Knowledge similarity can reflect the differences of different granularities in problem observation, which is in line with the information processing mechanism of human brain. In this paper, to reflect the similarity between rough approximation spaces for charactering a vague concept, the rough vague similarity (VGS) with strong distinguishing ability is proposed. Furthermore, a comprehensive analysis for multi-granularity rough approximations of vague sets from the perspective of RVS is presented and the relationship between RVS and uncertainty measure (granularity measure and information measure) is established. Finally, the effectiveness of RVS is verified by a case study and related experiments. These results are useful for knowledge discovery in multi-granularity spaces for describing an uncertain concept and important for establishing a GrC framework in knowledge bases.
Footnotes
Acknowledgment
This work is supported by the National Science Foundation of China (No. 61762089, 61663047, 61863036), Guizhou Provincial Natural Science Foundation (KY[2018]No.318), Innovation and exploration project of Guizhou Province (QKHPTRC [2017]5727-06), High level innovative talents of Guizhou Province (ZKG 2018 [15]).
