Abstract
An approximation space is one basic concept in rough set theory. This paper investigates measures of uncertainty for an approximation space from granular computing viewpoint. Granular structures of an approximation spaces are first described by means of set vectors. Then, relationships between granular structures of an approximation spaces are studied from the two aspects of dependence and separation. Next, properties of granular structures of an approximation space are given. Furthermore, as an application for granular structures of an approximation spaces, measuring uncertainty of approximation spaces is investigated. Finally, one example is employed to illustrate features of the proposed measures for uncertainty of approximation spaces. These results will be helpful for understanding the essence of uncertainty for an approximation space.
Keywords
Introduction
Rough set theory, presented by Pawlak [15], is a mathematical tool to deal with uncertainty and can be considered as the generalization of classical set theory. It has been successfully applied to intelligent systems, expert systems, knowledge discovery, pattern recognition, machine learning, signal analysis, image processing, inductive reasoning, decision analysis and many other fields [3, 15–17].
The basic structure of rough set theory is an approximation space. Based on it, lower and upper approximations can be induced. Using these approximations, knowledge hidden in information systems may be revealed and expressed in the form of decision rules. A key notion in Pawlak rough set model is equivalence relations. The equivalence classes are the building blocks for the construction of these approximations.
Granular computing proposed by Zadeh is an important tool in artificial intelligence [25–27]. Its key issues are granulation (or information granulation), organization and causation. Zadeh pointed out “Granulation involves decomposition of whole into parts, organization involves integration of parts into whole, and causation involves association of causes and effects". The aim of granular computing is to explore an approximation scheme, which can effectively solve a complex problem. It allows us to view a phenomenon with different levels of granularity. Now, granular computing has been applied in various fields such as data mining, data clustering, machine learning, artificial intelligence, approximate reasoning and knowledge discovery.
A granule (or an information granule) is a primitive concept in granular computing, which is a clump consisting of objects drawn together by similarity, indistinguishability and proximity of functionality [12, 24]. It may be interpreted as one of the numerous small particles forming a larger unit. Granulation of objects leads to a family of information granules. A granular structure is a mathematical structure of the family of information granule from a data set, where the internal structure of each information granule is visible and the coaction between information granules are tested by the visible structures [4–6].
The concept of entropy is originated from energetics. It can be used to measure out-of-order degree of a system. The entropy of a system, proposed by Shannon [21], gives a measure of uncertainty of a system. It has been applied in diverse fields as a useful mechanism for evaluating uncertainty in various modes. Some scholars have applied the extension of entropy and its variants to rough sets. For example, D
The aim of this paper is to investigate uncertainty measurement for an approximation space by using its granular structures.
The remaining part of this paper is organized as follows. In Section 2, some basic notions on approximation spaces and rough set theory are recalled. In Section 3, the concept of granular structures is introduced and dependence between granular structures are proposed. In Sections 4, some tools for measuring uncertainty of an approximation space are introduced and an illustrative examples is given. In Sections 5, a numerical experiment is given to show features of the proposed measures. Section 6 summarizes this paper.
Preliminaries
In this section, we recall some basic concepts about approximation spaces and rough set theory.
Throughout this paper, U denotes a non-empty finite set called the universe, 2 U denotes the family of all subsets of U and |X| denotes the cardinality of X ∈ 2 U .
Denote
Recall that R is a binary relation on U whenever R ⊆ U × U. If (x, y) ∈ R, then we denote it by xRy.
Let R be a binary relation on U. Then R is called
(1) reflexive, if xRx for any x ∈ U.
(2) symmetric, if xRy implies yRx for any x, y ∈ U.
(3) transitive, if xRy and yRz imply xRz for any x, y, z ∈ U.
Let R be a binary relation on U. Then R is called an equivalence relation on U, if R is reflexive, symmetric and transitive.
In this paper, U = {x1, x2, ⋯ , x
n
},
Given
Let
Then
Granular structures of approximation spaces
In this section, we give granular structures of approximation spaces and study dependence between them.
G (▵) = ({x1} , {x2} , ⋯⋯ , {x n }).
The following definition depicts relationships between granular structures of approximation spaces from two aspects.
(1) G (Q) is called to depend on G (P), if for each i, [x i ] P ⊆ [x i ] Q , we write G (P) ⪯ G (Q); G (Q) is called to depend strictly on G (P), if G (P) ⪯ G (Q) and G (P) = G (Q), we write G (P) ≺ G (Q).
(2) G (Q) is called to depend partially on G (P), if there exists i, [x i ] P ⊆ [x i ] Q , we write G (P) ⊑ G (Q); G (Q) is called to depend partially strictly on G (P), if G (P) ⊑ G (Q) and G (P) = G (Q), we write G (P) ⊏ G (Q).
(3) G (Q) is called to be independent of G (P), if for each i, [x i ] P ⊈ [x i ] Q . We write G (P) ⋈ G (Q).
Obviously,
G (P) = G (Q) ⇔ G (P) ⪯ G (Q), G (Q) ⪯ G (P),
G (P) ⪯ G (Q) ⇒ G (P) ⊑ G (Q).
G (P) notsqsubseteqG (Q) ⇔ G (P) ⊑ G (Q).
(1) G (P) = G (Q);
(2) U/P = U/Q;
(3) P = Q.
(1)
(2)
The following theorem quantitatively depict the dependence of granular structures by the condition information amount.
(1) G (P) ⪯ G (Q);
(2) U/P refines U/Q, i.e., for each A ∈ U/P, there exists B ∈ U/Q such that A ⊆ B;
(3) P ⊆ Q;
(4) H ((U/Q)/(U/P)) =0.
(3) ⇒ (4). Denote
U/P = {X1, X2, ⋯⋯ , X k } ,
U/Q = {Y1, Y2, ⋯⋯ , Y l } .
∀ i, since U/P refines U/Q, we have X i ⊆ Y j i for some j i ≤ l. Then X i - Y j i =∅.
∀ j ≠ j i , Y j ∩ Y j i = ∅. Then X i ∩ Y j = ∅.
Thus ∀ i, j, X
i
- Y
j
=∅ or X
i
∩ Y
j
= ∅. This implies
Hence H ((U/Q)/(U/P)) =0.
(4) ⇒ (2). Since H ((U/Q)/(U/P)) =0, we have ∀ i, j, X i - Y j =∅ or X i ∩ Y j = ∅. So ∀ i, j, X i ⊆ Y j or X i ∩ Y j = ∅.
∀ i,
Thus U/P refines U/Q. □
For
(1) G (P) ⪯ G (Q);
(2) for each j, D j ∈ σ (U/P);
(3) for each j,
(4)
Thus D j = ⋃ x∈D j [x] P ∈ σ (U/P).
(2) ⇒ (3). Since D j ∈ σ (U/R), D j = ⋃ x∈X [x] P for some X ∈ 2 U .
Thus
(3) ⇒ (4) is obvious.
(4) ⇒ (1). ∀ i ≤ n, ∀ y ∈ [x
i
]
P
. Since
(1) 0 ≤ D (G (Q)/G (P)) ≤1;
(2) G (P) ⪯ G (Q) implies D (G (Q)/G (P)) =1;
(3) G (P) ⊑ G (Q) ⊑ G (R) implies D (G (P)/G (R)) ≤ D (G (P)/G (Q)).
It is easy to prove that D is the inclusion degree on G (U).
The following theorem shows that relationships between granular structures in an approximation space can be quantitatively described by the inclusion degree.
(1) G (P) ⪯ G (Q) ⇔ D (G (Q)/G (P)) =1 .
(2) G (P) ⋈ G (Q) ⇔ D (G (Q)/G (P)) =0 .
(3) G (P) ⊑ G (Q) ⇔0 < D (G (Q)/G (P)) ≤1 .
“⇐". Put
Hence G (P) ⊏ G (Q).
(2) “⇒". Since G (P) ⋈ G (Q), we have ∀ l, [x
l
]
P
⊈ [x
l
]
Q
. Then for each l,
Thus D (G (Q)/G (P)) =0.
“⇐". Since D (G (Q)/G (P)) =0, we obtain that for each l,
Then ∀ l, [x l ] P ⊈ [x l ] Q . Thus G (P) ⋈ G (Q).
(3) This holds by (2). □
Some tools for measuring uncertainty of approximation spaces
In this section, we investigate measuring uncertainty of approximation spaces.
Granularity measures of approximation spaces
We first propose axiom definition of information granulation of approximation spaces in the following definition.
(1) Non-negativity:
(2) Invariability:
(3) Monotonicity:
Similar to Definition 2 in [12], the information granulation of a given approximation space is given in the following definition.
Suppose X
i
= {xi1, xi2, ⋯ , x
is
i
}, |X
i
| = s
i
. Then
Hence
□
By Proposition 4.3,
If P is an identity relation on U, then ∀ i, | [x
i
]
P
|=1. So
If P is a universal relation on U, then ∀ i, | [x i ] P | = n. So G (P) =1.□
Since (U, P) ≺ (U, Q), we have ∀ i, [x i ] P ⊆ [x i ] Q and ∃ j, [x j ] P ⊊ x i ] Q .
Then, ∀ i, | [x i ] P | ≤ | [x i ] Q | and ∃ j, 1 ≤ | [x j ] P | < | [x j ] Q | .
Hence G (P) < G (Q) . □
(2) Given
By Proposition 4.3, G (P) = G (Q).
(3) “Monotonicity" follows from Theorem 4.5.
□
Entropy measures of approximation spaces
In physics, entropy is often used to measure out-of-order degree of a system. The bigger the entropy value is, the higher out-of-order of a system will be. Shannon [21] applied the concept of entropy in physics to information theory for measuring uncertainty of a system.
Similar to Definition 8 in [12], the information entropy of a given approximation space is defined as follows.
Suppose X
i
= {xi1, xi2, ⋯ , x
is
i
}, |X
i
| = s
i
. Then
Hence
□
It should be noted that (U, P) ≺ (U, Q). Then, similar to the proof of Theorem 4.5, ∀ i, 1 ≤ | [x
i
]
P
| ≤ | [x
i
]
Q
| and ∃ j,
Then, ∀ i,
Hence H (Q) < H (P) . □
Rough entropy, introduced by Yao [24], is used to measure granularity of a given partition. Similar to Definition 6 in [12], the rough entropy of a given approximation space is proposed in the following definition.
Suppose X
i
= {xi1, xi2, ⋯ , x
is
i
}, |X
i
| = s
i
. Then
Hence
□
By Proposition 4.8,
If P is an identity relation on U, then ∀ i, | [x i ] P |=1. So E r (P) =0.
If P is a universal relation on U, then ∀ i, | [x i ] P | = n. So E r (P) = log 2n.□
It should be noted that (U, P) ≺ (U, Q). Then, similar to the proof of Theorem 4.5, ∀ i, 1 ≤ | [x
i
]
P
| ≤ | [x
i
]
Q
| and ∃ j,
Then ∀ i,
Hence E r (P) < E r (Q) . □
(2) Given
(3) “Monotonicity" follows from Theorem 4.13.
□
Information amounts of approximation spaces
Similar to Definition 10 in [12], the information amount of a given approximation space is proposed in the following definition.
Thus, ∀ i,
Hence
□
It should be noted that (U, P) ≺ (U, Q). Then, similar to the proof of Theorem 4.5, ∀ i, 1 ≤ | [x
i
]
P
| ≤ | [x
i
]
Q
| and ∃ j,
Hence E (Q) < E (P). □
Some properties
Suppose X
i
= {xi1, xi2, ⋯ , x
is
i
}, |X
i
| = s
i
. Then
Since {X1, X2, ⋯ , X
m
} is a partition on U, ∀ i, we have
Then
Thus G (P) + E (P) =1. □
By Theorem 4.18, E (P) =1 - G (P).
Thus
Suppose X
i
= {xi1, xi2, ⋯ , x
is
i
}, |X
i
| = s
i
. Then
Thus
□
By Theorem 4.20, H (P) = log2 n - E r (P)
Thus 0 ≤ H (P) ≤ log2 n. □
A numerical experiment example
Below, a numerical experiment example is given to show features of the proposed measures.
The information system (U, A) of postoperative patients
The information system (U, A) of postoperative patients
Denote
P i = ind ({a1, a2, ⋯ , a i }) (i = 1, 2, ⋯ , 9) .
Then (U, P i ) is the approximation space. And
U/P1 = {{x1, x2, x4, x5, x7, x9, x10} , {x3, x6, x8}},
U/P2 = {{x1, x4, x7, x10} , {x2, x9} , {x3, x6} , {x5} , {x8}},
U/P3 = {{x1, x7, x10} , {x2} , {x3} , {x4} , {x5} , {x6} , {x8} , {x9}},
U/P4 = {{x1, x10} , {x2} , {x3} , {x4} , {x5} , {x6} , {x7} , {x8} , {x9}},
U/P5 = {{x1} , {x2} , {x3} , {x4} , {x5} , {x6} , {x7} , {x8} , {x9} , {x10}},
U/P6 = {{x1} , {x2} , {x3} , {x4} , {x5} , {x6} , {x7} , {x8} , {x9} , {x10}},
U/P7 = {{x1} , {x2} , {x3} , {x4} , {x5} , {x6} , {x7} , {x8} , {x9} , {x10}},
U/P8 = {{x1} , {x2} , {x3} , {x4} , {x5} , {x6} , {x7} , {x8} , {x9} , {x10}},
U/P9 = {{x1} , {x2} , {x3} , {x4} , {x5} , {x6} , {x7} , {x8} , {x9} , {x10}}. By Proposition 3.3,
By Proposition 3.8,
By Proposition 3.16,
Measuring uncertainty of approximation.
The results of uncertainty measures are shown in Fig. 1. It can be seen the fact that with the attribute subset B ⊆ A growth, the information granulation and rough entropy of (U, {ind {a} : a ∈ B}) are both monotonically decreasing. Meanwhile, the information amount and information entropy of the approximation space (U, ind (B)) are both monotonically increasing with the attribute subset B ⊆ A growth. That means that information granulation, rough entropy, information amount and information entropy can be applied to measuring uncertainty of a given approximation space.
In this paper, granular structures of approximation spaces have been introduced. Dependence between granular structures has been discussed. Some properties of granular structures have been given. As an application for granular structures of approximation spaces, measures of uncertainty for approximation spaces have been investigated. These results will be very helpful for understanding the essence of uncertainty for an approximation space. In the future, we will consider applications of the proposed measures.
