Abstract
Most existing studies on decision-theoretic rough set approach focus on crisp decisions. However, in many cases, we need to handle fuzzy concepts. This paper investigates decision-theoretic rough set approach for fuzzy decisions. By introducing fuzzy operators, we extend the traditional decision-theoretic rough set approach with crisp decisions to fuzzy concepts. Decision-theoretic rough set approach with fuzzy concepts is investigated based on fuzzy probabilistic approximation space. Bayesian decision procedure based on fuzzy probabilistic approximation space with fuzzy concepts and fuzzy probabilistic decision systems with fuzzy or continuous decision attributes are studied. The application of the proposed approach in decision making is also illustrated by an example. This study may expand the applications of decision-theoretic rough set model.
Introduction
Rough set theory, originally proposed by Pawlak [72], has become a popular mathematical tool to deal with uncertainty and vagueness in many fields, such as data mining, data analysis and decision making [1, 112]. To enhance the robustness of rough set models, probabilistic rough set model (PRS) was proposed by Pawlak et al. in which thresholds were used to determine the lower and upper approximations of a set. The 0.5-probabilistic rough set model was well adopted, i.e. a threshold of 0.5 on probability is used to define probabilistic lower and upper approximations of a set. Yao et al. [94, 100], proposed a generalized probabilistic rough set model called Decision-Theoretic Rough Set (DTRS) model. Compared with related probabilistic rough set models, DTRS provides a solution on computing the threshold parameters for inclusion degree by introducing Bayesian decision theory. DTRS has been widely disussed in the research area of rough set [38, 113].
In DTRS approach, decision (also called state), conditional probability and loss function play an important role. Many existing studies focus on different types of loss function or different types of conditional probability. For example, Li gave a general loss function for supervised learning in which the cost and benefit of assigning an instance to a specific subcategory was studied [48]. Liang and Liu gave decision-theoretic rough set with triangular fuzzy loss function and interval valued loss function in [56] investigated DTRS approach in the frameworks of fuzzy and IVF probabilistic approximation spaces, respectively. In spite of all these studies on loss function and conditional probability, there are few studies on studying the decisions. In these studies, only crisp decision concepts are considered. As well known, the DTRS approach adopts two crisp decisions (also called states) and three actions to describe the decision process. The set of two decisions indicating an object is in a decision or not. The set of actions is deciding that an object belongs to a decision; an object does not belong to a decision and an object is not sure.
However, there are many fuzzy concepts in the real world, which we can’t make a definite division. We can describe them as fuzzy sets other than traditional sets. In this paper, we extend crisp decisions in DTRS to fuzzy decisions. Decision-theoretic rough set approach with fuzzy decisions based on fuzzy probability measure is investigated.
The remainder of this paper is organized as follows. In Section 2, some basic concepts of fuzzy sets and DTRS approach based on fuzzy probability are reviewed. In Section 3, DTRS approach with fuzzy decision based on fuzzy probability measure is investigated. Section 4 concludes this paper.
Preliminaries
Fuzzy logical operator
Throughout this paper, let T denote a triangular norm (t-norm for short). A triangular norm (t-norm for short) T is any increasing (T (a, b) ≤ T (c, d) if a ≤ c and b ≤ d), commutative (T (a, b) = T (b, a)) and associative (T (a, T (b, c)) = T (T (a, b) , c)) [0, 1] 2 → [0, 1] mapping satisfying T (a, 1) = a, for all a in [0, 1].
Three well-known T operators are listed as follows:
Fuzzy decision-theoretic rough set approach with traditional decision
For the purpose of comparison, we recall some useful concepts of fuzzy decision-theoretic rough set approach with traditional decisions. The details can be found in [107, 111].
Let be a fuzzy probabilistic approximation space in which U is a universe of discourse, is a fuzzy relation on U, and P is a fuzzy probability measure of U.
It should be noticed that this computation of condition probability is different from that in [95, 104] which are based on cardinalities.
For each object x ∈ U, for every y ∈ U. Assume that for all x ∈ U. The expected losses of taking different actions for x are as follows [111]:
Bayesian decision procedure based on fuzzy probabilistic approximation space with fuzzy concept
Let be a fuzzy probabilistic approximation space in which U is a universe of discourse, is a fuzzy relation on U, and P is a fuzzy probability measure of U. As well known, decision-theoretic rough set approach adopts two decisions and three actions to describe the Bayesian decision process. The set of two decisions is {X, X C }, (X ⊆ U is a classical set) indicating an object is in X or not, respectively. The set of actions is {a P , a N , a B } deciding that x belongs to X; x does not belong to X and x is not sure (thus a deferment decision for x), respectively. In this paper, we extend the traditional decisions to fuzzy cases, i.e. the two decisions are and . It should be noted that is a fuzzy set representing a fuzzy concept. For example, the concept of “tall” for a person, is actually a fuzzy concept. It is not reasonable to treat this concept as a crisp concept X. Consequently, the loss function [λ] X can given by a matrix in Table 2. The subscript represents this loss function is for , which is omitted in the following if no confusion arises.
For each object x ∈ U, is adopted as its description and for every y ∈ U. Assume that for all x ∈ U. Thus, the expected losses of taking different actions for x can be defined as follows:
where is a fuzzy set and [x] R is a fuzzy equivalence class induced by fuzzy equivalence relation R, [x] R (i) is the ith membership in fuzzy equivalence class [x] R , and T is a t-norm.
This computation of condition probability is different from that in [111]. The definition in this paper adopts a t-norm to perform the intersection of two fuzzy sets, and can handle the case when X becomes fuzzy. Moreover, it can degenerate to a classical case based on cardinalities.
□
It means that
When any two or all actions have the same risk, we break the tie by taking an action according to the order a
P
, a
N
, a
B
. Then the Bayesian decision procedure suggests the following three minimum-risk decision rules. if and , then decide ; if and , then decide ; if the element x satisfies neither P1 nor N1, then decide .
The pair of subsets, is called the [λ] - fuzzy probabilistic rough set of X([λ] - fuzzy, for short).
Consider a special and reasonable kind of loss function [94, 100] with
That is, the loss of classifying an object x belonging to into the positive region of is less than or equal to the loss of classifying x into the boundary region of and both of these losses are less than the loss of classifying an object x into the negative region of . The other order of losses is used for classifying an object that does not belong to . For this type of loss function the minimum-risk decision rules (P1)-(B1) can be rewritten as follows: if and , then decide ; if and , then decide ; if x satisfies neither P2 nor N2, then decide .
where
The (α, γ) - fuzzy probabilistic lower and (γ, β) - fuzzy upper approximations are as follows:
It follows that 1 ≥ α > γ > β ≥ 0. Thus, the following simplified rules are obtained: if , then decide ; if , then decide ; if x satisfies neither P3 nor N3, then decide .
The α - fuzzy probabilistic lower and β - fuzzy upper approximations are as follows:
Actually, it holds that
Let degenerate to a crisp set, the result obtained is the same as the result in [111]. Let X = {u5, u6, u7, u8, u9, u10}, if we denote it as a fuzzy set, it will be .
The fuzzy conditional probabilities for each u
i
∈ U are as follows:
Let (U, A) be a fuzzy information system and P be a fuzzy probability measure on U. Then (U, A, P) is called a fuzzy probabilistic information system. If A = C ∪ D and C∩ D = ∅ with C being the conditional attributes set and D the decision attribute, then (U, C ∪ D, P) is called a fuzzy probabilistic decision system and the decision classes generated by D are denoted by . Sometimes we have different loss functions for different decision classes. To avoid triviality, we only consider the special loss functions in the following discussion that satisfy
The α
i
- fuzzy probabilistic positive region, β
i
- fuzzy probabilistic negative region and (α
i
, β
i
) probabilistic boundary region of related to C are given by rules (P3)-(B3), respectively, as follows:
Since the decision attribute is fuzzy, we also need to construct fuzzy relation from decision attribute. We define the similarity between different decision values by . For example, . Hence, we can get a fuzzy similarity relation by fuzzy decision attribute, which is a fuzzy equivalence class. For d1 = d (u1), we can get its fuzzy equivalence class as follows:
Then, we have the conditional probabilities of shown in Table 6. Consequently, we have the positive region, negative region and bound region as follows:
If , then B is an α-lower (β-upper) consistent set for α-FP-LAD (β-FP-UAD) of π
D
. w.r.t. C. If B is α-lower (β-upper) consistent and no other proper subset of B is α-lower (β-upper) consistent, then B is an α-lower (β-upper) distribution reduct for α-FP-LAD (β-FP-UAD) of π
D
w.r.t. C. If , then B is called an α-fuzzy probabilistic consistent set of π
D
w.r.t. C. If B is a-fuzzy probabilistic consistent and no other proper subset of B is α-fuzzy probabilistic consistent, then B is an α-fuzzy probabilistic reduct of π
D
w.r.t. C.
In this section, the application of the proposed approach in decision making is illustrated by an example. Table 7 shows a data set about Breast Cancer diagnosis. The data set records ten patients {x1, x2,... . . , x10}. There are two conditional attributes representing two symptoms, i.e. the clump thickness and the uniformity of cell size. The decision attribute denotes the diagnosis results given by a doctor.
For the decision value of sample x1, (0 :0.9, 1 : 0.1) means that patient x1 has a probability of 0.9 belonging to class ’0’ and a probability of 0.1 belonging to class ‘1’. It is significant to consider such a decision system where decision attribute is fuzzy since sometimes the doctor can’t make a definite decision.
Let’s denote the two fuzzy decision classes by and . From Table 7, we can get the two fuzzy sets as follows:
The similarity relation generated by conditional attributes is shown in Table 8.
The loss function is shown in Table 9 which is given by experts previously.
From Table 9, we know the loss function for class D is as follows
Then we can get α, β as follows
Hence, we can get the positive region, negative region and the boundary region of fuzzy decision class as follows
From the three regions and the meaning of rough sets, we can get that patients {x1, x2, x3, x4, x6} are definitely belong to class D. In other words, under the loss function given, patients {x1, x2, x3, x4, x6} must be benign. At the same time, patients {x8, x9, x10} must be malignant. We are not quite sure about patients {x5, x7}, and these patients need to be diagnosed further.
Conclusion
Existing studies of decision-theoretic rough set approach mainly focus on crisp decisions. This paper investigates decision-theoretic rough set approach with fuzzy decisions in the framework of fuzzy approximation space. By introducing fuzzy operators, we construct new definition of conditional probability for decision-theoretic rough set approach with fuzzy decisions. Thus, the decision-theoretic rough set procedure of fuzzy concepts can be handled. Furthermore, some characteristics of the proposed method are studied. Theoretical analysis shows that the proposed approach is an extension of traditional decision-theoretic rough set model with crisp decisions.
In future studies, we plan to extend our study to two-universe case. Moreover, attribute reduction in fuzzy probabilistic decision system with fuzzy or continuous decision attribute is an interesting problem worthy of further research.
Footnotes
Acknowledgements
This work was partially supported by the National Natural Science Foundation of China (No. 61473259, No. 61070074, No. 60703038), the Zhejiang Provincial Natural Science Foundation (No. Y14F020118), the National Science & Technology Support Program of China (2015BAK26B00, 2015BAK26B02), the Major Projects of National Social Science Foundation of China (11&ZD189) and the PEIYANG Young Scholars Program of Tianjin University (2016XRX-0001).
