Abstract
The paper focuses on the question that ensures the completeness of the information and the authenticity of the conclusions, which is based on S-rough sets from the perspective of three-way decisions. In order to make a rational decision by the dynamic information, the multi-users S-rough sets model based on three-way decisions is proposed. Subsequently, in the case of incomplete information, the paper gives an application of the model, which may indicate usefulness of the model and prove that it is the new approach to deal with the relevant problem of uncertainty information provided by multiple users.
Keywords
Introduction
Rough Sets theory is mainly used in expressing and processing the uncertain, imprecise and vague information [1, 2]. It uses the equivalence relations to divide the universe into a number of equivalence classes. The concepts are approximated by the upper and lower approximations. Namely, the concepts are divided by three regions, which are the positive, boundary and negative regions. The positive region is constituted by the equivalence classes that belong to the approximated entirely, while the negative region is obtained by the equivalence classes that not belong to approximated. The equivalence classes constitute the boundary region if it belongs to the set partly. Based on three regions above, Yao proposed the three-way decision rules and explored a new semantic model [3–5]. In a sense, the classical rough set is the static rough sets, but the information has the dynamic nature in reality. In order to ensure the completeness of the information, literatures [6–13] proposed S-rough sets.
In order to ensure the completeness of information and authenticity of conclusion, this paper applies the ideology of three-way decisions to the S-rough sets and extends it to multi-users. In addition, different kinds of multi-users models are also discussed, and application conditions and scope of this model are presented. Subsequently, the paper applies the model to an instance, which verifies the practicality of the model. Eventually, this paper gives the definition and method of attribute reduction. Therefore, we can make a final reasonable decision in the case of existence of incomplete information.
Based on the above analyses, this paper is divided into six parts. In the first part the three-way decision rules and S-rough sets are briefly reviewed. In the second part the S-rough Sets Model based on Three-way Decisions is presented. In the third part extend the model to multi-users. In the fourth part a model instance is given. In the fifth part proposes the attribute reduction of S-rough sets model based on three-way decisions. And In the sixth part theconclusion of this paper is drawn and the further research is pointed out.
Three-way decisions and S-rough sets
Pawlak rough sets theory does not take the decision rules of fault-tolerant into account, which requires to introduce the concept of probabilistic rough set [14, 15].
In which Pr(X| [x]
R
) = | [x]
R
∩ X| ∖ | [x]
R
| denote the conditional probability of the classification, | · | represents the cardinality of a set. The (α, β)-probabilistic lower and upper approximations divide the universe into three regions: the (α, β)-probabilistic positive, boundary and negative regions, which are defined as:
For X*, its lower approximations is (R, F) ∘ (X*) and its upper approximations is (R, F) ° (X*).
The study of the paper is based on the two direction S-rough Sets.
Let U be a finite non-empty set and R be the equivalence relation defined in U, then (U, R) is the approximate space, which is denoted as: apr = (U, R). The division of U in the equivalence relation R is recorded as U/R = {[x] R : x ∈ U}, in which [x] R is the equivalence class including x. However, for S-rough sets, due to a family of the element migration γ, it makes [x] R change.
If u ∈ U, u ∉ X, f (u) = x ∈ X, the universe U is reclassified, which is denoted as:
If x ∈ Xf′(x) = u ∉ X, the universe U is reclassified, which is denoted as:
Then the universe U is reclassified as . Due to the above, it makes the lower and upper approximation of X* change. So for X*, the lower approximations of X*, namely (R, γ) * (X*), and the upper approximations of X*, namely (R, γ)*(X*) can be redefined.
For two direction Singular Set X*, the positive region of X*, namely POS (X*), the boundary region of X*, namely BND (X*), and the negative region of X*, namely NEG (X*) are defined as:
The S-rough sets model based on three-way decisions describe the decision-making process by using a finite set of two states and a finite set of three possible actions. According to Definition 3, X*⊂ U is two direction Singular Set in U. Given the set of states Ω, Ω = {X*, - X*} X* denote that an event belongs to X* and -X* denote that an event doesn’t belong to X*; Given the set of actions A, A = {a
p
, a
B
, a
N
}, in which a
P
, a
B
and a
N
represent the three actions in classifying an object, namely, accepting an event, delaying the decision of an event and refusing an event. Different actions can produce different losses. When x ∈ X, λ
PP
, λ
BP
and λ
NP
respectively denote the loss of taking actions a
P
, a
B
and a
N
; Likewise, when x ∉ X*, λ
PN
, λ
BN
and λ
PN
respectively denote the loss of taking actions α
P
, α
B
and α
N
. And is the new equivalence classes of x migrating by the migration family of γ. When taking the three different actions of α
P
, α
B
, α
N
, the expected loss can be expressed as:
According to the Bayesian decision rules, choosing the minimum loss as the optimal plan of the set of actions is needed, then the following minimum-risk decision rules are obtained: If ≤ and ≤ , decide x ∈ POS (X*); If ≤ and ≤ , decide x ∈ BND (X*); If ≤ and ≤ , decide x ∈ NEG (X*).
Because , the above decision rules is related to the condition probabilities and the involved loss function λ. Moreover, considered that the loss of accepting the right things is less than the loss of delaying to accept the right things, both of them are less than the loss of refusing to accept the right things; similarly, the loss of rejecting the wrong things is not greater than the loss of delaying to reject the wrong things, both of them are less than the loss of accepting the wrong things. Therefore, a reasonable assumption is
Here, we make
From the rule of (B), β < α, < and β < α, < < can be easily obtained; By calculating, 0 ≤ β < γ < α ≤ 1 can be obtained. Thus, the rule (P), (B), (N) can be rewritten as: If , decide x ∈ POS (X*); If β < < α, decide x ∈ BND (X*); If , decide x ∈ NEG (X*).
So the semantics of the S-rough sets model based on three-way decisions can be described as:
If the occurrence probability of X* is not less than the value of α in the description of , then can be divided into the positive region of X*, making the positive decisions immediately.
If the occurrence probability of X* is greater than the value of β and less than the value of α in the description of , then can be divided into the boundary domain of X*; Because the evidence of making decisions is insufficient, gathering more information is needed in order to make the right decisions.
If the occurrence probability of X* is less than or equal to the value of β in the description of , then can be can be divided into the negative domain of X*, making the negative decisions immediately.
Thus, in the S-rough sets model based on three-way decisions, its two parameters α and β can be directly calculated by the loss of function. Given the loss of function is needed in advance, so the appropriate prior knowledge is needed, which is limited in the application of many fields. However, there are some different methods [19, 20], which can solve this problem.
According to the Bayes theory, for the rules (P1)-(N1), the conditional probability can be calculated by the prior probability Pr(X*) and the similar ratio.
Pr(X*) is the probability of the state property set of X* in the division of the decision attributes based on S-rough sets; is the probability of the equivalence classes in the division of the condition attributes based on S-rough sets. It denote that , in which is the number of the condition attributes and v i is the value of in the i-th (i = 1, 2, …, m) condition attribute. Similarly, the formula can be obtained.
If the attributes are independent in the information system, then the following formula can be obtained:
Then the related parametersPr(X*), Pr(v i ) and Pr(v i |X*) can be got from the information system directly and be used to calculate the conditional probability . Thus, according to the above analysis, the method can classify and make decisions by calculating the related parameters and using the rules (P1)-(N1) for the objects, which provides a way to deal with the practical decision-makingproblems.
With the S-rough sets obtained, the three-way decisions will be more reasonable to deal with the existing risky and uncertain information. However, multiple users according to their respective standards of different decisions set, it is necessary to find a reasonable and have the consensus decision criteria and then deduce an agreeable decision which could be accepted by the multiple users. In order to solve the problems of multiuser decision in the real world, the multi-user [16–19, 38] s-rough sets model based on three-way decisions is proposed.
n is the number of the users, (α
i
, β
i
) is the threshold parameters and (P
i
, B
i
, N
i
) is the decision rules obtained by the i-th (i = 1,2, …, n) user. If f (α1, α2, …, α
n
) and g (β1, β2, …, β
n
) are procured, which make α* = f (α1, α2, …, α
n
), β* = g (β1, β2, …, β
n
) and α* < β* tenable, new decision rules are obtained. If , decide x ∈ MPOS (X*); If β* < < α*, decide x ∈ MBND (X*); If ≤ β*, decide x ∈ MNEG (X*).
In which, MPOS (X*), MBND (X*), and MNEG (X*) denote the positive region, the boundary region and the negative region, respectively; and they are all decided by each of the multi-users.
Based on different circumstances, the definition of conservative decision-making region and risky decision making region are put forwarded.
conservative boundary region
conservative negative region
risky boundary region
risky negative region
In which, POS
i
(X*) and NEG
i
(X*) denote the positive and the negative region obtained by the i-th (i = 1, 2, …, n) user, respectively; and the MBND (X*) can be gained as supplementary set of MPOS (X*) and MNEG (X*). Values of α* and β* can be achieved in many different ways according to different situations of the rules (P*), (B*) and (N*). Such as in the mean acceptance region model and the mean rejection region model, considering each user has different decision proportion, α* and β* can be described as
where k i (i = 1, 2, …, n) is the decision proportion.
In the trial process of one case, U is the domain constituted by criminal suspects; X ⊂ U and X ≠ φ, in order to prevent the omission or the erroneous judgements, it is essential to add or delete the elements inX. If one suspect proves to commit the crime with sufficient evidence, the suspect should be added to the set X. contrarily, if one suspect proves to be innocent, the suspect should be moved out from X. Then, the result should be submitted to the forum after the final decision is made according to the Three-way Decisions theory [22–27].
U is supposed to be the domain constituted by criminal suspects.
U = {x0, x1, x2, x3, x4, x5, x6, x7, x8, x9, x10}, x i (x i ∈ U) denote a subset, [x] i is the equivalence class of x i .
According to the initial properties of these objects, the initial equivalence classes in U are as follow.
According to the Definition 3, some elements should be moved into X or out from X. During the investigation of the case, x3 was forced to get involved in the case but he/she did not violate the law, x2 and x9 were escaped criminals.
Therefore
Depending on the newly-known properties of the objects, the equivalence classes can be reclassified as
According to the Definition 4
For one certain type of criminal suspects, the state set Ω = {X*, - X*} indicates the state of guilty and innocent; the action set A = {α P , α B , α N } denote three actions, which are detention, delaying decisions and release.
There are six parameters in the model, if [x] i ∈ X* let θ PP , θ BP , and θ NP represent the social harmfulness for taking actions of α P , α B , and α N , respectively; if [x] i ∈ - X*, let θ PN , θ BN , and θ NN represent the social harmfulness for taking actions of α P , α B , and α N , respectively (see Table 1).
And at the same time these six parameters are assumed to meet the formulas
In practical problems, the final decision should be made after the comparison between the value of conditional probability and the value ofα and β. The calculation of requires the priori information which could be carried out by the Equations (26–28). For simplicity, we assume that , then POS (X*), BND (X*) and NEG (X*) can be conducted from the decision rules (P1)-(N1). POS (X*) = {1, [x] 5}, and suspects denoted by [x] 1, [x] 5should be detained. BND (X*) = {[x] 2}, so how to dispose suspects denoted by [x] 2 should be decided later. NEG (X*) = {[x] 3, [x] 4}, which means suspects represented by [x] 3 and [x] 4 should be released. By Table 2, It is shown that the establishment of three decision regions depends on the value of the conditional probability and the threshold parameters.
Attribute reduction of S-rough sets model based on three-way decisions
Non-monotonicity of the positive region
In the theory of S-rough sets model based on three-way decision, positive region is defined as a universe objects set obtaining correct classifications at a certain probability. Probability threshold value can be acquired by minimizing the expected risk.
where is the approximate set of X* under α-, i.e.,
Compared with positive region of Pawlak rough sets theory, the upper α-positive region has a certain error tolerance, which allows a certain degree of misclassification. In this case, classification accuracy used for expressing the classifying ability of attribute set now is not the ratio of the number of objects classified completely correctly to the number of universe objects but the most number of classified correctly objects to the total number of universe objects. Then the performance of classification of attributes set is not always improved as the number of attribute increases.
According that positive region in S-rough sets model based on three-way decision has the feature of non-monotonicity, a new definition of attribute reduction is given. In this model, due to the characteristic of non-monotonicity, reducing the number of attribute may result in the extension of positive region and expanded positive region has stronger ability of classification. With comprehensive consideration, redefine attribute reduction as follows.
(1) Non-decreasing feature of the positive region:
(2) Independence of the attribute:
According to the S-rough Sets model based on three-way decisions, these mantic interpretation of the minimum risk based on the Bayes’ rules is given, and the threshold parameters can be calculated by the action loss functions. Moreover, with the circumstances of multi-users, the threshold parameters can be gained depends on the situation. The conditional probability can be calculated from the information system. The threshold parameters are used to verify the correctness of the conditional probability, meanwhile the conditional probability is used to guide the rationality of the threshold parameters. They oppose each other also complement each other. Thus, the multi-users s-rough sets model based on three-way decisions provides a new perspective to deal with the problem of uncertain information.
Footnotes
Acknowledgments
This work is supported by the Fundamental Research Funds for the Central Universities (lzujbky-2012-43). The authors would like to thank the reviewers for their constructive comments and suggestions.
