Similarity measure is an important uncertainty measurement in intuitionistic fuzzy set (IFS) theory. In this study, a novel similarity measure is presented by the combination of the information carried by hesitancy degree and the endpoint distance of membership and nonmembership, respectively. Moreover, a numerical example is used to verify the reasonable of the proposed similarity measure. After that, the similarity measure is applied to construct the IF decision-theoretic rough set (IF-DTRS) model and multigranulation IF decision-theoretic rough set (MG-IF-DTRS) model. Some properties of IF-DTRS and MG-IF-DTRS are also investigated. Thirdly, based on granular significance, a novel approach of optimal granulation selection is formulated. Finally, a heuristic algorithm is designed and the effectiveness of this algorithm is demonstrated by an illustrative example.
In fuzzy set theory [3], a membership function is used to describe the uncertainty. And now it has revolutionized many research fields. Based on this theory, several extensions of fuzzy set for various practical applications have been proposed. Atanassov [4, 5] proposed the concept of IFS, in which two values, called membership and nonmembership, are used to characterize the uncertainty of the objective world. The study found that the concept described by IFSs is similar as the essence of human induction and cognition. Therefore, the theoretical and application of IFS have been attentively studied by many scholars [51] and have been widely used in many fields, such as pattern recognition [6, 8], medicine diagnosis [12], decision making [1, 50], and clusteranalysis [11, 13].
In many real world application problems, people often need to compare two or more objects to determine the similarity or proximity between them, which is usually called similarity measure. Many scholars have made great efforts in developing new similarity measures between IFSs and studying their applications. Chen [14, 15] firstly presented a series of similarity measures in 1997. After that, Hong and Kim [16] indicated that the measures were not applicable in some cases and then modified the similarity measures. Based on an axiomatic approach, Li [7] proposed several similarity measures in handlingpattern recognition problems. In [10], Liang and Shi presented some counterintuitive examples to show the weakness of the similarity measures introduced by Li. Mitchell [9] modified the axiomatic definition and introduced a new similarity measure. Using geometrical representation of IFS, Liu [17] proposed some similarity measures. By means of the medians of intervals, Chen [18] presented a similarity measure between IFSs. Depending on Hausdorff distance, Hung [6] defined the distance between IFSs, which was used to generate several similarity measures. Hwang et.al [19] presented a cosine similarity measure for IFSs induced by the Sugeno integral. Considering the Frank t-norms family, Iancu [20] extended some crisp cardinality measures to calculate the similarity of IFSs. According to the uncertainty and hesitancy of IFS simultaneously, Beliakov et al., [21] introduced vector valued similarity measures for IFS. In recent years, some scholars continue to study similarity measurement between intuitionistic fuzzy sets from different perspectives [22, 32]. However, the existing similarity measures between IFSs may exist some counter-intuitive, it is necessary to generate more reasonable and effective similarity measure method.
Rough set [25] is an important tool for handing the vagueness and uncertainty of data. According to equivalence relation in approximation space, the lower and upper approximation sets were calculated to approximate the target concept. However, equivalence relation is still restrictive. In order to extend the application areas, tolerance relation [26, 34], dominance relation [28], neighborhood relation [29] and fuzzy relation [31] were introduced into the original rough set theory. By considering the tolerance of classification error, the decision-theoretic rough set (DTRS) model [35, 36], the Bayesian rough set model and the probabilistic rough set model were proposed. Those works make rough set theory more realistic and useful. By the view of granular computing [37, 38], an equivalence relation can be considered to be a granule. Hence, in Pawlak rough set, the target concept has been approximated by a single granule structure. If the relationship between attributes are independent, the intersection of knowledge obtained from all attributes is clearly irrational.
To address the issue above, Qian [39, 40] proposed multigranulation rough sets(MGRSs) in 2010, where multiple equivalent relations(multiple granular structures) have been used to approximate a target set. And now, many researchers have extended the MGRSs to the generalized MGRSs. Lin [42] introduced neighborhood-based MGRSs model. Combination with fuzzy set theory, Xu [41] proposed multigranulation fuzzy rough set model by considering crisp to fuzzy case. Li [44] developed the MGRSs in ordered information system. Feng [43] illuminated variable precision MGRSs model. Tan el at. [45] characterized the approximations operators and attribute reductions by means of belief structure.
Although IFS theory and MG-DTRSs theory have significant advantages for information expression and processing, respectively. Few works have focused on combining the two theories together to construct MG-IF-DTRSs. Huang [46] introduced the inclusion measure-based MG-IF-DTRSs model, in which the approximated sets were IFSs on the universe of discourse. However, this model can not process labeled data.
According to the above mentioned analysis, we investigate the MG-IF-DTRSs by using a novel similarity measure. The main contributions of this paper can be summarized as follows: (1) A novel similarity measure between IFSs is introduced which is more reasonable and effective. (2) We extend MG-DTRSs in multi-source IFDS and put forward new similarity measure-based MG-IF-DTRSs model based on the proposed similarity measure. (3) We focus on a fundamental issue on how to acquire optimal granulation reduction for MG-IF-DTRSs.
This study is constituted as follows. In Section 2, we review some basic notions in order to facilitate the subsequent discussions. And a novel similarity measure is proposed, we also compare the results with existing similarity measures in Section 3. Section 4 presents the definition of the similarity measure-based IF-DTRSs. Section 5, MG-IF-DTRSs are considered in multi-source IF information systems and mathematical properties are also investigated. In Section 6, we propose a heuristic algorithm to compute the optimistic optimal granulation. After that, the processing of the algorithm is illustrated by an example. We conclude this paper in Section 7.
Preliminaries
In this section, some basic notions are introduced including IFS, DTRS and MG-DTRSs.
IFS and IFIS
Definition 1. [4, 5] Let U be a universe of discourse. An IFS A on U is defined by A = {〈μA (x) , νA (x) 〉 : x ∈ U} , where 0 ≤ μA (x) + νA (x) ≤1, ∀x ∈ U. The mappings μA : U → [0, 1] and νA : U → [0, 1] represent the membership and nonmembership degrees of the element x ∈ U to A (namely μA (x) and νA (x)) respectively. 1 - μA (x) - νA (x) can be interpreted as the hesitancy degree of x ∈ U to A. All IFS subsets of U is IFS(U).
Definition 2. [4, 5] For two IFSs A = {〈μA (x) , νA (x) 〉 : x ∈ U}, B = {〈μB (x) , νB (x) 〉 : x ∈ U}. The operational laws are defined as follows:
A ⊆ B ⇔ ∀ x ∈ U, μA (x) ≤ μB (x), νA (x) ≥ νB (x);
A = B ⇔ A ⊆ B and B ⊆ A;
∼A = {〈νA (x) , μA (x) 〉 : x ∈ U};
A ∩ B = {〈μA (x) ∧ μB (x) , νA (x) ∨ νB (x) 〉 : x ∈ U};
A ∪ B = {〈μA (x) ∨ μB (x) , νA (x) ∧ νB (x) 〉 : x ∈ U}.
Definition 3. An intuitionistic fuzzy information system (IFIS) is defined as IFIS = (U, AT, {Va : a ∈ AT} , {fa : a ∈ AT}), in which U and AT are nonempty, finite object set and attribute set respectively. Va consists of all IF values of a ∈ AT, fa is an information function, such that fa(x)= 〈 μa(x),νa(x)〉 , and ∀x ∈ U, ∀a ∈ AT, 0 ≤ μa (x) + νa (x) ≤1.
Moveover, if C and D are conditional and decision attribute sets with C∩ D = ∅ and AT = C ∪ D, then IFIS is denoted as intuitionistic fuzzy decision information system (short for IFDS).
DTRS and MG-DTRSs
Since considering the risk and enhance the robustness, Yao [47] proposed the DTRS model in 1992.
Definition 4. [47] Let R be an equivalence relation on U, and 0 ≤ β < α ≤ 1. The lower and upper approximations of X ⊆ U are defined by, respectively,
where α, β are two probability thresholds, P (X| [x] R) is the conditional probability of [x] R with respect to X. We refere the pair as DTRS.
After that, new DTRS models called the MG-DTRSs model were proposed by combining the MGRSs and DTRS.
Definition 5. [48] Let be m equivalence relations on U and 0 ≤ β < α ≤ 1. The optimistic and pessimistic multigranulation lower and upper approximations of X ⊆ U are defined by, respectively,
where ∨ and ∧ mean the logical operator “or” and “and”, respectively.
The order pairs and are denoted as the optimistic and pessimistic MG-DTRSs, respectively.
A new similarity measure between IFSs
In this section, we consider similarity measures between IFSs starting from reminding the axiomatic definition of similarity measures.
Definition 6. [7] ∀A, B, C ∈ IFS (U), a similarity measure S on IFS (U) is a real function S : IFS (U) × IFS (U) → [0, 1], which satisfies the following conditions:
0 ≤ S (A, B) ≤1;
S (A, B) =1, if A = B;
S (A, B) = S (B, A);
S (A, C) ≤ S (A, B) and S (A, C) ≤ S (B, C) if A ⊆ B ⊆ C;
Mitchell [9] pointed out that condition (2) should be replaced by: S (A, B) =1, if and only if A = B; Li [49] claimed that the following condition should be rejoined: (5) S (A, B) =0, if and only if ∀x ∈ U, μA (x) =1, νA (x) =0 and B = ∼ A, or μB (x) =1, νB (x) =0 and A = ∼ B.
Now, we review some existing similarity measures, which were widely used. A, B ∈ IFS (U) are two IFSs.
In fact, in order to calculate the similarity of two IFSs, we should comprehensively consider the membership, non-membership and hesitation. There may be three kinds of problems with the similarity calculated using the above formula. First, Comparing the same IFS A with two different IFSs B and C, one might get S* (A, B) = S* (A, C). However, it is counter-intuitive. Second, S* (A, A) =1, ∀A ∈ IFS (U), and we will get S* (〈0, 0〉, 〈0, 0〉) = S* (〈0.2, 0.5〉, 〈0.2, 0.5〉) = S* (〈0.2, 0.8〉, 〈0.2, 0.8〉) =1. However, the hesitance of the three groups of IFVs are not equal, and the degree of similarity should not be the same. Third, let A = 〈0, 0〉 and B = 〈a, b〉, if a + b = 1, then S* (A, B) =0.5 for many formulas above. This is obviously unreasonable.
In order to resolve some unreasonable results presented above, we modify the condition (2) as (2″) S (A, B) =1 if and only if A = B, and πA (xi) = πB (xi) =0, ∀ xi ∈ U. Thus we define a new similarity measure.
Definition 7. Let A, B ∈ IFS (U), a new measure is defined by SLM (A, B), in which
Theorem 1.For A, B ∈ IFS (U), SLM (A, B) is a similarity measure on U.
Proof. Let A = {〈μA (x) , νA (x) 〉 : x ∈ U}, B = {〈μB (x) , νB (x) 〉 : x ∈ U}.
(1) It is obviously that SLM (A, B) ≤1.
If 〈a + x, b - y〉 and 〈a, b〉 are two IFVs, thena + x, b - y ∈[0, 1], and 0 ≤ a + x + b - y ≤ 1.
Denote f (x, y) = |a - (a + x) | + |b - (b - y) | + | (a - b) - (a + x - b + y) | + |(1 - a - b) - (1 - a - x - b + y) | + ((1 - a - b) + (1 - a - x - b + y))/2 = |x| + |y| + |x + y| + |x - y| + (y - x)/2 + (1 - a - b).
Let x ∈ [0, 1 - a] and y = 0, . If y = 0, x ∈ [0, 1 - a], f is an increasing function on the first argument.
Let x ∈ [- a, 0] and y = 0, . If y = 0, x ∈ [- a, 0], f is a decreasing function of the first argument.
Similarly, if x = 0, y ∈ [0, b], f is an increasing function of the second argument. If x = 0, y ∈ [a - 1, 0], f is a decreasing function of the second argument.
Therefore, given x, y ≥ 0, the maximum value of f is ; given x, y ≤ 0, the maximum value of f is .
(4) According to (1), set 0 ≤ u1 ≤ u2 ≤ 1 - a, 0 ≤ v2 ≤ v1 ≤ b, let A (xi) = 〈a, b〉, B (xi) = 〈a + u1, b - v1〉, C (xi) = 〈a + u2, b - v2〉, then A (xi) ⊆ B (xi) ⊆ C (xi) and f (u2, v2) ≥ f (u2, v1) ≥ f (u1, v1); that is SLM (A (xi) , B (xi)) ≥ SLM (A (xi) , C (xi)), which implies that SLM (A, B) ≥ SLM (A, C).
we can also have SLM (B, C) ≥ SLM (A, C). □
Proposition 1.Let A, B ∈ IFS (U) be two IFSs, then SLM (A, B) = SLM (∼ A, ∼ B).
Lemma 1.Let A, B ∈ IFS (U) be two IFSs. For all x ∈ U, if μA (x) + νA (x) =1, μB (x) + νB (x) =1 and πA (x) ≥ πB (x), then SLM (A, A) ≤ SLM (B, B).
Proof. It is straightforward by Definition 7. □
In fact, hesitancy degree reflects the uncertainty of information. The greater the hesitation is, the more uncertainty there are. Thus, S (A, B) =1 if and only if A = B, and πA (xi) = πB (xi) =0, ∀ xi ∈ U.
The reasonability of the proposed method is illustrated by a numerical example.
Example 1. The result of comprehensive comparison is shown in Table 1.
We can see that SC (A, B) = SDF (A, B) =1 for two different IFSs in cases 3,4,8,9, which means that the axiom (2) is not satisfied.
In case 2, SSW (A, B) =0, which means that the axiom (5) is not satisfied.
In case 7, SSZ (A, B) = non is unreasonable.
In case 8,9, None of these methods can distinguish between the two pairs of IFSs except SLM.
As for SC, SH, SLX, SDF, SSz, SM, SSW, one may get identical results from different pairs of A, B, which cannot satisfy the application of pattern recognition. According to the voting model, the existing voting results will affect the abusers. If πA (xi) = πB (xi) ≠0, due to the uncertainty of description, even if A = B, their similarity should not equal to 1.
∥Overall, the proposed similarity measure is more reasonable and has non counterintuitive cases.
The result of comprehensive comparison(counterintuitive cases are in bold type)
1
2
3
4
5
6
7
8
9
10
A
〈 0,1〉
〈 0,0〉
〈 0,0〉
〈 0.3,0.3〉
〈 0.3,0.7〉
〈 0.3,0.5〉
〈 0.2,0.2〉
〈 0.2,0.3〉
〈 0.2,0.3〉
〈 0.2,0.6〉
B
〈 1,0 〉
〈 0,1〉
〈 0.5,0.5〉
〈 0.4,0.4〉
〈 0.3,0.7〉
〈 0.3,0.5〉
〈 0.2,0.2〉
〈 0.1,0.2〉
〈 0.3,0.4〉
〈 0.1,0.6〉
SC
0
0.50
1
1
1
1
1
1
1
0.90
SH
0
0.50
0.50
0.90
1
1
1
0.90
0.90
0.90
SLX
0
0.50
0.75
0.95
1
1
1
0.95
0.95
0.90
SDF(p=1)
0
0.50
1
1
1
1
1
1
1
0.90
SSz
0
0.50
0.50
0.50
1
1
non
0.5
0.5
0.81
SM
0
0.50
0.50
0.90
1
1
1
0.90
0.90
0.90
SSW
0
0
0.35
0.99
1
1
1
0.99
0.99
0.99
SLM
0
0.13
0.38
0.83
1
0.95
0.85
0.75
0.80
0.85
Similarity measure-based IF-DTRS
In this section, the similarity measure is applied to define approximation operators (lower approximation,upper approximation).
Definition 8. Let IFIS = (U, AT, {Va : a ∈ AT} , {fa : a ∈ AT}), the mapping between two objects x, y ∈ U w.r.t. attribute set A ⊆ AT is defined as
In order to improve readability, we write SA (x, y) in stead of SLM(A) (x, y).
Definition 9. Let 0 ≤ λ ≤ 1, A ⊆ AT. A similarity relation on U is defined as:
It is obviously that is reflexive and symmetric. is denoted as the similarity class of x w.r.t. A in IFIS. The similarity relation above forms a covering of U.
Proposition 2.∀x ∈ U, if 0 ≤ λ1 ≤ λ2 ≤ 1, then .
Proof. It is straightforward.□
Definition 10. Let 0 ≤ λ ≤ 1 and 0 ≤ β < α ≤ 1. For any A ⊆ AT, the similarity measure-based α-lower and β-upper approximations of X ⊆ U are denoted by and , respectively, in which
We call the order pair as similarity measure-based IF-DTRS of X w.r.t. attribute set A in IFIS. If , then X is definable according to A in IFIS; otherwise, X is undefinable inIFIS.
Let IFDS = (U, AT = C ∪ D, {Va : a ∈ C} , {fa : a ∈ C}), RD is an indiscernibility relation on U, and U/RD = {D1, D2, ⋯ , Dr}. Therefore, the similarity measure-based α-IF positive is defined by .
Now the dependent degree in similarity measure-based IF-DTRS can be defined as follows,
Proposition 3.Let 0 ≤ λ ≤ 1 and 0 ≤ β < α ≤ 1. For any A ⊆ AT, the following properties hold:
and ;
and ;
If X ⊆ U, then ;
if X1 ⊆ X2 ⊆ U, then , ;
if 0 ≤ β1 ≤ β2 < α1 ≤ α2 ≤ 1, then , .
Proof. They are straightforward. □
From (4) of Proposition 3, if X, Y ⊆ U, then it is clear that:
Example 2. Let IFIS = (U, AT, {Va : a ∈ AT} , {fa : a ∈ AT}) as shown in Table 2, where U = {x1, x2, ⋯ , x14}, A = {a1, a2, a3, a4}.
Intuitionistic fuzzy information system
U
a1
a2
a3
a4
a5
a6
x1
〈0.40, 0.50〉
〈0.30, 0.70〉
〈0.80, 0.20〉
〈0.40, 0.50〉
〈0.70, 0.10〉
〈0.10, 0.70〉
x2
〈0.30, 0.50〉
〈0.40, 0.50〉
〈0.60, 0.10〉
〈0.40, 0.50〉
〈0.70, 0.30〉
〈0.30, 0.70〉
x3
〈0.30, 0.50〉
〈0.10, 0.80〉
〈0.80, 0.10〉
〈0.40, 0.50〉
〈0.80, 0.20〉
〈0.20, 0.80〉
x4
〈0.10, 0.80〉
〈0.10, 0.80〉
〈0.40, 0.50〉
〈0.10, 0.80〉
〈0.80, 0.10〉
〈0.10, 0.80〉
x5
〈0.70, 0.30〉
〈0.40, 0.50〉
〈0.90, 0.10〉
〈0.40, 0.60〉
〈0.90, 0.00〉
〈0.00, 0.90〉
x6
〈0.30, 0.60〉
〈0.40, 0.60〉
〈0.70, 0.20〉
〈0.50, 0.50〉
〈0.40, 0.50〉
〈0.50, 0.40〉
x7
〈0.50, 0.40〉
〈0.50, 0.00〉
〈0.50, 0.40〉
〈0.50, 0.40〉
〈0.50, 0.40〉
〈0.40, 0.50〉
x8
〈0.70, 0.00〉
〈0.70, 0.10〉
〈0.60, 0.10〉
〈0.60, 0.20〉
〈0.60, 0.10〉
〈0.10, 0.60〉
x9
〈0.60, 0.40〉
〈0.40, 0.40〉
〈0.60, 0.10〉
〈0.40, 0.40〉
〈0.40, 0.50〉
〈0.50, 0.40〉
x10
〈0.30, 0.60〉
〈0.30, 0.60〉
〈0.30, 0.70〉
〈0.30, 0.60〉
〈0.30, 0.60〉
〈0.60, 0.30〉
x11
〈0.20, 0.00〉
〈0.70, 0.00〉
〈0.60, 0.10〉
〈0.20, 0.20〉
〈0.20, 0.70〉
〈0.70, 0.20〉
x12
〈0.50, 0.30〉
〈0.50, 0.00〉
〈0.70, 0.20〉
〈0.20, 0.00〉
〈0.60, 0.30〉
〈0.30, 0.60〉
x13
〈0.50, 0.40〉
〈0.60, 0.40〉
〈0.50, 0.00〉
〈0.40, 0.10〉
〈0.50, 0.50〉
〈0.40, 0.50〉
x14
〈0.70, 0.20〉
〈0.50, 0.50〉
〈0.60, 0.00〉
〈0.60, 0.00〉
〈0.60, 0.10〉
〈0.10, 0.60〉
Setting λ = 0.85, the similarity measure classes based on similarity measure defined by Definition 9 for attribute set A are
Let X = {x2, x3, x4, x6, x7, x9, x12, x13} and α = 0.80, β = 0.50. On the basis of A, the similarity measure-based α-lower and β-upper approximations of X can be obtained by
Therefore, the positive region and the dependency degree of X w.r.t. A are calculated as
Similarity measure-based MG-IF-DTRSs
In practical problems, information data is often obtained from different sources with the same domain set. Inspired by the idea of multi-granulation, multiple granular structures induced by IF similarity measure relations will be used to approximate any target concept in this section.
Let MIFIS = {IFISi : IFISi = (U, Ai, {Va : a ∈ Ai} , {fa : a ∈ ATi} , 1 ≤ i ≤ m)} be a multi-source IFIS, briefly written as MIFIS = (U, A), where A = {A1, A2, ⋯ , Am}. In particular, when a decision attribute is added to MIFIS, we refer it as the multi-source IFDS, denoted as MIFDS = {IFDSi : IFDSi = (U, Ai, {Va : a ∈ Ai} , {fa : a ∈ ATi} , 1 ≤ i ≤ m, D)}, briefly written as MIFDS = (U, A, D).
Definition 11. Let MIFIS = (U, A), 0 ≤ λ ≤ 1. For 0 ≤ β < α ≤ 1 and X ⊆ U, the optimistic similarity measure-based α-lower and β-upper approximations of X w.r.t. A in MIFIS, are defined asfollows
where ∨ and ∧ mean the logical operator “or” and “and”, respectively.
If , then X is definable according to MIFIS; otherwise, X is undefinable in MIFIS. The order pair is referred to similarity measure-based optimistic MG-IF-DTRS of X w.r.t. A in MIFIS.
The positive, boundary and negative regions of X w.r.t. A in optimistic similarity measure-basedMG-IF-DTRS are defined by
Objects in the three regions are changed corresponding to the value of α and β in optimistic similarity measure-based MG-IF-DTRS. And the degree of dependence in optimistic similarity measure-based MG-IF-DTRS is defined by
Proposition 4.Let MIFIS = (U, A), 0 ≤ λ ≤ 1. For 0 ≤ β < α ≤ 1 and X ⊆ U, then:
, ;
;
, ;
If X ⊆ Y ⊆ U, then and ;
if 0 ≤ β1 ≤ β2 < α1 ≤ α2 ≤ 1, then , and .
Proof. They are straightforward. □
Proposition 5.Let MIFIS = (U, A), 0 ≤ λ ≤ 1, and 0 ≤ β < α ≤ 1. ∀X ⊆ U, if B ⊆ A, then:
;
.
Proof. (1) such that . Since B ⊆ A, from Definition 11, we have .
(2) It can be obtained similarly. □
Definition 12. Let MIFIS = (U, A), 0 ≤ λ ≤ 1. For 0 ≤ β < α ≤ 1 and X ⊆ U, the pessimistic similarity measure-based α-lower and β-upper approximations of X w.r.t. A in MIFIS are defined as follows
where ∨ and ∧ mean the logical operator “or” and “and”, respectively.
If , then X is definable according to MIFIS; otherwise, X is undefinable in MIFIS. The order pair is denoted as pessimistic similarity measure-based MG-IF-DTRS of X w.r.t. A in MIFIS.
The positive, boundary and negative regions of X w.r.t. A in pessimistic similarity measure-based MG-IF-DTRS are defined, respectively, by
Objects in the three regions are changed corresponding to the value of α and β in similarity measure-based pessimistic MG-IF-DTRS. And the dependent degree in similarity measure-based pessimistic MG-IF-DTRS is defined by
Proposition 6.Let MIFIS = (U, A), 0 ≤ λ ≤ 1. For 0 ≤ β < α ≤ 1 and X ⊆ U, then
, ;
;
, ;
If X ⊆ Y ⊆ U, then and ;
if 0 ≤ β1 ≤ β2 < α1 ≤ α2 ≤ 1, then and .
Proof. They are straightforward. □
Proposition 7.Let MIFIS = (U, A), 0 ≤ λ ≤ 1. For 0 ≤ β < α ≤ 1 and X ⊆ U. If B ⊆ A, then:
;
.
Proof. They are obtained similarly as Proposition 5.
□
Example 3. In Example 2, let A1 = {a1, a2}, A2 = {a3, a4}, A3 = {a5}, A4 = {a6}, the IFIS is changed into the MIFIS. Setting λ = 0.85, the similarity measure classes based on similarity measure defined by Definition 9 for attribute set Ai are as in Table 3.
The similarity measure based similarity measure classes
x
x1
{x1, x2, x3, x6, x10}
{x1, x2, x3, x5, x6}
{x1, x3, x4, x8, x14}
{x1, x3, x4, x8, x14}
x2
{x1, x2, x6, x10}
{x1, x2, x3, x6, x9}
{x2, x3, x12}
{x2, x3, x12}
x3
{x1, x3}
{x1, x2, x3, x5, x6}
{x1, x2, x3, x4}
{x1, x2, x3, x4}
x4
{x4}
{x4}
{x1, x3, x4, x5}
{x1, x3, x4, x5}
x5
{x5, x9, x14}
{x1, x3, x5, x6}
{x4, x5}
{x4, x5}
x6
{x1, x2, x6, x10}
{x1, x2, x3, x5, x6, x9}
{x6, x7, x9, x10, x13}
{x6, x7, x9, x10, x13}
x7
{x7, x12}
{x7}
{x6, x9, x13}
{x6, x9, x13}
x8
{x8}
{x8}
{x1, x8, x14}
{x1, x8, x14}
x9
{x5, x9, x13}
{x2, x6, x9}
{x6, x7, x9, x10, x13}
{x6, x7, x9, x10, x13}
x10
{x1, x2, x6, x10}
{x10}
{x6, x9, x10, x11}
{x6, x9, x10, x11}
x11
{x11}
{x11}
{x10, x11}
{x10, x11}
x12
{x7, x12}
{x12}
{x2, x7, x12, x13}
{x2, x7, x12, x13}
x13
{x9, x13}
{x13}
{x6, x7, x9, x12, x13}
{x6, x7, x9, x12, x13}
x14
{x5, x14}
{x14}
{x1, x4, x8, x14}
{x1, x4, x8, x14}
Let X = {x2, x3, x4, x6, x7, x9, x12, x13} and α = 0.80, β = 0.50. On the basis of A, the similarity measure-based α-lower and β-upper approximations of X for both optimistic and pessimistic MG-IF-DTRSs can be calculated as
The positive, boundary and negative regions of X w.r.t A in MIFIS are
Optimal granulation selection in MG-IF-DTRSs
Granulation selection is an important issue in knowledge representation and data mining in rough set theory [52]. In this section, we investigated optimal granulation selection with different requirements in optimistic and pessimistic similarity measure-based MG-IF-DTRSs.
Let MIFDS = (U, A, D), the similarity measure-based optimistic and pessimistic α-lower and β-upper approximations distribution functions of MIFDS are defined as follows:
Definition 13. Let MIFIS = (U, A), 0 ≤ λ ≤ 1 and 0 ≤ β < α ≤ 1, B ⊆ A = {A1, A2, ⋯ , Am}.
If (), B is an optimistic α-lower (β-upper)distribution granulation selection of MIFDS. Moveover, ∀B′ ⊂ B is not an optimistic α-lower (β-upper) distribution granulation selection of MIFDS, B is an optimistic α-lower (β-upper) distribution optimal granulation selection of MIFDS.
If (), B is a pessimistic α-lower (β-upper) distribution granulation selection of MIFDS. Moveover, ∀B′ ⊂ B is not a pessimistic α-lower (β-upper) distribution granulation selection of MIFDS, B is a pessimistic α-lower (β-upper) distribution optimal granulation selection of MIFDS.
In order to select the optimal granulation of MIFDS, some measures are discussed in MG-IF-DTRS.
The optimistic and pessimistic similarity measure-based α-lower and β-upper approximations quality functions are denoted as follows:
Proposition 8.Setting 0 ≤ λ ≤ 1 and 0 ≤ β < α ≤ 1, let B ⊆ A = {A1, A2, ⋯ , Am}. Then we have:
, ;
, .
Proof. They are obtained by Propositions 5and 7. □
Definition 14. Let MIFDS = (U, A, D), 0 ≤ λ ≤ 1 and 0 ≤ β < α ≤ 1, B ⊆ A = {A1, A2, ⋯ , Am}.
If (), B is an optimistic α-lower (β-upper)quality granulation selection of MIFDS. Moveover, ∀B′ ⊂ B is not an optimistic α-lower (β-upper) distribution granulation selection of MIFDS, B is an optimistic α-lower (β-upper) quality optimal granulation selection of MIFDS.
If (), B is a pessimistic α-lower (β-upper) quality granulation selection of MIFDS. Moveover, ∀B′ ⊂ B is not a pessimistic α-lower (β-upper) quality granulation selection of MIFDS, B is a pessimistic α-lower (β-upper) quality optimal granulation selection ofMIFDS.
Theorem 2.Let MIFDS = (U, A, D), 0 ≤ λ ≤ 1. For 0 ≤ β < α ≤ 1 and B ⊆ A = {A1, A2, ⋯ , Am.
B is an optimistic α-lower (β-upper) distribution optimal granulation selection ⇔ B is an optimistic α-lower (β-upper) quality optimal granulation selection of MIFDS.
B is a pessimistic α-lower (β-upper) distribution optimal granulation selection ⇔ B is a pessimistic α-lower (β-upper) quality optimal granulation selection of MIFDS.
Proof. (1) For optimistic α-lower distribution optimal granulation selection, is obvious.
For any Dj ∈ U/RD, form (1) of Proposition 4, we have , then . If , for any j ≤ r, we have . Otherwise, if there exists Dk ∈ U/RD such that , then .
Therefore,
Thus, B is an optimistic α-lower distribution optimal granulation selection ⇔ B is an optimistic α-lower quality optimal granulation selection of MIFDS.
Similarly, we have B is an optimistic β-upper distribution optimal granulation selection ⇔ B is an optimistic β-upper quality optimal granulation selection of MIFDS.
(2) can be proved by similar way. □
Definition 15. Let MIFDS = (U, A, D), 0 ≤ λ ≤ 1. For 0 ≤ β < α ≤ 1, and B = A - {Aj}.
If (), we say that Aj is anoptimistic α-lower (β-upper) quality dispensable granularity of MG-IF-DTRS; otherwise, Aj is an optimistic α-lower (β-upper) quality indispensable granularity of MG-IF-DTRS.
If (), we say that Aj is a pessimistic α-lower (β-upper) quality dispensable granularity of MG-IF-DTRS; otherwise, Aj is a pessimistic α-lower (β-upper) quality indispensable granularity of MG-IF-DTRS.
In fact, if Aj is an optimistic (pessimistic) α-lower (β-upper) quality dispensable granularity of MG-IF-DTRS, after deleting Aj, the optimistic (pessimistic) α-lower (β-upper) quality will not change.
Theorem 3.Let MIFDS = (U, A, D), 0 ≤ λ ≤ 1 and 0 ≤ β < α ≤ 1. B ⊆ A = {A1, A2, ⋯ , Am} is an α-lower (β-upper) optimistic (pessimistic) quality granulation selection. Then Aj ∈ B is an indispensable granularity in B if and only if B - {Aj} is not an α-lower (β-upper) optimistic (pessimistic) quality granulation selection.
Proof. It is easily verified by Definition 14. □
The core of MIFDS is composed of all indispensable granules.
The α-lower (β-upper) optimistic quality decreases monotonically with the granular structures decrease. Keeping the α-lower (β-upper) optimistic quality unchanged, we can get the α-lower (β-upper) optimistic quality optimal granulation. Here, we use the α-lower (β-upper) optimistic quality to characterize the significance of each granularity.
Definition 16. Let MIFDS = (U, A, D), 0 ≤ λ ≤ 1 and 0 ≤ β < α ≤ 1. The optimistic α-lower and β-upper inner significance measures of Aj are defined by
() reflects the extent to which the optimistic α-lower (β-upper) quality of A respect to MIFDS decreases after removing granularity Aj.
Similarly, the optimistic α-lower and β-upper outer significance measures of Aj can also be defined.
Definition 17.MIFDS = (U, A, D), 0 ≤ λ ≤ 1 and 0 ≤ β < α ≤ 1. B ⊆ A = {A1, A2, ⋯ , Am} and Aj ∉ B. The optimistic α-lower and β-upper outer significance measure of Aj according to B are defined by
() reflects the extent to which the optimistic α-lower (β-upper) quality of B respect to MIFDS increases after adding granularity Aj to B.
Theorem 4.Let MIFDS = (U, A, D), 0 ≤ λ ≤ 1 and 0 ≤ β < α ≤ 1. B ⊆ A = {A1, A2, ⋯ , Am} is an optimistic α-lower (β-upper) granulation selection. Then Aj ∈ B is an optimistic α-lower (β-upper) indispensable granularity if and only if ().
Proof. ⇒ If Aj is an optimistic α-lower (β-upper) indispensable granularity in B, according to Definition 15 and Theorem 3, it follows that (), i.e., ().
⇐ B is an optimistic α-lower (β-upper) granulation selection. If (), then (). From Theorem 3, if B - {Aj} is not an optimistic α-lower granulation selection. According to Definition 15, Aj is an optimistic α-lower (β-upper) indispensable granularity. □
Theorem 5.Let MIFDS = (U, A, D), 0 ≤ λ ≤ 1 and 0 ≤ β < α ≤ 1. We denote and as the optimistic α-lower and β-upper core granularity set, then (1) ; (2) .
Proof. (1) It is easy to know that A is an optimistic α-lower granulation selection. According toTheorem 3, Aj is an optimistic α-lowerindispensable granularity if and only if . The set of all indispensable granularity in MIFDS = (U, A, D) is denoted as the core of A. Therefore, .
(2) can be proved by similar way. □
Theorem 6.Let MIFDS = (U, A, D), 0 ≤ λ ≤ 1 and 0 ≤ β < α ≤ 1, B ⊆ A = {A1, A2, ⋯ , Am}. If () and (), then B is an optimistic α-lower (β-upper) optimal granulation selection.
Proof. The proof is obvious from Definition 16, Theorems 3, 4 and 5. □
The pessimistic α-lower (β-upper) quality increases monotonically with the granular structures decrease. Keeping the pessimistic α-lower (β-upper) quality unchanged, we can get the pessimistic α-lower quality optimal granulation. Here, we use the pessimistic α-lower (β-upper) quality to characterize the significance of each granularity.
Definition 18. Let MIFDS = (U, A, D), 0 ≤ λ ≤ 1 and 0 ≤ β < α ≤ 1. The pessimistic α-lower and β-upper inner significance measures of Aj are defined by
() reflects the extent to which the pessimistic α-lower (β-upper) quality of A respect to MIFDS increases after removing granularity Aj.
Similarly, the pessimistic α-lower and β-upper outer significance measures of Aj can also be defined.
Definition 19.MIFIS = (U, A), 0 ≤ λ ≤ 1 and 0 ≤ β < α ≤ 1. B ⊆ A = {A1, A2, ⋯ , Am} and Aj ∉ B. The pessimistic α-lower and β-upper outer significance measure of Aj w.r.t. B are defined by
() reflects the extent to which the pessimistic α-lower (β-upper) quality of B respect to MIFDS decreases after adding granularity Aj to B.
Theorem 7.Let MIFIS = (U, A), 0 ≤ λ ≤ 1 and 0 ≤ β < α ≤ 1. B ⊆ A = {A1, A2, ⋯ , Am} is a pessimistic α-lower (β-upper) granulation selection. Then pessimistic Aj ∈ B is an α-lower (β-upper) indispensable granularity if and only if ().
Proof. According to Theorem 4, similarly, we have Theorem 7. □
Theorem 8.Let MIFIS = (U, A), 0 ≤ λ ≤ 1 and 0 ≤ β < α ≤ 1. We denote and as the pessimistic α-lower and β-upper core granularity set, then
;
.
Proof. According to Theorem 5, similarly, we have Theorem 8. □
Theorem 9.Let MIFIS = (U, A), 0 ≤ λ ≤ 1 and 0 ≤ β < α ≤ 1, B ⊆ A = {A1, A2, ⋯ , Am}. If () and (),then B is an optimistic α-lower (β-upper) optimal granulation selection.
Proof. According to Theorem 6, similarly, we have Theorem 8. □
Algorithm 1 A heuristic algorithm to select the optimistic α-lower optimal granulation
Input:MIFDS = {IFDSi : IFDSi = (U, Ai, D)}, 0 ≤ λ ≤ 1 and 0 ≤ β < α ≤ 1, B = {Ai : i ≤ m}.
Output: The optimistic α-lower optimal granulation selection(short for LOGα).
1: set S : =∅, LOGα : =∅, CORE (B) : =∅ and t : =1
forAj ∈ B
3: compute
4: if
5: set S : = CORE (B) ∪ {Bj
end if
end for
set S : = CORE (B)
While
choose any
compute
For Aj ∈ B - S
compute
If
set
end if
end for
set
End While
set LOGα : = S
While t do
set t : =0
for Ak∈ LOGα
if set t : =1 set LOGα : = LOGα - {Ak
break
end if
end for
end while
Return LOGα
The process of obtaining the four kinds of optimal granulation is basically the same. Thus, we only consider the optimistic α-lower quality optimal granulation selection of MG-IF-DTRS in the followingdiscussion.
According to the above discussion, we present a heuristic algorithm to select the optimistic α-lower optimal granulation. The process is outlined inAlgorithm 1.
In Algorithm 1, steps 2–7 is to compute CORE (B), the time complexity is O (|U||B|). Steps 8–19, set S = CORE (B), the granules that have higher outer significance are heuristically added to S until S becomes the α-lower optimistic granulation selection, the time complexity is O (|U||B - S||S|). After that, steps 20–31 remove the dispensable granules from LOGα, the time complexity is O (|U||LOGα|2). Finally, the optimistic α-lower optimal granulation is obtained, and the total time complexity of Algorithm 1 isO (|U||B|2).
Example 4. In Example 3, we discussed the optimistic 0.8-lower optimal granulation in MIFDS = {IFDSi : IFDSi = (U, Ai, D)} which considers a decision attribute d in MIFIS = (U, A). Let
Step 2–3: Computing the optimistic 0.8-lower inner significance of Aj ∈ B according to Definition 17:
Step 4–8: Computing the according to Theorem 5:
Since , then . Set .
Step 9: Computing the values of and :
Step 10–13: Computing the optimistic 0.8-lower outer significance measures of Aj, j = 1, 3, 4 according to {A2} by Definition 19:
, , .
Step 14–17: Ranking the granule by the optimistic 0.8-lower outer significance:
Step 18–19: Adding the granule which has higher 0.8-lower outer significance to S until S becomes the optimistic granulation selection:
Adding A3 to {A2}, , and adding A4 to {A2}, .
Step 20: Set S1 = {A2, A3} , S2 = {A2, A4}. According to Definition 15, S1, S2 are two optimistic 0.8-lower granulation selections.
Step 21–30: Computing the optimistic 0.8-lower inner significance of Aj ∈ S1 according to S1 and Aj ∈ S2 according to S2, removing the dispensable granules form the two optimistic 0.8-lower granulation selections S1 and S2:
Since , , ;
Since , , .
According to Theorem 5, Aj ∈ S1 are all optimistic 0.8-lower indispensable granularity, the same asAj ∈ S2.
Step 31: Return the optimistic optimal granulation:
According to Theorem 9, S1, S2 are two optimistic 0.8-lower optimal granulation selections.
Similarly, B is a pessimistic 0.5-upper optimal granulation of MIFDS = {IFDSi : IFDSi = (U, Ai, D)}.
Conclusion
For the defects of the existing similarity measures in IFSs, this paper has improved the similarity measures by considering the hesitancy degree and the endpoint distance of membership and non-membership. The numerical example shows that the results of the proposed method are more reasonable and elaborate than other methods. According to similarity measure, a novel similarity measure-based IF-DTRSs and two corresponding MG-IF-DTRSs have been proposed. This study can be considered as generalization model of DTRS and MGRSs in multi-source intuitionistic fuzzy information, which make the models more meaningful in practical applications. Finally, we have established a heuristic algorithm to explore the optimal granulation selection in MIFDS, and an example has been employed to illustrate the processing of the algorithm above.
As future work, we plan to consider the rule extraction in similarity measure-based MG-IF-DTRSs.
Footnotes
Acknowledgements
This paper is supported by the National Natural Science Foundation of China (No. 61573127), Natural Science Foundation of Hebei Province (No. A2018210120).
References
1.
X.K.Wang, H.G.Peng, J.Q.Wang. Hesitant linguistic intuitionistic fuzzy sets and their application in multi-criteria decision-making problems, Int J Uncertain Quantif8(4) (2018), 321–341.
2.
P.Liu, S.M.Chen, Multiattribute group decision making based on intuitionistic 2-tuple linguistic information, Inf Sci s430–431 (2018), 599–619.
K.T.Atanassov, More on intuitionistic fuzzy sets, Fuzzy Sets Syst33(1) (1989), 37–45.
6.
W.L.Hung, M.S.Yang, Similarity measures of intuitionistic fuzzy sets based on hausdorff distance, Pattern Recogn Lett25(14) (2004), 1603–1611.
7.
D.F.Li, C.T.Cheng, New similarity measures of intuitionistic fuzzy sets and application to pattern recognitions, Pattern Recogn Lett23(1) (2002), 221–225.
8.
D.H.Liu, X.H.Chen, D.Peng, The Intuitionistic Fuzzy Linguistic Cosine Similarity Measure and Its Application in Pattern Recognition, Complexity (1) (2018), 1–11.
9.
H.B.Mitchell, On the Dengfeng-Chuntian similarity measure and its application to pattern recognition, Pattern Recogn Lett24(16) (2003), 3101–3104.
R.J.Kuo, T.C.Lin, F.E.Zulvia, C.Y.Tsai, A hybrid metaheuristic and kernel intuitionistic fuzzy c-means algorithm for cluster analysis, Appl Soft Comput67 (2018), 299–308.
12.
S.K.De, A.Biswas, R.Roy, An application of intuitionistic fuzzy sets in medical diagnosis, Fuzzy Sets Syst117(2) (2001), 209–213.
13.
J.F.Bota, A.M.Crdenas, C.M.Sierra, Fuzzy cellular automata and intuitionistic fuzzy sets applied to an optical frequency comb spectral shape, Eng Appl Artif Intel62 (2017), 181–194.
14.
S.M.Chen, Measures of similarity between vague sets, Fuzzy Sets Syst74(2) (1995), 217–223.
15.
S.M.Chen, Similarity measures between vague sets and between elements, IEEE Transs Syst Man Cybern Part B27(1) (1997), 153–158.
16.
D.H.Hong, C.Kim, A note on similarity measures between vague sets and between elements, Inf Sci115(1) (1999), 83–96.
17.
H.W.Liu, New similarity measures between intuitionistic fuzzy sets and between elements, Math Comput Model42(1) (2005), 61–70.
18.
S.M.Chen, Y.Randyanto, A novel similarity measure between intuitionistic fuzzy sets and its applications, Int J Pattern Recogn Artif Intel27(07) (2013), 1087–1129.
19.
C.M.Hwang, M.S.Yang, W.L.Hung, M.G.Lee, A similarity measure of intuitionistic fuzzy sets based on the sugeno integral with its application to pattern recognition, Inf Sci189(6) (2012), 93–109.
20.
I.Iancu, Intuitionistic fuzzy similarity measures based on frank t-norms family, Pattern Recogn Lett42(1) (2014), 128–136.
H.Garg, K.Kumar, Distance measures for connection number sets based on set pair analysis and its applications to decision-making process, Applied Intelligence48(10) (2018), 3346–3359.
23.
E.Szmide, J.Kacprzyk, A new concept of a similarity measure for intuitionistic fuzzy sets and its use in group decision making, Lecture Notes in Computer Science (Subseries LNAI)3558 (2005), 272íC282.
24.
Y.Song, X.Wang, L.Lei, A.Xue, A new similarity measure between intuitionistic fuzzy sets and its application to pattern recognition, Abstract and Applied Analysis175 (2014), 1–11.
25.
Z.Pawlak, Rough set, Int J Comput Inf Sci11(5) (1982), 341–356.
26.
W.Z.Wu, Y.H.Qian, T.J.Li, et al. On rule acquisition in incomplete multi-scale decision tables, Inf Sci378(C) (2017), 282–302.
27.
C.Q.Tan, X.H.Chen, Dynamic similarity measures between intuitionistic fuzzy sets and its application, Int J Fuzzy Syst16(4) (2014), 511–519.
28.
J.H.Dai, Y.J.Yan, Z.W.Li, et al. Dominance-based fuzzy rough set approach for incomplete interval-valued data, J Intell Fuzzy Syst34(1) (2018), 423–436.
29.
X.D.Fan, W.D.Zhao, C.Z.Wang, et al. Attribute reduction based on max-decision neighborhood rough set model[J], Knowl Based Syst151 (2018), 16–23.
30.
H.Garg, K.Kumar, An advanced study on the similarity measures of intuitionistic fuzzy sets based on the set pair analysis theory and their application in decision making, Soft Comput22(15) (2018), 4959–4970.
31.
X.Zhang, C.L.Mei, D.F.Chen, et al. A fuzzy rough setbased feature selection method using representative instances, Knowl Based Syst151 (2018), 216–229.
32.
W.S.Du, B.Q.Hu, Aggregation distance measure and its induced similarity measure between intuitionistic fuzzy sets, Pattern Recogn Lett 60–61(C) (2015), 65–71.
33.
A.K.Tiwari, S.Shreevastava, T.Som, et al., Tolerancebased intuitionistic fuzzy-rough set approach for attribute reduction, Expert Syst Appl101 (2018), 205–212.
34.
W.H.Xu, Q.R.Wang, X.T.Zhang, Multi-granulation fuzzy rough sets in a fuzzy tolerance approximation space, Int J Fuzzy Syst13(4) (2011), 246–259.
35.
W.T.Li, W.H.Xu, Multigranulation decieion-theoretic rough set in ordered system, Fundam Inf139 (2015), 191–207.
W.T.Li, W.H.Xu, Multigranulation decieion-theoretic rough set in ordered system, Fundam Inf139 (2015), 191–207.
45.
A.H.Tan, W.Z.Wu, J.H.Li, G.P.Lin, Evidence-theory-based numerical characterization of Multigranulation rough sets in incompletely information systems, Fuzzy Sets Syst294(7) (2016), 18–35.
46.
B.Huang, H.X.Li, G.F.Feng, et al. Inclusion measurebased multi-granulation intuitionistic fuzzy decision-theoretic rough sets and their application to ISSA, Knowl Based Syst138 (2017), 220–231.
47.
Y.Y.Yao, S.K.M.Wang, A decision theoretic framework for approximation concepts, Int J Man Mach Stud37(6) (1992), 793–809.
48.
Y.H.Qian, H.Zhang, Y.L.Sang, et al. Multigranulation decision-theoretic rough sets, Int J Approx Reason55(1) (2014), 225–237.
49.
Y.Li, D.L.Olson, Z.Qin, Similarity measures between intuitionistic fuzzy (vague) sets: a comparative analysis, Pattern Recogn Lett28(2) (2007), 278–285.
50.
H.Garg, R.Arora, Distance and similarity measures for dual hesitant fuzzy soft sets and their applications in multi criteria decision-making problem, Int J Uncertain Quantif7(3) (2017), 229–248.
51.
Z.P.Tian, H.Y.Zhang, J.Q.Wang, et al. Green supplier selection using improved TOPSIS and best-worst method under intuitionistic fuzzy enviroment, Informatica (2018), DOI: 10.15388.
52.
M.S.Liang, J.S.Mi, T.Feng, Optimal granulation selection for multi-label data based on multi-granulation rough sets, Granular Comput (6) (2018), 1–13.