Abstract
In this paper, we propose a new correlation coefficient between intuitionistic fuzzy sets. We then use this new result to compute some examples through which we find that it benefits from such an outcome with some well-known results in the literature. As in statistics with real variables, we refer to variance and covariance between two intuitionistic fuzzy sets. Then, we determined the formula for calculating the correlation coefficient based on the variance and covariance of the intuitionistic fuzzy set, the value of this correlation coefficient is in [–1,1]. Then, we develop this direction to build correlation coefficients between the interval-valued intuitionistic fuzzy sets and apply it in the pattern recognition problem. Finally, we apply this correlation coefficient in clustering problem with intuitionistic fuzzy information.
Keywords
Introduction
In 1986, Atanassov introduced the intuitionistic fuzzy set [1], which is a generalization of Zadeh’s fuzzy set [19]. An intuitionistic fuzzy set (IFS) consider the information both the membership function and non-membership function. After that, the interval-valued intuitionistic fuzzy set (IVIFS) was introduced by Atanassov and Gargov [2]. In which, the membership function and non-membership function are subintervals of [0, 1]. As opposed to fuzzy sets, intuitionistic fuzzy set also have broad applications for uncertain data processing such as decision making, medical diagnose, agriculture [3, 20]. Along with similar measurements, distance measurements, correlation measurements of intuitionistic fuzzy sets and interval – valued intuitionistic fuzzy set are also studied and widely used in many areas and now it is a hot topic [4–8, 21].
The concept of correlation coefficient of intuitionistic fuzzy sets was first studied by Gerstenkorn, and Mańko in 1991 [6]. In this correlation coefficient, the variance and covariance are constructed directly from the scalar product of the values of the membership function and the non-membership function, respectively, of two intuitionistic fuzzy sets. Then, the correlation coefficient between the interval–valued intuitionistic fuzzy set is introduced by the Bustince and Burillo [4] in 1995. Later, many authors have studied and developed correlations in different trends and in other spaces. In the results of the correlation studies, in the 1990 s, we find that the values of the correlation coefficient are in the range [0, 1].
In the 21st century, scholars have developed new methods for building correlation coefficients for fuzzy sets. In the new results, some studies show the value of the correlation coefficient value received in the interval [–1, 1]. We can include some methods as Hung’s method [7], and the method of Liu et al. [11]. Hung’s method is based on a statistical viewpoint to calculate the correlation coefficient. In 2016, Liu et al. [11] constructed the correlation coefficient between intuitionistic fuzzy sets based on the concept of the deviation of the intuitionistic fuzzy numbers, after that they also extend this approach to the IVIFSs case.
As the correlation coefficient reflect the relationship between two objects. When the correlation coefficient has values ±1, it shows a linear relationship. Thus it reflects the “consistency” of the two sets. That is the motivation for us to study the correlation coefficient. Moreover, the correlation coefficients found in the literature still have certain limitations, as shown in the examples in this paper.
However, there are some cases where the correlation coefficient according to Hung’s method or method of (Liu et al) does not cover them. These cases can be viewed in the examples presented in this article. In those cases, if using our method, there are reasonable results. In this paper, we propose a new method to determine the correlation coefficient between the intuitionistic fuzzy sets, in which the value of the correlation coefficient computed according to our method lies in interval [–1, 1]. Then, we develop the method to construct the correlation coefficient between the interval–valued intuitionistic fuzzy sets. Finally, we also provide examples to illustrate our approach, and apply it to diagnostic problems in medicine and to the problem of pattern recognition.
The contribution of this article is to provide a new method for determining the correlation coefficient between intuitionistic fuzzy sets. It has overcome the limitations of existing methods. This method is quite simple and its applications are quite varied. As in sample identification, medicine and clustering analysis.
The rest of this paper is organized as follows. In Section 2, we recall the concept of intuitionistic fuzzy set, interval –valued intuitionistic fuzzy set, the correlation coefficients which are computed by using the methods of Gerstenkorn and Mansko [6], Hung [7], Xu [16] and Liu et al. [11]. In Section 3, we construct the new correlation coefficient between the IFSs, in this section we also give some examples that compare the results computed based on our method to other methods. In Section 4, we extend our method to determine the correlation coefficient between the IVIFSs. Finally, we present the conclusion in Section 5.
Preliminaries
Let X be a universal set. We have
in which μ A (x) ∈ [0, 1] and ν A (x) ∈ [0, 1] are the membership degree and the non-membership of the element x in X to A, respectively, and μ A (x) + ν A (x) ≤1, ∀ x ∈ X .
Now, we recall some existing correlation coefficient of intuitionistic fuzzy sets in literature. Given X ={ x1, x2, …, x
n
} is a universal set and
are two IFSs on X.
where
where
where
where
in which
for all i = 1, 2, …, n .
Let X ={ x1, x2, …, x n } be a finite set, A and B are two arbitrary IFSs in X.
where
It is easily to obtain (1), (2).
(3). According to the Cauchy –Schwarz inequality, we have
So that |COV (A, B) | ≤ D (A) 0.5D (B) 0.5□
Now, we can define the correlation coefficient of the intuitionistic fuzzy sets. It is similar to the correlation coefficient of real number variables in the statistic theory.
ρ (A, B) = ρ (B, A) -1 ≤ ρ (A, B) ≤1 If A = kB + b for some k > 0, then ρ (A, B) =1. Here, A = kB + b means that μ
A
= kμ
B
+ b and ν
A
= kν
B
+ b. If A = kB + b for some k < 0, then ρ (A, B) = -1 .
Straightforward. From proposition 1, we have |COV (A, B) |≤ D (A) 0.5D (B) 0.5. It means that -D (A) 0.5 D (B) 0.5 ≤ COV (A, B) ≤ D (A) 0.5D (B) 0.5 Hence, we have
If μ
A
= kμ
B
+ b and ν
A
= kν
B
+ b we have
and
If k > 0 then
If k < 0 then
Now, we consider some examples to compare our proposed correlation coefficient and some other knowledge correlation coefficient.
X ={ x1, x2, x3 } where
COV (A, B) = -0.035; D (A) =0.02 and D (B) =0.075 .
So that
These results are consistent, because A is the set of elements whose value increases, B is a set whose values are decreasing. But not exist k ≠ 0 such that B = kA + b, in particular μ B = - μ A + 0.4 and ν B = -2ν A + 0.4.
E (A) = (0.17, 0.19) ; E (B) = (0.283, 0.281); COV (A, B) = -0.00012 ; D (A) =0.00012 ; D (B) =0.00012 and ρ (A, B) = -1 .
We cannot determine the correlation coefficient according to this method.
In this example, the results of our method match those of the Hung’s method.In this example, the results calculated according to our method coincide with the results calculated by the Hung’s method, both methods yield have correlation coefficient ρ (A, B) = -1 . This result is reasonable, because two intuitionistic fuzzy sets A, B have a linear relation B = kA + b with k = -0.1 and b = 0.3. But the method of Liu et al. in [11] does not tell us anything about this data.
E (A) = (0.17, 0.19) ; E (B) = (0.434, 0.438); COV (A, B) =0.00024 ; D (A) =0.00012 ; D (B) =0.000048 and ρ (A, B) =1 .
We cannot determine the correlation coefficient according to this method.
In this example, the results of our method match those of the Hung’s method.In this example, the results calculated according to our method also coincide with the results calculated by the Hung’s method, both methods yield have correlation coefficient ρ (A, B) =1. This result is reasonable, because two intuitionistic fuzzy sets A, B have a linear relation B = kA + b with k = 0.2 and b = 0.4. But the method of Liu et al. in [11] does not tell us anything about this data.
E (A) = (0.1, 0.4) ; E (B) = (0.41, 0.44); COV (A, B) =0.002 ; D (A) =0.02 ; D (B) =0.0002 and ρ (A, B) =1 .
We cannot determine the correlation coefficient according to this method.
In this example, the results of our method match those of Liu et al. [11]. In this example, the results calculated according to our method also coincide with the results calculated by using the method of Liu et al. both methods yield a correlation coefficient ρ (A, B) =1 . This result is reasonable, because two intuitionistic fuzzy sets A, B have a linear relation B = kA + b with k = 0.1 and b = 0.4. But the method of Hung in [7] does not tell us anything about this data.
E (A) = (0.3, 0.2) ; E (B) = (0.37, 0.38);
COV (A, B) = -0.006 ; D (A) =0.06 ; D (B) =0.0006 and ρ (A, B) = -1 .
We cannot determine the correlation coefficient according to this method.
In this example, the results of our method match those of Liu et al. In this example, the results calculated according to our method also coincide with the results calculated by using the method of Liu et al, both methods yield the correlation coefficient ρ (A, B) = -1 . This result is reasonable, because two intuitionistic fuzzy sets A, B have a linear relation B = kA + b with k = -0.1 and b = 0.4. But the method of Hung in [7] does not tell us anything about this data.
Symptoms characteristic for the considered diagnoses
Symptoms characteristic for the considered patients
To select the appropriate diagnostic method we calculate the correlation of each patient with the diagnostic methods. For each patient, the appropriate diagnostic method will have the highest correlation coefficient.
The correlation coefficients of a diagnosis d
k
∈ D (k = 1, 2, …, 5) for each patient p
i
(i = 1, 2, 3, 4) is
The computed results of correlation coefficients are listed in Table 3. From the results, we see that Al should use diagnostic method corresponding to Malaria, Bob uses Stomach problem, Joe uses Typhoid and Ted uses Malaria. We also cite the results listed in [11]. These results together with the results calculated according to our method are listed in Table 4. All six methods point to Bob in accordance with our S diagnosis. Our method and the other four methods indicate that Joe should use T diagnostics. Patient Al should use V, and Ted use diagnosis V.
Correlation coefficients of symptoms for each patient to the possible diagnose sets
The most possible diagnosis for each patient under different methods
In this section, we extend our method to the IVIFSs.
Let X ={ x1, x2, …, x
n
} be a universal set. Given an IVIFSs
We denote
and
We can determine variance, covariance and correlation coefficient between IVIFSs.
The covariance of
Proof similar to proposition, we have some properties of the covariance of two interval –valued intuitionistic fuzzy sets
Now, we can define the correlation coefficient of the interval - valued intuitionistic fuzzy sets (IVIFSs). It is similar to the correlation coefficient of real number variables in statistic.
If If
(1), (2) is easy to verify.
(3). If
So that
Hence, we have
and
If k > 0 then
(4) If k < 0 then
Now, we apply the new correlation coefficient of intuitionistic fuzzy sets in a pattern recognition problem as follows.
Given m pattern {A1, A2, … , A m } in the form of the interval valued intuitionistic fuzzy sets on the universal set X.
There is a new sample A ∈ IVIFS (X).
Question: What pattern does B belong to?
To answer this question, we consider the correlation coefficient of intuitionistic fuzzy sets ρ (A i , A) of sample A to each pattern A i for all i = 1, 2, …, m. If ρ (A i , A) > ρ (A k , A) then we put A belongs to the class of pattern A i for i, k = 1, 2, …, m.
Data information of minerals
From the result in Table 6, we can comment that the correlation coefficients between
Correlation coefficients of
In this section, we use the correlation coefficient in clustering intuitionistic fuzzy sets. This ideal is based on Xu et al. [18].
The clustering algorithm based on the correlation coefficient of intuitionistic fuzzy sets as follows.
until C2k+1 = C2 k .
and
Here, we see that C8 = C4. So that C4 is equivalent matrix.
+ If 0 ≤ λ ≤ 0.375, then we have one cluster:
+ If 0.375 < λ ≤ 0.625, then we have two clusters: {A1, A2} and {A3, A4}
+If 0.625 < λ ≤ 0.75, then we have three clusters:
{A1, A2} , { A3 } and {A4}.
+If 0.75 < λ ≤ 1, then we have three clusters:
{A1} , { A2 } , { A3 } and {A4}.
Conclusion
From a statistical standpoint, the domain of the correlation coefficient is [–1,1]. Not only does our research indicate that, but many other studies point out. This value domain is more significant than correlation coefficients for only [0,1], because, besides pointing out the linear relationship between two sets of data in a space of observation objects, the correlation coefficient also indicates the variability of the two sets of data. Two datasets may have the same tendency, or tend to decrease (in case of positive correlation). It is also possible that the first data set is incremented, the second data set is reduced; in contrast, the first data set is reduced, the second data set is incremented (in the case of a negative correlation). The correlation coefficient of two intuitionistic fuzzy sets should also reflect this. In addition, because of the characteristics of intuitionistic fuzzy sets, there are two functions: a membership function, a non-membership function of an intuitionistic fuzzy set by the feature (elements) of the sample space. Thus, the correlation between intuitionistic fuzzy sets has its own characteristics. As many of the authors have previously studied, we consider the correlation coefficients of intuitionistic fuzzy sets based on both membership functions and non-member functions. Our method can effectively solve some cases where a previous method was difficult. This is demonstrated by the examples we present in this article. In this article, we also apply the methods we propose in determining the appropriate diagnostic methods in medicine, and in the problem of pattern recognition, clustering problems.
