In the paper, we define the Shannon-typeentropyof experiments in the intuitionistic fuzzy case and derive its basic properties. Subsequently, the results concerning the entropy are used to introduce the concept of mutual information of intuitionistic fuzzy experiments. In addition, we define the Kullback-Leibler divergenceand its conditional versionin the intuitionistic fuzzy caseand examine the algebraic properties of the proposed quantities. The results are explained with several examples to illustrate the developed theory.
It is well–known that a classical Cantor’s set corresponds to an indicator function, of which values are only taken to be 0 and 1. The concept of fuzzy set, introduced by L.A. Zadeh in 1965 [42], extends the classical set theory. Recall that a fuzzy set is a mapping f : X → [0, 1] (where f (x) is interpreted as the degree of membership of the element x ∈ X to the considered fuzzy set f). The fuzzy set theory allows mathematically describing inaccurate, vague terms and working with them. It has been applied in various fields of mathematical research and it is continually developing. In addition, the fuzzy set theory has important applications also in practice, e.g., in control engineering, data processing, management, logistics, artificial intelligence, computer science, medicine, decision theory, expert systems, logic, management science, operations research, pattern recognition, and robotics.
The theory of intuitionistic fuzzy sets, developed by K. Atanassov [1, 2], further extends both concepts by allowing the assessment of the elements by two functions: μA for membership and νA for non-membership, which belong to the real unit interval [0,1] and whose sum belongs to the same interval, as well. So, the intuitionistic fuzzy set theory is a more powerful tool to deal with vagueness. In the last two decades, many authors have paid attention to the intuitionistic fuzzy set theory that has been successfully applied in various areas such as logic programming, decision making problems, medical diagnosis, etc. Recently important applications of intuitionistic fuzzy set to artificial intelligence have appeared – intuitionistic fuzzy expert systems, intuitionistic fuzzy neural networks, intuitionistic fuzzy decision making, intuitionistic fuzzy machine learning, intuitionistic fuzzy semantic representations, etc. We refer the interested reader to the paper [21], which provides an insight on the characteristics of the highly cited intuitionistic fuzzy publications.
When planning experiments, it is important to know how much information we gain from their realization. As it is known, a measure of information is entropy, the standard approach being based on the Shannon entropy [39]. Let us remind the reader that the Shannon entropy of probability distribution P = (p1, p2, …, pn) is the number , where is the Shannon entropy function defined by the following equation:
The usual mathematical model of a random experiment in classical informationtheory [19] is a measurable partition of a probability space. Partitions are standardly defined in the context of classical Cantor’s set theory.In many cases, however, it has been shown that the partitions defined by means of fuzzy set theoryare more appropriate to solving real problems. That is why various suggestions for a generalization of the classical partitions to fuzzy partitions have been created. A fuzzy partition can serve as a mathematical model of a random experiment whose results are unclearly, vaguely defined events, the so-called fuzzy events. We refer the interested readers to (for example) [12, 40] for some results related to the concepts of fuzzy partition and its entropy. It is known that there are many possibilities to define operations for modeling the union and intersection of fuzzy sets; an overview can be found in [11]. It should be noted that while the model studied in [23–26] was based on Zadeh’s max-min operations (cf. [42]), in our article [27], the Łukasiewicz connectives were used to define the fuzzy set operations. Let us remind that the union of fuzzy subsets f, g of X is defined by Łukasiewicz as min(f + g, 1) . We note that in [25, 26], we have exploited our results regarding the entropy of fuzzy partitions provided in [24] to introduce the concepts of mutual information and Kullback-Leibler divergence for the fuzzy case. The notion of Kullback and Leibler divergence(often shortened to K-L divergence) was introduced in [22] as a distance measure between two probability distributions. It plays significant roles in information theory and various disciplines such as statistics, machine learning, physics, neuroscience, computer science, linguistics, etc.
We note that some suggestions for generalizing the fuzzy partitions to intuitionistic fuzzy partitions (IF-partitions) are given, for example, in [4, 28]. Some results concerning the entropy in the intuitionistic fuzzy case can be found e.g., in [5, 41]. The purpose of the present study is to provide a generalization of the results presented in [26] from the fuzzy case to the intuitionistic fuzzy one. Analogously as in the fuzzy case, there are many possibilities to define operations over intuitionistic fuzzy sets (see e.g., [3, 38]). We note that while the model studied in this work is based on the Łukasiewicz connectives, the authors of the above cited papers used other connectives to define operations over intuitionistic fuzzy sets.
It has been shown that in addressing some specific problems, instead of Shannon entropy,it is more appropriate to use an approach based on the concept of logical entropy (see e.g., [8, 17]). In our article [28], the logical entropy and logical mutual information of experiments in the intuitionistic fuzzy case have been studied. We remind the reader that if P = (p1, p2, …, pn) is a probability distribution, then the logical entropy of P (cf. [15, 16]) is defined as the number , where is the logical entropy function defined by
for every x ∈ [0, 1]. Note that while the theory of entropy for the intuitionistic fuzzy case provided in [28] is based on the concept of logical entropy function, in this paper we use an approach based on the concept of the Shannon entropy function.
The rest of the paper is structured as follows. In the following section, we provide basic definitions, notations and facts used in the article. The main results of the paper are presented in Sections 3 and 4. In Section 3, we define the Shannon–typeentropy of IF-partitions and its conditional version. It is shown that the proposed entropy measures have properties analogous to the properties of Shannon’s entropy of classical measurable partitions. Subsequently, using the results concerning the entropy of IF-partitions, the concepts of mutual information and conditional mutual information of experiments in the intuitionistic fuzzy case are introduced and basic properties of the suggested measures are proved. In Section 4, we define the K-L divergenceand its conditional version in the intuitionistic fuzzy case and examine the algebraic properties of the proposed quantities. The results are explained with several examples to illustrate the theory developed in the article. The final section provides a brief summary.
Basic definitions, notations and facts
In this section, we recall basic definitions, notations and facts used in the paper.
Definition 2.1. Let X be a non–empty set. By an intuitionistic fuzzy set (IF-set for short), we will mean a pair A = (μA, νA) of functions μA, νA : X → [0, 1] such that μA (x) + νA (x) ≤1, for every x ∈ X.
As already mentioned in the introduction, there are many possibilities to define operations over intuitionistic fuzzy sets. We will use the partial binary operation ⊕, and the binary operation · defined as follows. If A = (μA, νA), and B = (μB, νB) are IF-sets, then we define A ⊕ B = (μA + μB, νA + νB - 1X), and A · B = (μA · μB, νA + νB - νA · νB). Here, 1X denotes the constant function with the value 1; similarly, 0X denotes the constant function with the value 0. It is obvious that if A, B are two IF-sets, then A ⊕ B is an IF-set if and only if μA + μB ≤ 1X, and νA + νB ≥ 1X. In the case that A ⊕ B is an IF-set, we will say that A ⊕ B exists. Put 1 = (1X, 0X), 0 = (0X, 1X). It can be verified that A · 1 = A, and A ⊕ 0 = A, for any IF-set A. For any IF-sets A, B, C, the following conditions are satisfied: (i) if A ⊕ B exists, then B ⊕ A exists, and A ⊕ B = B ⊕ A (commutativity); (ii) if (A ⊕ B) ⊕ C exists, then A ⊕ (B ⊕ C) exists, and (A ⊕ B) ⊕ C = A ⊕ (B ⊕ C) (associativity); (iii) if A ⊕ B exists, then C · A ⊕ C · B exists, and C · (A ⊕ B) = C · A ⊕ C · B (distributivity).
In the class of all IF-sets, we define the relation of partial ordering ≤ in the following way: if A = (μA, νA), and B = (μB, νB) are two IF-sets, then A ≤ B if and only if μA ≤ μB, and νA ≥ νB. It is evident that 0 ≤ A ≤ 1 for any IF-set A. Gutierrez Garcia and Rodabaugh have shown in [20] that the intuitionistic fuzzy sets ordering and topology are reduced to the ordering and topology of fuzzy sets. Another situation is in measure theory [4], where the intuitionistic fuzzy case cannot be reduced to the fuzzy one. We note that a probability theory for the intuitionistic fuzzy case has been developed in [35], see also [36].
Example 2.2. A fuzzy set f : X → [0, 1] can be regarded as an IF-set, if we put A = (f, 1X - f). If f = IA, where IA is the indicator function of a set A ⊂ X, then the corresponding IF-set has the form A = (IA, 1X - IA) = (IA, IAC). Here, AC denotes the complement of a set A ⊂ X. In this case A ⊕ B corresponds to the union of sets A, B ⊂ X with empty intersection, A · B to the intersection of sets A, B ⊂ X, and the relation ≤ to the inclusion of sets A, B ⊂ X.
In the paper, we shall denote by the symbol any family of IF-sets satisfying the following two conditions: (i) , ; (ii) if , then . Any IF-set from the family is considered as an intuitionistic fuzzy event. The IF-set 0 = (0X, 1X) is interpreted as an impossible event; the IF-set 1 = (1X, 0X) as a certain event. Analogously as in [28], we define a state on the family . It plays the role of a probability measure on the family of IF-events.
Definition 2.3. A mapping is called a state if the following two conditions are satisfied: (i)s (1) = 1; (ii) if such that , then s (A ⊕ B) = s (A) + s (B) .
Definition 2.4. [28] By an IF-partition of we will mean an n-tuple α = (A1, A2, …, An) of (not necessarily different)elements of such that and .
Definition 2.5. [28] Let α = (A1, A2, …, Ak), and β = (B1, B2, …, Bl) be two IF-partitions of . We will say that the IF-partition β is a refinement of α (with respect to a states) and write α ≺ β if for each Ai ∈ α there exists a subset I (i)⊂ { 1, 2, …, l } such that s (Ai) = s (⊕ j∈I(i)Bj), where I (i)∩ I (j) = ∅ whenever i ≠ j, and
Definition 2.6. [28] Let α = (A1, A2, …, Ak), and β = (B1, B2, …, Bl) be two IF-partitions of . Their join α ∨ β is defined as an r-tuple(where r = k · l) consisting of the elements Ai · Bj, i = 1, 2, …, k, j = 1, 2, …, l.
Theorem 2.7. [28] Ifα, βare two IF-partitions of, thenα ∨ βis also an IF-partition of. Moreover, α ≺ α ∨ β and β ≺ α ∨ β
Proposition 2.8. [28] Letα = (A1, A2, …, An) be an IF-partition of. Then, for any.
Example 2.9. Consider a probability space (X, S, P), and the class of all S-measurable IF-sets, i.e., are S-measurable with μA+ νA ≤ 1X }. Let c ∈ [0, 1]. It can be verified that the mapping defined, for any element A = (μA, νA) of , by the formula
is a state.
Remark 2.10. We note that any continuous state s defined on the class of all S-measurable IF-events (i.e., a state satisfying the condition An ↗ A ⇒ s (An) ↗ s (A)) has the form (3); for more details, see [7, 36]. The appropriate entropy theory for the case of the class of all S-measurable IF-events was provided by Ďurica in [14]. While an IF-partition considered by Ďurica in [14] is a set α ={ A1, A2, …, An } of S-measurable IF-setssuch that , the model studied in [28], as well as in this work is more general.
Entropy and mutual information of IF-partitions
Each IF-partition α = (A1, A2, …, An) of represents, from the point of view of classical probability theory, a random experiment with a finite number of results Ai, i = 1, 2, …, n (which are intuitionistic fuzzy events) with a probability distribution pi = s (Ai), i = 1, 2, …, n. Namely, pi ≥ 0, for i = 1, 2, …, n, and . Hence, we definethe entropy of α = (A1, A2, …, An) by Shannon’s formula:
If α = (A1, A2, …, Ak), and β = (B1, B2, …, Bl) are two IF-partitions of , then we define the conditional entropy of α given Bj ∈ β by:
where
The conditional entropy of α assuming a realization of the experiment β is defined by the formula:
It is assumed (based on continuity arguments) that if a ≥ 0. The base of the logarithm can be any positive number, but as a rule one takes logarithms to the base 2. The entropy is then expressed in bits.
Remark 3.1. Evidently, the entropy Hs (α) is always non-negative, and since s (Ai · Bj) ≤ s (Bj), the conditional entropy Hs (α/β) is non-negative as well. In general, Hs (α/β) ≠ Hs (β/α). Let us consider the IF-partition α0 ={ 1 } representing an experiment, the result of which is a certain event. It is easy to see that Hs (α0) =0, and Hs (α/α0) = Hs (α) .
Example 3.2. Consider a class of IF-events and a state . Let α = (A1, A2) be an IF-partition of with s (A1) = p, where p ∈ (0, 1) . Then s (A2) =1 - p, and the entropy of α is the number Hs (α) = - p log p - (1 - p) log(1 - p). If we set p = 0.5, then Hs (α) =1 bit.
The entropy and the conditional entropyof IF-partitions satisfy properties analogous to the properties of Shannon’s entropy of classical measurable partitions as shown by the succeeding theorems. The proof of the following theorem is based on the property stated in Lemma 3.3.
Lemma 3.3.Letβ = (B1, B2, …, Bq), andγ = (C1, C2, …, Cr) be IF-partitions ofsuch thatβ ≺ γ. Then, for everyandj = 1, 2, …, q, it holds thats (A · (⊕ k∈I(j)Ck)) = s (A · Bj), whereI (j)⊂ { 1, 2, …, r } such thats (Bj) = s (⊕ k∈I(j)Ck), forj = 1, 2, …, q, I (j)∩ I (k) = ∅, forj ≠ k, and
Proof. Let us suppose that the assertion is not valid. This means that there exists j0∈ { 1, 2, …, q } such that
or
Then:
or
But this conclusion is a contradiction, because by Proposition 2.8 we have:
and
Proof. Let us suppose that α = (A1, A2, …, Ap), β = (B1, B2, …, Bq), γ = (C1, C2, …, Cr), β ≺ γ. Then for any Bj ∈ β there existsa subset I (j)⊂ { 1, 2, …, r } such that s (Bj) = ∑k∈I(j)s (Ck), where I (j)∩ I (k) = ∅, for j ≠ k, and . Put (for fixed j such that . Then .
Let usconsider the function defined by Equation (1). Since the function φ is concave, we have:
Therefore, using Lemma 3.3, we can write:
□
Theorem 3.5.For every IF-partitionsα, β, γof, we have:
Proof. Let α = (A1, A2, …, Ap), β = (B1, B2, …, Bq), γ = (C1, C2, …, Cr). It is not hard to verify that the function φ defined by Equation (1) satisfies, for every x, y ∈ [0, ∞), the condition
Using the identity (5), we get:
□
Remark 3.6. Since α ∨ β = β ∨ α, we have also:
Theorem 3.7.For each IF-partitionsα, βof, we have:
Proof. In Equation (6) it suffices to put γ ={ 1 }. □
Remark 3.8. Using the principle of mathematical induction, we get the following generalization of Equation(6):
for every IF-partitions α1, α2, …, αn, γof. If we put γ ={ 1 }, we get the following chain rule for entropyof IF-partitions:
Theorem 3.9.For every IF-partitionsα, β, γof, we have:Hs (α ∨ β/γ) ≤ Hs (α/γ) + Hs (β/γ) .
Proof. The inequality is a consequence of Theorems 3.5, 2.7 and 3.4. □
Example 3.10. Consider the following class of Borel measurable IF-events: are Borel measurable with μA+ νA ≤ 1X } and define a state by the formula:
for every A = (μA, vA) ∈ F. Further, we put A1 = (0 . 2X, 0 .5X), and A2 = (0 . 3X, 0 .5X). Since A1 ⊕ A2 = (0 . 5X, 0X) (therefore, , and , the pair α = (A1, A2) is an IF-partition. By simple calculations, we get that s (A1) = s (A2) =0.5, and the entropy Hs (α) =1 bit. In addition, put β = (B1, B2), where B1 = (0 . 4X, 0 .6X), B2 = (0 . 5X, 0 .4X) . The pair (B1, B2) is an IF-partition with s (B1) =0.4, s (B2) =0.6, and the entropy Hs (β) =0.97095 bit. The join of IF-partitions α and β is the quadruple α ∨ β = (A1 · B1, A1 · B2, A2 · B1, A2 · B2), where s (A1 · B1) =0.2, s (A1 · B2) =0.3, s (A2 · B1) =0.2, s (A2 · B2) =0.3. The entropy of α ∨ β is the number Hs (α ∨ β) =1.97095 bits.
The conditional entropy of α assuming a realization of the experiment β is the number:
Now, it is possible to verify that the equality Hs (α ∨ β) = Hs (β) + Hs (α/β) is fulfilled.
In the following part, using the previous results, we define the notions of mutual information and conditional mutual information of IF-partitions and derive basic properties of these measures.
Definition 3.11. Let α, β be two IF-partitions of . Then we define information about α in β by the formula:
Evidently, Is (α, β) = Is (β, α), i.e., the information measure Is is symmetric. This means that information about α in β is equal to information about β in α. For this reason, the value Is (α, β) is said to be mutual information of IF-partitions α, β .
Proposition 3.13.Ifα = (A1, A2, …, Ak), andβ = (B1, B2, …, Bl) are two IF-partitions of, then
Proof. Since by Proposition 2.8, it holds that , for i = 1, 2, …, k, we get:
□
Theorem 3.14.Letα, βbe IF-partitions of. ThenIs (α, β) ≥ 0 with the equality if and only if the IF-partitionsα, βare statistically independentwith respect to s, i.e.,s (A · B) = s (A) · s (B) , for everyA ∈ α, and for everyB ∈ β.
Proof. Let us assume that α = (A1, A2, …, Ak), and (B1, B2, …, Bl). Put δ ={ (i, j) ; s (Ai · Bj) >0 }. Then according to the inequality ln x ≤ x - 1, that holds for all real numbers x > 0 (with the equality if and only if x = 1), for every (i, j) ∈ δ, we get:
The equality holds if and only if , i.e., when s (Ai · Bj) = s (Ai) · s (Bj). Therefore, by Propositions 3.13 and 2.8, we obtain:
It follows that Is (α, β) ≥ 0. The equality Is (α, β) = 0 holdsif and only if s (Ai · Bj) = s (Ai) · s (Bj), for i = 1, 2, …, k, and j = 1, 2, …, l, i.e., when the IF-partitions α, β are statistically independent with respect to s. □
As a consequence of Equation (11) and Theorem 3.14, we obtain the properties of sub-additivity and additivity of entropy IF-partitions stated in the following theorem.
Theorem 3.15.For IF-partitionsα, βof, it holdsHs (α ∨ β) ≤ Hs (α) + Hs (β) with the equality if and only ifα, βare statistically independent with respect to s.
In our article [28], we proved the fundamental properties of the logical entropy HL (α) of IF-partitions. It turned out that, unlike the Shannon entropy of IF-partitions, the logical entropy of IF-partitions does not have the additivity property. It satisfies the following weaker property: if IF-partitions α, β of are statistically independent, then 1 - HL (α ∨ β) = (1 - HL (α)) · (1 - HL (β)). As a consequence of Equation (10) and Theorem 3.14 we obtain the following property of entropy IF-partitions.
Theorem 3.16.For IF-partitionsα, βof, it holdsHs (α/β) ≤ Hs (α) with the equality if and only if the IF- partitionsα, βare statistically independent with respect to s.
Example 3.17. Consider the class of Borel measurable IF-events from Example 3.10 and the state defined by the formula (9). Let us calculate the mutual information Is (α, β) of IF-partitions α, β considered in Example 3.10. It is possible to verify that the IF-partitions α, β are statistically independent with respect to s. By Equation (10), we get that Is (α, β) = Hs (α) - Hs (α / β) = 1 -1 = 0 bit. It can be verified that the equality Is (α, β) = Hs (α) + Hs (β) - Hs (α ∨ β) is satisfied.
Definition 3.18. Let α, β, γ be IF-partitions of . Then the conditional mutual information of α and β given γ is defined by the formula
Remark 3.19. Since γ ≺ β ∨ γ, by Theorem 3.4, we have the inequality Hs (α/γ) ≥ Hs (α/β ∨ γ). Therefore, the conditional mutual information is always non-negative. In addition, it is easy to see that Is (α, β/γ) = Is (β, α/γ) .
Theorem 3.20.For any IF- partitionsα, β, γof, we haveIs (α ∨ β , γ) = Is (α, γ) + Is (β, γ/α) .
Proof. Let us calculate:
□
The following theorem will establishchain rulesfor mutual information of IF-partitions.
Theorem 3.21.Letα1, α2, …, αn, βbe IF- partitions of. Then, forn = 2, 3, …, the following equality holds:
Proof. The assertion follows by applying Equations (10), (8), Remark 3.8, and Definition 3.18. □
Definition 3.22. Let α, β, γ be IF-partitions of . We say that α is conditionally independent to γ given β if Is (α, γ/β) = 0 .
Remark 3.23. Since Is (α, γ/β) = Is (γ, α/β), we can say that IF-partitions α and γ are conditionally independent given β.
Theorem 3.24.Letα, β, γbe IF-partitions ofsuch thatIs (α, γ/β) = 0, i.e.,αandγare conditionally independent givenβ. Then we have:
Is (α∨ β, γ) = Is (β, γ) ;
Is (β, γ) = Is (γ, α)+ Is (γ, β/α) ;
Is (α, β/γ)≤ Is (α, β) ;
Is (α, β) ≥ Is (α, γ) (data processing inequality).
Proof. The proof can be easily done in the same way as the proof of Theorem 8 in [29]. □
In the following part, we analyze a concavity of entropy Hs (α) and mutual information Is (α, β) as functions of s. We will use the symbol defined on . It is routine to prove that if , then, for every real number c ∈ [0, 1], it holds that
Theorem 3.25.Letandαbe a given IF-partition ofThen, for every real numberc ∈ [0, 1], the following inequality holds:
Proof. Let α = (A1, A2, …, An). The function φ defined by Equation (1) is concave, therefore, we get:
The result proves that the function s → Hs (α) is concave on the class .□
Remark 3.26. If α = (A1, A2, …, Ak), and β = (B1, B2, …, Bl) are two IF-partitions of , then we can write , i.e., there exists cij = φ (s (Bj/Ai)) ≥0 such that
Put
Theorem 3.27.Letα, βbe given IF-partitionsof. Then the mutual informations → Is (α, β) is a concave function on the class
Proof. Let s be a state from the class . By definition Is (α, β) = Hs (β) - Hs (β / α), thus the mutual information Is (α, β) is the sum of two concave functions on the class , therefore it is concave on the class .□
K-L Divergence in the intuitionistic fuzzy case
In this section we introduce the concept of K-L divergence for the intuitionistic fuzzy case and we examine the properties of this quantity. In the proofs we use the known log sum inequality. Let a1, a2, …, an, and b1, b2, …, bn be non-negative real numbers. Denote the sum by a, and the sum by b. The log sum inequality states that with the equality if and only if ai = c · bi, for all i, where c is constant. It is assumed that if x > 0, and if x ≥ 0 .
Definition 4.1. Let s1, s2 be states defined on , and α = (A1, …, An) be a given IF-partitionof . Then we define the K-L divergence dα (s1||s2) by:
Example 4.2. Consider any class of IF-events and states s1, s2, s3 defined on . Let α = (A1, A2) be an IF-partition of with s1 (A1) = p, s2 (A1) = q, s3 (A1) = r, where p, q, r ∈ (0, 1). Then it holds s1 (A2) =1 - p, s2 (A2) =1 - q, and s3 (A2) =1 - r. Put , , . Simple calculations will show that:
Analogously we get that dα (s1||s3) =0.207519 bit, and dα (s2||s3) =0.025062 bit. Evidently,
The result means that the K-L divergence is not a metric in a true sense since the triangle inequality does not hold, in general.
Theorem 4.3.Lets1, s2be states from the classandα = (A1, A2, …, An) be any IF-partition of. Thendα (s1||s2) ≥0 with the equality if and only ifs1 (Ai) = s2 (Ai), fori = 1, 2, …, n.
Proof. The assertion follows by applying the log-sum inequality. It suffices to put ai = s1(Ai) and bi = s2(Ai), for i = 1, 2, …, n. Then a1, a2, …, an, b1, b2, …, bn are non-negative real numbers such that and . Namely, ; analogously we obtain that . Therefore, we get:
with the equality if and only if s1 (Ai) = c · s2 (Ai), for i = 1, 2, …, n, where c is constant. Taking the sum for all i = 1, 2, …, n, we obtain the equality , which implies that c = 1. This means that dα (s1||s2) =0 if and only if s1 (Ai) = s2 (Ai), for i = 1, 2, …, n.□
We note that in the classical theory, the inequality from the previous theorem is known as Gibb’s inequality. In the following text, the symbol cardα denotes the cardinality of the set α .
Theorem 4.4.Letsbe a state defined on, andα = (A1, A2, …, An) be an IF-partition of. Then it holdswhereis a state uniform overα, i.e.,, fori = 1, 2, …, n .
Proof. Let us calculate:
□
By combining the previous two assertions, we obtain the following property of entropy of IF-partitions.
Corollary 4.5.For arbitrary IF-partitionαof, we haveHs (α) ≤ log cardαwith the equality if and only if the statesis uniform overα.
Proof. Let α = (A1, A2, …, An) and be a state uniform over α. According to Theorem 4.4, we have dα (s||t) = log cardα - Hs (α) . By Theorem 4.3 it holds dα (s||t) ≥0, and therefore, we get the inequality Hs (α) ≤ log cardα . Moreover, by Theorem 4.3 dα (s||t) =0 if and only if s (Ai) = t (Ai), for i = 1, 2, …, n. Thus the equality Hs (α) = log cardα holds if and only if s is uniform over α .□
Theorem 4.6.The K-L divergencedα (s1||s2) is convex in the pair (s1, s2), i.e., if, are pairs of states defined on, then, for every real numberc ∈ [0, 1], it holds:
Proof. Assume that α = (A1, A2, …, An and fix i ∈ { 1, 2, …, n } . Put , , , , in the log-sum inequality. Then:
Summing these inequalities over i = 1, 2, …, n, we obtain the assertion.□
Finally, we define the conditional K-L divergence and, using this notion, we formulate the chain rule for the K-L divergence in the intuitionistic fuzzy case.
Definition 4.7. Let s1, s2 be states defined on , and α = (A1, A2, …, Ak), β = (B1, B2, …, Bl) be two IF-partitions of . Then we define the conditionalK-L divergence dβ/α (s1||s2) by:
Theorem 4.8.Ifα, βare two IF-partitions of, then:
Proof. Let α = (A1, A2, …, Ak), β = (B1, B2, …, Bl). Since by Proposition 2.8, it holds
For i = 1, 2, …, k, we get:
We usedthe implication s1 (Ai) =0 ⇒ s1 (Ai · Bj) =0 that follows from the equality . □
Conclusion
In the paper, we introduced the concept of Shannon-type entropy of IF-partitions and its conditional version. We showed that the suggested entropy measures have properties analogous to properties of Shannon entropy in the classical case. Specifically, it was shown that the Shannon entropy of IF-partitions satisfies the properties of additivity and sub-additivity.Subsequently, the concept of entropy of IF-partitions was exploited to define the mutual information of experiments in the intuitionistic fuzzy case. We proved basic properties of this quantity, inter alia, the data processing inequality for conditionally independent IF-partitions. The proposed measures can be used whenever we need to know the amount of information obtained by realization of experiments whose results are intuitionistic fuzzy events.
In the final section, we introduced the concept of K-L divergence in the intuitionistic fuzzy case and we examined the properties of this distance measure. In particular, a convexity of K-L divergence in the intuitionistic fuzzy case was proved and the relationship between K-L divergence and entropy of IF-partitions was studied. The chain rules for entropy, mutual information and K-L divergence in the intuitionistic fuzzy case were also derived. To illustrate the results, we have provided several numerical examples.
Footnotes
Acknowledgments
The first author thanks the editor and the referees for their valuable comments and suggestions. It is my sad duty to announce that Prof. RNDr. Beloslav Riečan, DrSc., Dr.h.c. passed away unexpectedly on August 13, 2018, before this article was published. Rest in peace, our beloved friend, co-worker, and teacher.
DeS.K., BiswasR., RoyA.R., Some operations on intuitionistic fuzzy sets, Fuzzy Sets Syst114 (2000), 477–484.
11.
DuboisD., PradeM., A review of fuzzy set aggregation connectives, Inf Sci36 (1985), 85–121.
12.
DumitrescuD., Fuzzy partitions with the connectives T∞, S∞, Fuzzy Sets Syst47 (1992), 193–195.
13.
DumitrescuD., Fuzzy measures and entropy of fuzzy partitions, J Math An and Appl176 (1993), 359–373.
14.
ĎuricaM., Entropy on IF-events, Notes Intuit Fuzzy Sets13 (2007), 30–40.
15.
EllermanD., An introduction to logical entropy and its relation to shannon entropy, Int J Seman Comput7 (2013), 121–145.
16.
EllermanD., Logical information theory: New foundations for information theory, Log J IGPL25 (2017), 806–835.
17.
EllermanD., Logical entropy: Introduction to classical and quantum logical information theory, Entropy20 (2018), 679.
18.
FarnooshR., RahimiM., KumarP., Removing noise in a digital image using a new entropy method based on intuitionistic fuzzy sets, Proceedings of the International Conference on Fuzzy Systems, Vancouver, BC, Canada,2016; Institute of Electrical and Electronics Engineers Inc.: New York, NY, USA, 2016; pp. 1328–1332.
19.
GrayR.M., Entropy and Information Theory, Springer: Berlin/Heidelberg, Germany, 2009.
20.
Gutierrez GarciaJ. and
RodabaughS.E., Order-theoretic, topological, categorical redundancies of interval-valued sets, grey sets, vague sets, interval-valued “intuitionistic” sets, “intuitionistic” fuzzy sets and topologies, Fuzzy Sets Syst156 (2005), 445–484.
21.
HeX., WuY., Global research trends of intuitionistic fuzzy set: A bibliometric analysis, J Intelligent Systems (Published Online: 2017-09-11), doi: 10.1515/jisys-2017-0240
22.
KullbackS., LeiblerR.A., On information and sufficiency, Ann Math Stat22 (1951), 79–86.
23.
MarkechováD., The entropy of fuzzy dynamical systems and generators, Fuzzy Sets Syst48 (1992), 351–363.
24.
MarkechováD., Entropy of complete fuzzy partitions, Math Slovaca43 (1993), 1–10.
25.
MarkechováD., Entropy and mutual information of experiments in the fuzzy case, Neural Netw World23 (2013), 339–349.
26.
MarkechováD., Kullback-leibler divergence and mutual information of experiments in the fuzzy case, Axioms6(1) (2017). doi: 10.3390/axioms6010005
27.
MarkechováD., RiečanB., Entropy of fuzzy partitions and entropy of fuzzy dynamical systems, Entropy18(1) (2016). doi: 10.3390/e18010019
28.
MarkechováD., RiečanB., Logical entropy and logical mutual information of experiments in the intuitionistic fuzzy case, Entropy19(8) (2017). doi: 10.3390/e19080429
29.
MarkechováD., RiečanB., Kullback-leibler divergence and mutual information of partitions in product MV algebras, Entropy19(6) (2017). doi: 10.3390/e19060267
30.
MesiarR., ReuschB., ThieleH., Fuzzy equivalence relations and fuzzy partition, J Mult Valued Log Soft Comput12 (2006), 167–181.
31.
MesiarR., RybárikJ., Entropy of fuzzy partitions—A general model, Fuzzy Sets Syst99 (1998), 73–79.
32.
MontesS., et al., Fuzzy delta-epsilon partitions, Inf Sci152 (2003), 267–285.
33.
PiaseckiK., Fuzzy partitions of sets, Busefal25 (1986), 52–60.
34.
RahimiM., RiaziA., On local entropy of fuzzy partitions, Fuzzy Sets Syst234 (2014), 97–108.
35.
RiečanB., Probability theory on IF events. In Algebraic and Proof-Theoretic Aspects of Non-Classical Logics, Papers in Honor of Daniele Mundici on the Occasion of his 60th Birthday, Lecture Notes in Computer Science, Springer: New York, NY, USA, 2007, pp. 290–308.
36.
RiečanB., On a problem of Radko Mesiar: General form of IF-probabilities, Fuzzy Sets Syst152 (2006), 1485–1490.
37.
RiečanB. and
AtanassovK.
, Some properties of operations conjuction and disjunction from Łukasiewicz type over intuitionistic fuzzy sets Part 1, Notes Intuit Fuzzy Sets20(3) (2014), 1–5.
38.
RiečanB. and
AtanassovK., Some properties of operations conjuction and disjunction from Łukasiewicz type over intuitionistic fuzzy sets Part 2, Notes Intuit Fuzzy Sets20(4) (2014), 1–6.
39.
ShannonC.E., A mathematical theory of communication, Bell Syst Tech J27 (1948), 379–423.