The comparison of sets is an important topic with application in several fields. Divergence measures were introduced as an adequate measure of comparison of two fuzzy sets and an alternative of the dissimilarities. The particular study for local divergences is here generalized to any t-conorm instead just the sum. The concept of divergence is revisited and studied in detail. This study is complemented with the whole characterization of a new family of divergence measures, the generalized local divergence measures. Some applications of these divergences in pattern recognition and decision making illustrate their utility.
Estimation of similarity between sets (patterns or alternatives) is crucial in many aspects. This task is usually performed by applying a suitable comparison measure to fuzzy sets corresponding to given patterns or alternatives.
This can be done from different points of view. In some cases we measure the equality degree and in other ones the difference degree. A study of measures of comparison was given by Bouchon-Meunier et al. [2]. Recently, a review of the measures based on the differences was proposed by Couso et al. [4]. Among these measures, divergences appears as a good alternative [7, 8]. An important family of such measures fulfills the following natural property: if we change one of the two images only at a singleton, then the variation of divergence only depends on the changed values. This is called locality and divergence measures fulfilling it are called local. The computational advantage is obvious and so they were studied and characterized in [8]. The definition of a divergence measure is based on the union and intersection of fuzzy sets in a standard way by Zadeh [13] based on the maximum t-conorm and the minimum t-norm, respectively. Our previous studies are now completed for any t-conorm and any t-norm. Apart from that, the family of local divergences is generalized to the family of S-local divergences. They are obtained by combining the divergence at particular points of the universe with “distances” among them using a triangular conorm.
Basic concepts
The universal set is denoted by X. A fuzzy subset of X is a mapping from X into the unit interval [0, 1]. In this framework, we use the following notations:
is the set of all subsets of X,
is the set of all fuzzy subsets of X,
We identify a fuzzy set and its membership function. Thus we have that X (x) =1 for all x ∈ X and for the empty set we have ∅ (x) =0 for all x ∈ X.
Another important concepts will be the containment relation and the complement set. We consider the standard Zadeh’s negation for the complement (see [13]).
Definition 1. Let . The complement of is the fuzzy set . is contained in , denoted by if for all x ∈ X.
Apart from the previous relation of containment, we consider the concepts of intersection and union of fuzzy sets. The initial definitions were also given in [13] as minimum and maximum. However, they are not the only way to generalize the classical set operations, but there exist a broad class of functions to represent them. For the intersection, this class is referred as t-norms and for the union as t-conorm.
The triangular norm, (t-norm) is a function T : [0, 1] × [0, 1] → [0, 1] satisfying the following:
T (a, b) = T (b, a), for all a, b ∈ [0, 1],
T (T (a, b), c) = T (a, T (b, c)), for all a, b, c ∈ [0, 1],
b ≤ c ⇒ T (a, b) ≤ T (a, c), for all a, b, c ∈ [0, 1],
T (a, 1) = a, for all a ∈ [0, 1].
Some important examples of t-norms are:
Minimum: TM (a, b) = min(a, b), for all a, b ∈ [0, 1],
Product: TP (a, b) = a · b, for all a, b ∈ [0, 1],
Łukasiewicz t-norm: TL (a, b) = max(a + b - 1, 0), for all a, b ∈ [0, 1],
Drastic t-norm:
For these basic t-norms, it holds that TD ≤ TL ≤ TP ≤ TM. In fact, for any t-norm T, there is TD ≤ T ≤ TM.
By changing the neutral element from 1 to 0, we obtain the triangular conorm (t-conorm).
The t-norm T and t-conorm S are dual iff for each a, b ∈ [0, 1] there is T (a, b) =1 - S (1 - a, 1 - b).
For the previous examples, the duals are:
Maximum: SM (a, b) = max(a, b), for all a, b ∈ [0, 1],
Probabilistic sum: SP (a, b) = a + b - a · b, for all a, b ∈ [0, 1],
Łukasiewicz t-conorm: SL (a, b) = min(a + b, 1), for all a, b ∈ [0, 1],
Drastic t-conorm:
Using t-norms and t-conorms, we can define in general the intersection and union of two fuzzy as follows.
Definition 2. Let . Given a t-norm T and a t-conorm S,
;
.
Thus, we can denote by (X, T, S) the triple formed by the universe with the t-norm and the t-conorm defining the intersection and the union, respectively.
Divergences
In 1996, Bouchon-Meunier [2] tried to define a general measure of comparison between fuzzy sets. Since then more measures for comparing fuzzy sets have been introduced (see [1, 15]). A nice study about that can be found in [4]. The usual measures of comparison are dissimilarities [6].
Definition 3. A map is a dissimilarity if it satisfies the following axioms:
, for every .
, for every .
For every such that , there is .
As this definition is not too restrictive, it is possible to define a counterintuitive measure of comparison for which the above axioms hold. The restriction associated to this definition is given by the fact that the requirement in Axiom (Diss.3) is only given for sets such that , but there are a lot of sets which are not comparable with respect to ⊆ and therefore, nothing is required for them. Thus, we need a concept where the restriction about “proximity” are given for any set.
So, another measure of comparison was proposed in [7], the divergence, with the following properties:
It becomes zero when the two sets coincide.
It is a nonnegative and symmetric function.
It decreases when the two subsets become “more similar” in some sense.
While it is easy to formulate the first and the second conditions analytically, the third one depends on the formalization of the concept “more similar”. We base our approach on the fact that if we add a subset to both fuzzy subsets , we obtain two subsets which are closer to each other; the same with the intersection.
Definition 4. Let (X, T, S) be a triple with X a universe and T and S any t-norm and t-conorm, respectively. A map is a divergence measure with respect to (X, T, S) iff for all , D satisfies the following conditions:
;
;
, for all , where the union and intersection are defined by means of S and T, respectively.
It is clear that a divergence measure is associated to a triple (X, T, S) and a map D can be a divergence measure with respect a t-norm and it cannot be with respect to a different t-norm. However, when there is not ambiguity, we will say just a divergence measure without specifying the used t-norm and t-conorm.
We present some examples of divergence measures.
Example 1.
Clearly D is a divergence for any triple (X, T, S).
Example 2. We consider the triple (X, TM, SM) where X is finite. For any pair of fuzzy sets in X we put:
where αx ≥ 0 for any x ∈ X and ∑x∈Xαx = 1.
Again, the first and second conditions are trivial. Denote . Without loss of generality assume that a ≥ b and then T (a, c) ≥ T (b, c) for any t-norm T, so in particular for TM. Thus,
if c > a ≥ b, then |TM (a, c) - TM (b, c) | = |a - b| ≤ |a - b|,
if a ≥ c ≥ b, then |TM (a, c) - TM (b, c) | = |c - b| ≤ |a - b|,
if a ≥ b ≥ c, then |TM (a, c) - TM (b, c) | = |c - c|=0 ≤ |a - b|,
and therefore, in all the cases, this inequality holds. From here we have that
Analogously we prove with the maximum t-conorm. Thus, D is a divergence.
D is also a divergence if we consider the product t-norm or the Łukasiewicz t-norm and their dualt-conorms, since
|TP (a, c) - TP (b, c) | = |a · c - b · c| = c · |a - b| ≤ |a - b| and |SP (a, c) - SP (b, c) | = |a + c - a · c - (b + c - b · c) | = (1 - c) · |a - b| ≤ |a - b|;
if a + c ≥ 1 and b + c ≥ 1 then |TL (a, c) - TL (b, c) | = | (a + c - 1) - (b + c - 1) | = |a - b| and |SL (a, c) - SL (b, c) | = |1 - 1|=0 ≤ |a - b|,
if a + c < 1 and b + c ≥ 1 then |TL (a, c) - TL (b, c) | = |0 - (b + c - 1) | = |b - (1 - c) |,but a < 1 - c ≤ b and therefore |TL (a, c) - TL (b, c) | ≤ |b - a| = |a - b|. Moreover,|SL (a, c) - SL (b, c) | = |a + c - 1 | = |(1 - c) - a| ≤ |b - a|.
the case a + c ≥ 1 and b + c < 1 is analogous to the previous one,
if a + c < 1 and b + c < 1 then |TL (a, c) - TL (b, c) | = |0 - 0|=0 ≤ |a - b| and |SL (a, c) - SL (b, c) | = | (a + c) - (b + c) | = |a - b|.
However, this does not hold in general. For instance, if we consider the drastic t-norm, for the case a = 1, b = 0.2, c = 0.9 we have that |TD (a, c) - TD (b, c) | = |TD (1, 0.9) - TD (0.2, 0.9) | = |0.9 - 0|=0.9 > 0.8 = |a - b|. Then, .
By the last axiom of the Definition 4, we could think that conditions and could be equivalent. However, this is not true in general, even in the case T and S are dual.
Example 3. We will show that the two conditions in Axiom (Div3) are not equivalent, even in (X, TM, SM).
Let be defined by:
Axiom (Diss2) from the definition of divergence is satisfied since the function h is symmetric. By definition h (x, x) =0 and so Axiom (Diss1) is also satisfied.
If we consider the following partition of the universal set X = X1 ∪ X2 ∪ X3 ∪ X4 ∪ X5 ∪ X6, where:
then
Applying the standard maximum t-conorm we have:
Since h is, by definition, a positive and decreasing function in both components, we have that:
We have shown that . However, the inequality for the intersection is not fulfilled in general. Thus, if we consider the universal set X = {x}, the dual minimum t-norm and the fuzzy sets defined as , we have that and .
In the previous example we have shown that does not imply . If we change the definition of D with h (x, y) = xy if x ≠ y, then we can show that even the converse implication does not hold, as it fulfills but not .
So, we can conclude that both conditions in Axiom 3 of Definition 4 are independent.
Proposition 1.Let D be a divergence measure for (X, T, S). Then for all .
Proof. For any and in , we have that , for any t-norm T.
Divergences appeared as an alternative to dissimilarities, but they are not related in general. We will show a divergence which is not a dissimilarity and conversely.
Example 4. Let us consider the divergence measure D defined on the one-point set X ={ x } as follows:
Now we verify that the D is a divergence measure. The first two conditions are trivial. We prove the third one.
if then and for any . Thus, .
if and , then by definition and since by definition, they are lower than or equal to .
if and , then .
For the intersection, we have that , and therefore . Thus, .
For the union, we have that , and so . Thus,
if , then .
if , then if we consider the union defined by means of the drastic t-conorm. Thus, .
In all the cases, .
the case and is analogicalto (iii).
if and then by definition. Moreover, and , since 1 is the neutral element of any t-norm and the annihilating element of any t-conorm. Thus, the divergence between the intersections is 0 in for any if we consider the drastic t-norm, since
if , then by definition.
if , then if we consider TD to define the intersection. Thus, .
In both cases, . Moreover, from the definition of the divergence D.
the case is analogical to (v).
Thus, the map D is a divergence. However, if we consider , we have that but . Thus, D does not fulfill Axiom (Diss.3) in Definition 3.
Thus not all divergences are dissimilarities. The converse does not hold either:
Example 5. Let be defined by:
Let us check that D is a dissimilarity. The first and second conditions are trivial. Consider now the fuzzy set for which . We are going to prove that . only can attain three values:
if , then .
if , then . Since we take and hence .
if , then by definition. Since we have and hence .
Thus, D is a dissimilarity, but it is not a divergence. To prove that it is enough to consider the fuzzy sets defined on X ={ x1, x2 } as follows: . Thus, in this case, we have that for any t-conorm. Therefore, (Div.3) from Definition 4 is not satisfied.
We see that divergences and dissimilarities are not related in general. However, there are some maps in both families, as we can see in the next example.
Example 6. For the map from Example 1, we proved that D is a divergence measure for any t-norm and t-conorm. Moreover, if , we have:
,
.
Thus, it is also a dissimilarity.
Although in general both concept are not related, they are some functions which are both divergences and dissimilarities. In fact, in the particular case we reduce our focus on the standard fuzzy set operations using only minimum t-norm and its dual t-conorm, we can consider the divergences as a subset of dissimilarities (see Proposition 2.4 in [8]). Their relationship is even stronger for a particular kind of families, which have the local property, as we will see in the next section.
Local divergences
Comparing two fuzzy sets it seems natural to suppose that if we only change their values at one point, the divergence should only depend on this change.
Definition 5. Let D be a divergence measure for a triple (X, T, S). Than D has the local property (is local), if for all and for all x ∈ X, there exists a map such that
A particular case of locality was introduced in [7, 8], but in that case the map hx was fixed and all the elements in the universe were of the same importance. However, based on some application of comparison of multivalued sets (see, e.g. [10, 12]), this is not always the case and different maps should be considered. Hence we introduce a more general definition.
In the case X is finite, a representation theorem for local divergences was obtained in [8]. However, this result holds only for the minimum t-norm and its dual t-norm, that is, in (X, TM, SM). We will present a general result for any t-norm and any t-conorm.
Theorem 1. (Representation Theorem 1)Let (X, T, S) be a triple with X a finite universe and T and S any t-norm and t-conorm, respectively. Let D be a divergence associated to X. D is local if and only if
where {hx} x∈X is a family of maps from [0, 1] × [0, 1] into such that, for any x ∈ X, a, b, c ∈ [0, 1] there is
hx (a, a) =0, for all a∈ [0, 1];
hx (a, b) = hx (b, a), for all a, b∈ [0, 1];
hx (a, b) ≥ max(hx (S (a, c), S (b, c)), hx (T (a, c), T (b, c))).
Proof. If D is local then, by definition, for any x ∈ X. We apply this equation for other elements in X. Therefore: . But, and analoguously, .
Thus, , since D is a divergence measure.
We have to check if for any x ∈ X, the function fulfills (i)-(iii). For any fixed x ∈ X, we can define the fuzzy sets if t = x and otherwise.
In this case we have .
By definition we have that , but and .
Since the D be a divergence measure we have that , but and = hx (S (a, c), S (b, c)) + ∑t∈X-{x}ht (S (0, 0), S (0, 0)) = hx (S (a, c), S (b, c)) by property (i). Analogously we could prove that and therefore the proof of this implication is concluded.
To show the converse implication let be a map, which can be expressed as a sum
where {hx} x∈X is a family of maps from [0, 1] × [0, 1] into such that, for any x ∈ X, hx fulfills (i)-(iii) from the previous definition. We will show that D is a local divergence measure. It is immediate that D is well-defined. Apart from that, D is a divergence, since for any in we have
and analogously,
Moreover, for any . So D is a local divergence.
We present an example of a local divergence.
Example 7. The divergence proposed in Example 2 is local. It is clear that for any x ∈ X, hx (a, b) = αx · |a - b|, ∀ a, b ∈ [0, 1] where αx ≥ 0 for any x ∈ X and ∑x∈Xαx = 1. Thus, it suffices to prove that it fulfills the conditions in Theorem 1. The first two are trivial. For the third one follows immediately from Example 2.
In the previous example, the map hx was based on a distance between real numbers. However, this is not true in general, as we proved in [8]. Only some specific distances can be used for generating divergence measures. In fact, hx is itself a divergence measure on the referential {x}.
Although the divergence given in Example 2 is local, this is not true in general. Clearly there are divergences which are not local. As an example we can consider the measure proposed in Example 1. Let us suppose D is local, then there is a family of maps {hx} x∈X with the appropriate properties, such that for any .
In particular, if we consider three fuzzy sets such that for x, y ∈ X,
with a1 ≠ b1 and a2 ≠ c2, we have and .
On the other hand, the fuzzy sets are also different, then . But if D is local, we also have that which contradicts the definition of D. So, D is a divergence measure but it has not the local property.
Thus, the family of local divergences is a proper subset of the family of divergences. This subfamily has some specific properties as we will see in the following results. The first one is an equivalent definition for locality based on the intersection instead of the union.
Theorem 2.Let (X, T, S) be a triple with X finite and T, S any t-norm and t-conorm, respectively. Let D be a divergence. D is local if and only if for all , there is such that
Proof. If D is local, then by Theorem 1, , since for any y ≠ x and . Analogously, for y ≠ x and .
Conversely, if D fulfills the condition for the intersection, we can apply recursively the equation for all the elements in X. Therefore:
, applying again that and for y ≠ x.
Moreover, it is easy to see that hx fulfills (i)–(iii) in Theorem 1 for any x ∈ X. Thus D is a local divergence.
Apart from the definition for local divergences, we can consider a different one. Its advantage is that it allows us to generalize this notion later.
Proposition 2.Let (X, T, S) be a triple with X finite and T, S any t-norm and t-conorm, respectively. Let D be a divergence. D has the local property if and only if
for any and any .
Proof. If D is local, then there is a family of function {hx} x∈X such that for any and ,
and
by the properties of the t-norms and t-conorms. Therefore, .
Conversely, if D fulfills for any and any , for any x ∈ X we can consider Z = {x} and so . Thus, we have to prove fulfills the conditions in Theorem 1 for any x ∈ X, where and for any y ≠ x.
The first two conditions are very easy: and . For the third one, let us consider any u, v, z ∈ [0, 1], then
So, D is a local divergence from Theorem 1.
The previous properties were proven in the general case of any t-norm and t-conorm. In the particular case we consider the standard union or intersection, SM and TM, an interesting property can be proven. It will allow us to relate divergences and dissimilarities. First we characterize the dissimilarities with the local property.
Theorem 3. (Representation Theorem 2)Let (X, T, S) be a triple with X finite and T, S any t-norm and t-conorm, respectively. Let D be a dissimilarity on X. D is local, that is, for all and for all x ∈ X, there exists a map such that
if and only if
where, for any x ∈ X, hx has the following properties:
hx (a, a) =0, for all a∈ [0, 1];
hx (a, b) = hx (b, a), for all a, b∈ [0, 1];
hx (a, c) ≥ max {hx (a, b), hx (b, c)}, for all a, b, c ∈ [0, 1] with a < b < c.
Proof. Let D be a dissimilarity with the local property. From locality we can prove that
and as the first axioms of dissimilarity are the same as the axioms for divergences, we proved in Theorem 1 that hx fulfills that hx (a, a) =0 and hx (a, b) = hx (b, a), for all a, b ∈ [0, 1].
Moreover, for any a, b, c ∈ [0, 1] such that a ≤ b ≤ c, if we apply Axiom (Diss.3) in Definition 3 to with , , for a fixed x ∈ X and if y ≠ x. Thus, we have that .
The converse implication is analogical to the proof for local divergences (Theorem 1).
In the following we will study the relationship between the locality of divergence and locality of dissimilarity for the minimum t-norm and its dual t-conorm.
Proposition 3.Let (X, TM, SM) be a triple with X finite. A map is a local divergence if only if D is a local dissimilarity.
Proof. It is sufficient to prove that (iii) in Theorem 1 is equivalent to (iii’) in Theorem 3.
If (iii) is fulfilled and a ≤ b ≤ c, then hx (b, c) = hx (SM (a, b), SM (c, b)) ≤ hx (a, c) and hx (a, b) = hx (TM (a, b), TM (c, b)) ≤ hx (a, c).
Conversely, we consider the case a ≤ b without loss of generality. Now we discuss three cases:
If c ≤ a ≤ b, then hx (SM (a, c), SM (b, c)) = hx (a, b) and hx (TM (a, c), TM (b, c)) = hx (c, c) =0 ≤ hx (a, b).
If a ≤ c ≤ b, then hx (SM (a, c), SM (b, c)) = hx (c, b) ≤ hx (a, b) and hx (TM (a, c), TM (b, c)) = hx (a, c) ≤ hx (a, b), by applying (iii’) in both cases.
If a ≤ b ≤ c, then hx (SM (a, c), SM (b, c)) = hx (c, c) =0 ≤ hx (a, b) and hx (TM (a, c), TM (b, c)) = hx (a, b).
Thus, although divergences and dissimilarities are not the same concept, we know that in the case of the minimum t-norm and its dual t-conorm, any divergence is a dissimilarity. Now we have proven that the family of the local divergences is the same as the family of the local dissimilarities.
As we could see in Proposition 1, any divergence measure has 0 as a lower bound. Moreover, if it is local and we consider TM and SM, it also fulfills Axiom (Diss.3) of Definition 3. Thus, for any . Moreover, for any , if D is local we also have that with
Thus, , for any , that is, D (∅, X) is a upper bound for D.
From now on, we can consider the normalized version of a divergence measure, that is, .
Generalizated local divergences
As we could see in Proposition 2, a divergence D is local if and only if , for any and any . In this case we are decomposing by means of the sum, but any other aggregation operator (see, for instance, [3]) could be considered. Thus, we could consider the particular family of divergence measures such that
for any and any and a fixed aggregation operator Ag.
Let us study the behaviour of this map Ag in detail:
If Z is a crisp set, then and and therefore Ag has to be commutative.
If we consider as the crisp set the referential X, then . Thus, zero has to be the neutral element for Ag.
We could see that for local divergences, if and for a crisp set Z them . If we require this property in a general context, then Ag is increasing in both components.
Moreover, we can apply Ag recursively and obtain but also
Thus, Ag has to be associative. As Ag is associative, commutative, monotone with neutral element 0, then it is a t-conorm.
Definition 6. Let be a divergence and let S be a t-conorm. Then D has the S-local property (is S-local), if for all and for all x ∈ X, it is fulfilled that
Again we can characterize these divergences.
Theorem 4. (Representation Theorem 3)Let (X, T, S) be a triple with X finite and T, S any t-norm and t-conorm, respectively. Let S′ be another t-conorm. A map D is S′-local divergence measure if and only if
where {hx} x∈X is a family of maps such that, for any x ∈ X, hx : [0, 1] × [0, 1] → [0, 1] and
hx (a, a) =0, for all a∈ [0, 1];
hx (a, b) = hx (b, a), for all a, b∈ [0, 1];
hx (a, b) ≥ max(hx (S (a, c), S (b, c)), hx (T (a, c), T (b, c))), for all a, b, c ∈ [0, 1].
Proof. If D is S′-local then
Thus, it is enough to prove that for any x ∈ X, the map hx defined by is well-defined and it fulfills Conditions (i)–(iii) for any x ∈ X, where and for any y ≠ x.
The proof is analogical to the one in Proposition 2.
It is easy to prove that the map D defined in the statement is a divergence measure. For the S′-locality, and
Analogously, . Thus,
for any , that is, D is a S′-local divergence.
From the previous theorem it is immediate that any divergence measure assuming values on the interval [0, 1] is local iff it is SL-local.
As this holds for local divergences, an equivalent definition will be given by means of the intersection instead of the union in the following proposition.
Proposition 4.A divergence measure D has theS-local property iff for all and for all the following equality is fulfilled:
Proof. If D is S-local, by the properties of thet-conorms and the t-norms, we have that
Conversely, if D is a divergence, it is easy to prove that D can be decomposed as in Theorem 4 for the map hx defined by , where and for any y ≠ x. Thus, D is S-local.
Another equivalent definition is shown below.
Proposition 5.Let D be a divergence measure and let S be a t-conorm. D is S-local if and only if for any and for any crisp partitions of the reference space X, the following equationholds
Proof. If D is S-local, we have that
But if we apply again the result to the second component, we have that
Since and and accounting the associativity of the t-conorm, we obtain
If we apply the previous procedure recursively, we obtain that
The converse implication is trivial, since we only have to consider for any the partition {Z, Zc} of X and Proposition 4.
In the following propositions we establish some important properties of the S-local divergences, which express natural characteristics of ourmeasure.
Proposition 6.Let D be a S-local divergence.Then
Proof. Let Z be any crisp subsets of X. Then:
where we have applied the symmetry of hx for any x ∈ X and the associativity of the t-conorm.
Although trivial, the following proposition allows us to change the scale factor of a divergence.
Proposition 7.Let D be a S-local divergence generated by the family of functions {hx} x∈X, and let φ : [0, 1] → [0, 1] be a increasing function with φ (0) =0. The maps Dφ and Dφ defined below byare also divergence measures and Dφ is S-local.
The proof is straightforward.
Proposition 8.Let (X, TM, SM) be a triple with X a finite universe, let S be any t-conorm and let D be a S-local divergence measure. For any such that for any x ∈ X either or , we have that
The proof that S-local divergence D is a dissimilarity is based on Proposition 2.4 in [8] by aggregating with S.
The divergence between a set and its complement increases and it attains the maximum when is crisp. Moreover, this is true when we compare any pair of fuzzy sets, as it follows from Proposition 6 and the next result.
Proposition 9.Let (X, TM, SM) be a triple with X finite and let D be a S-local divergence. For any we have
Proof. For any , if we consider then if x ∈ Z and if x ∈ Zc. Since D is also a dissimilarity, then .
Applications
We present two examples of application of generalized local divergence measures for pattern recognition and decision making. In both cases we will remark the different situations depending on the selected t-conorm.
Application to pattern recognition
Let X be finite and the patterns and a sample be represented by fuzzy sets.
We can measure the difference between and for i∈ { 1, . . ., m }. We obtain the finite set of divergences: . Finally, the sample will be associated to the pattern whenever
So, the sample is classified into the pattern from which it differs least.
Let us see how to proceed by an example, based on the one proposed in [10].
Example 8. Let us consider five kinds of mineral fields, each of them featured by the content of six minerals and containing one kind of typical hybrid mineral. The five kinds of typical hybrid mineral are represented by fuzzy sets , , , and in X = {x1, …, x6}, respectively. Assume that there is another kind of mineral B, and we want to classify it into one of the aforementioned mineral fields. The minerals are described by means of the fuzzy sets in Table 1.
The kinds of hybrid minerals represented by fuzzy sets
X
x1
x2
x3
x4
x5
x6
0.74
0.03
0.19
0.49
0.02
0.74
0.12
0.03
0.05
0.14
0.02
0.39
0.45
0.66
1.00
1.00
1.00
1.00
0.28
0.52
0.47
0.30
0.19
0.74
0.33
1.00
0.18
0.16
0.05
0.68
0.63
0.52
0.21
0.22
0.07
0.66
Let us also assume that experts established the weight vector α = {0.2, 0.3, 0.125, 0.125, 0.125, 0.125} on X. Let us use our method to classify B. If we consider the divergence proposed in Examples 2 and 7:
with αx ≥ 0 for any x ∈ X and ∑x∈Xαx = 1. Then and therefore, it is clear that D is a S-local divergence measure for all basic t-conorms. Thus, the values assumed by the four different divergences are obtained in Table 2.
The S-local divergences obtained for S∈ { SM, SP, SL, SD }
SM
SP
SL
SD
0.1470
0.2090
0.2215
1
0.1470
0.2864
0.3190
1
0.1163
0.3644
0.4330
1
0.0700
0.1314
0.1375
1
0.1440
0.2084
0.2203
1
We can see that in all the cases we should classify B into the hybrid mineral A4 (in the case of the drastic t-conorm it is just an option). However, the behaviour of any divergence is different. Thus, for instance, for the maximum t-conorm, the divergence is not able to distinguish between and , since in this case only the maximum distance is considered. In the other two cases, SP and SL, all the points in X are essential to obtain the value of the divergence and therefore the difference between and is remarked. In the case of SD, the information given by the divergence is null.
Apart from the t-conorm used to defined the S-local divergence, the weights also can play an interesting role. Thus, at the previous example, we will consider the weight vector α = {k, 0.5 - k, 0.125, 0.125, 0.125, 0.125} for k ∈ {0, 0.1, …, 0.5}.
In this case, if we consider the previous D for SL, we obtain the divergences shown in Table 3.
Variations of the SL-local divergence based on the weights
α = 0
α = 0.1
α = 0.2
α = 0.3
α = 0.4
α = 0.5
0.2975
0.2595
0.2215
0.1835
0.1455
0.1075
0.315
0.317
0.319
0.321
0.323
0.325
0.425
0.429
0.433
0.437
0.441
0.445
0.0675
0.1025
0.1375
0.1725
0.2075
0.2425
0.2563
0.2383
0.2203
0.2023
0.1843
0.1663
In this case we see that for α = 0.5 and therefore, in this case, we should classify B into the hybrid mineral A1. In fact, should be classified into
for α = 0.4, 0.5,
for α = 0, 0.1, 0.2, 0.3.
The obtained results are expected, since for α = 0.5, the most important point at the referential is x1. If we look at this element, and are the most similar. In the case α = 0, the most important point changes to be x2 and this is the reason of the selection of .
As we can see in the previous examples, if we consider different maps hx for any x on defining the divergence measure we can increase the information we can obtain from the data. The same happens if we consider the concept of generalized locality, since we can aggregate the divergences among the points by different t-conorm apart from SL (just the sum) and we can consider the most appropriate t-conorm to any problem. Thus, the information obtained by means of the class of divergence measures considered in this paper is much richer than the one considered in [8].
Application to decision making
Let denote a set of alternatives, let X = {x1, …, xn} be a set of attributes, and α = (α1, …, αn) be its weight vector (αi ≥ 0,∑iαi = 1).
Each alternative will be expressed by a fuzzy set with the elements xj, where represents the degree in which alternative agrees with attribute xj.
The new alternatives and will be and canbe interpreted as the “optimal” and “least optimal”, respectively. In this sense, the preferred alternative would be more similar to and more different from , simultaneously.
Finally, we consider the quotient ki defined as:
It means that if some alternative has a quotient kj for which kj < ki then the alternative is better as in the sense previous described. Thus, the optimal is the alternative whose ki is the minimum.
The previous procedure will now be explained by means of an example based on the one from [12].
Example 9. The government has to decide among five different energy strategies: . Each of them is assessing four attributes: economic (xEC), technological (xT), environmental (xEN) and socio-political (xP). The weight vector of these attributes is (αEC, αT, αEN, αP) = (0.4, 0.2, 0.3, 0.1).
Assume that any alternatives is defined by the fuzzy sets given in Table 4. If we consider SM to define the union and TM to define the intersection, we have the corresponding fuzzy sets and shown also in Table 4.
Definition of 5 energy strategies, the optimal and less optimal alternatives
X
xEC
xT
xEN
xP
0.2
0.7
0.6
0.5
0.4
0.5
0.8
0.6
0.5
0.6
0.9
0.7
0.3
0.8
0.7
0.5
0.8
0.7
0.1
0.3
0.8
0.8
0.9
0.7
0.2
0.5
0.1
0.3
If we consider again the SM-local divergence measure proposed in the previous example, that is, then the values of the quotients given in Table 5.
Comparison of five alternatives optimality
ki
i = 1
0.24
0.15
0.62
i = 2
0.16
0.21
0.43
i = 3
0.12
0.24
0.33
i = 4
0.2
0.18
0.53
i = 5
0.24
0.24
0.50
Since k3 < k2 < k5 < k4 < k1, we conclude that the most optimal alternative is A3.
Conclusion
We have introduced a new class of divergence measures between fuzzy sets, the S-local divergences, which is constructed by using a triangular conorm S instead of the sum. In these studies we have used the previous results about divergence measures with the local property, in fact we have shown that any local divergence is a S-local divergence. Since this new class of t-conorm dependent divergences satisfies all the properties already proposed for the local ones, we think that this generalization will allow us to manage a larger class of divergence measures and use them for define new classes of fuzziness measures.
One of the advantages of the proposed attitude is, that by considering different maps hx for the points (or ranges of points) in X we can stress the specific aspects of a particular model and thus obtain more accurate information when specifying the measure of difference.
The most immediate open problem is to generalize the S-local divergence measure to the case when the universe is infinite. Another important point is the choice of hx in some typical cases.
Footnotes
Acknowledgments
The research in this communication has been supported in part by MINECO-TIN2014-59543-P its financial support is gratefully acknowledged.
References
1.
AnthonyM. and HammerP.L., A Boolean measure of similarity, Discrete Applied Mathematics154/16 (2006), 2242–2246.
2.
Bouchon-MeunierB., RifqiM. and BothorelS., Towards general measures of comparison of objects, Fuzzy Sets and Systems84 (1996), 143–153.
3.
CalvoT., MayorG., MesiarR., Aggregation operators: New trends and Applications, in: Physica-Verlag, 2002.
4.
CousoI., GarridoL. and SánchezL., Similarity and dissimilarity measures between fuzzy sets: A formal relational study, Information Sciences229 (2013), 122–141.
5.
LiY., QinK. and HeX., Some new approaches to constructing similarity measures, Fuzzy Sets and Systems234/1 (2014), 46–60.
6.
LuiX., Entropy, distance measure and similarity measure of fuzzy sets and their relations, Fuzzy Sets and Systems52 (1992), 305–318.
7.
MontesS., Partitions and divergence measures in fuzzy models, Ph. D. Dissertation, University of Oviedo, Spain, 1998.
8.
MontesS., CousoI., GilP. and BertoluzzaC., Divergence measure between fuzzy sets, International Journal of Approximate Reasoning30 (2002), 91–105.
9.
ValverdeL. and OvchinnikovS., Representations of Tsimilarity relations, Fuzzy Sets and Systems159/17 (2008), 2211–2220.
10.
WangW. and XinX., Distance measure between intuitionistic fuzzy sets, Pattern Recognition Letters26 (2005), 2063–2069.
11.
WilbikA. and KellerJ.M., A fuzzy measure similarity between sets of linguistic summaries, IEEE Transactions on Fuzzy Systems21/1 (2013), 183–189.
12.
XuZ. and XiaM., Distance and similarity measures for hesitant fuzzy sets, Information Sciences181 (2011), 2128–2138.
13.
ZadehL.A., Fuzzy sets, Information and Control8 (1965), 338–353.
14.
ZadehL.A., A note on similarity-based definitions of possibility and probability, Information Sciences267 (2014), 334–336.
15.
ZhangC. and FuH., Similarity measures on three kinds of fuzzy sets, Pattern Recognition Letters27/12 (2006), 1307–1317.