Abstract
In decision making analysis, ranking decision with anti-noise capability is a very desirable issue. This paper focuses on analysing data in interval-valued information systems. A new approach ranking with weighted standardized cardinality (WSC) and inclusion indicator in interval-valued information systems is proposed. It is a novel generalization of ranking method based on dominance classes. The new approach overcomes the drawback of sensitivity to noise for raking using dominance classes-based method. In this paper, each object with all attribute values is regarded as an interval-valued fuzzy set (IVF-set). By defining WSC with controlling parameters and single interval inclusion indicator, two kinds of inclusion indicators of the IVF-set are constructed. The approach is based on the novel idea of raking with WSC and inclusion indicator rather than dominance classes. Experiments verify that the new approach is noise-resistant.
Keywords
Introduction
In many data analysis applications, information and knowledge are stored and represented in a table called an information system, which provides a convenient way to describe a finite set of objects within a universe by a finite set of attributes [19]. In decision making analysis, we often need to deal with data with various types value, such as single value, symbol value, set value, interval value, et al. [2, 33]. To discover knowledge in information systems, the theory of rough sets proposed by Pawlak [19] has been studied deeply and applied in decision analysis, patten recognition, machine learning et al. [11, 34]. In real world, it is important to order the objects according to their attributes in an information system. This kind of decision making is called ranking decision, which has been widely used in economy, management and engineering areas. To solve the ranking problem, Greco et al. [14, 15] present a dominance-based rough set approach (DRSA) which is an extension form of Pawlak’s rough set based on dominance relations. This approach is studied and developed in ranking problem and knowledge reduction in ordered information systems and interval-valued information systems [21, 40]. Zhang and Xu [37] proposed a new method for ranking intuitionistic fuzzy values by using the similarity measure and the accuracy degree, and then they applied the proposed ranking method to multi-attribute decision making.
In fuzzy set theory, which is proposed by Zadeh in 1965 [35], the term cardinality was most commonly used in many areas. As an important characteristic of a set, the cardinality of a crisp set is the number of elements in the set and the cardinality of a fuzzy set is often focused on ordinary cardinal, real numbers or some generalization of cardinal numbers. In scalar approaches, the cardinalities of fuzzy sets were defined with a mapping to each fuzzy set, assigning a single ordinary cardinal number or a non-negative real number. The scalar cardinality was proposed by De Luca and Termini who named it as the power of a finite fuzzy set [7]. The power of a finite fuzzy set A is defined as the sum of the membership degrees of the fuzzy set A. The problem of counting fuzzy sets has generated a lot of literature since Zadeh’s initial conception. As a widely used concept in fuzzy areas, cardinality plays an important role in fuzzy databases and information systems. The relationship between fuzzy mappings and scalar cardinalities was studied in [10, 24]. In the researches on cardinalities, Wygralak developed an axiomatic approach to scalar cardinalities which covers all previous approaches [29]. Casasnovas and Torrens proposed the axiomatic theory of fuzzy cardinalities and investigated the scalar cardinality of finite fuzzy sets based on t-norms and t-conorms [5, 6]. Szmidt and Kacprzyk introduced the scalar cardinality of intuitionistic fuzzy sets [27]. Considering the power character of cardinality, we propose a weighted standardized cardinality approach for interval fuzzy sets and apply it to rank the objects in interval-valued fuzzy systems.
Inclusion measure is an important topic in fuzzy set theory. Different from binary discrimination, the researchers proposed some indicators giving the degree to which a fuzzy set is contained another one. The inclusion indicator has been developed by constructive and axiomatic approaches and has been applied to various fields such as approximate reasoning, data redundancy in fuzzy relational database, rough set theory, uncertainty analysis, and fuzzy formal context analysis [3, 39]. Monotonicity is an important property of an inclusion indicator and Zhang studied the hybrid monotonic inclusion measure between two fuzzy sets [39]. In some sense, the monotonicity of an inclusion indicator can be used to measure the dominance degree between two fuzzy sets.
In decision making analysis, Zhang and Qiu proposed to order the objects based on a dominance relation [40] and Qian et al. generalized the approach for ranking all objects based on dominance classes and the entire dominance degree in interval ordered information systems [21]. However, the ranking approach based on dominance classes can be influenced by noise easily. To overcome the drawback, we propose the ranking approaches by two ways, one is an absolute ranking approach based on weighted standardized cardinality the other is a relative ranking method tie to inclusion indicator.
The remainder of this paper is organized as follows. In Section 2, we review some basic notions related to fuzzy sets, Atanassov’s fuzzy sets and interval-valued fuzzy sets, and then introduce the ranking method based on dominance classes and the entire dominance degree. In Section 3, we propose the weighted standardized cardinality of IVF-set, and give the ranking results of the objects according to the weighted standardized cardinality in an interval-valued information system. In Section 4, we construct inclusion indicators of IVF-sets by weighted standardized cardinality and single interval inclusion indicator respectively. The ranking result is obtained by the inclusion indicators. In Section 5, an illustrative example is presented. We then conclude this paper with a summary and directions for future research in Section 6.
Preliminaries
In this section we review some basic notions related to fuzzy sets (F-sets), Atanassov’s intuitionistic fuzzy sets (AIF-sets), interval valued fuzzy sets (IVF-sets) and ordered information systems.
F-sets, AIF-sets and IVF-sets
Let U be a finite non-empty set, an F-set A on U is defined as a set of ordered pairs
A = {〈x, μ A (x) 〉|x ∈ U} , where μ A : U ⟶ [0, 1] is the membership function of A and μ A (x) denotes the grade of x belonging into A. A family of all F-sets in U will be denoted by .
According to Zadeh’s seminal paper [35] introducing fuzzy sets, the intersection and union of two F-sets are defined as,
A ∩ B = {〈x, min (μ A (x) , μ B (x)) 〉|x ∈ U} ,
A ∪ B = {〈x, max (μ A (x) , μ B (x)) 〉|x ∈ U} .
The complement of F-set A is defined as,
thicksimA = {〈x, 1 - μ A (x) 〉|x ∈ U} .
In intuitionistic fuzzy sets theory [1], an AIF-set A on U is defined as
A = {〈x, μ A (x) , ν A (x) 〉|x ∈ U} , where μ A (x) and ν A (x) are degrees of membership and nonmembership of x in A. Moreover they satisfy μ A (x) , ν A (x) ∈ [0, 1] and 0 ≤ μ A (x) + ν A (x) ≤1. π A (x) =1 - μ A (x) - ν A (x) is called the degree of indeterminancy of x in A. A family of all AIF-sets in U will be denoted by .
Gau and Buehrer [12] defined vague sets. Bustince and Burillo [4] showed that the notion of vague set is the same as that of AIF-set.
The intersection and union of two AIF-sets are defined as,
The complement of AIF-set A is defined as,
thicksimA = {〈x, ν A (x) , μ A (x) 〉|x ∈ U} .
Another extension of fuzzy sets is interval-valued fuzzy sets theory introduced by Gorzalczany [13]. An IVF-set A on U is defined as
A = {〈x, [l A (x) , r A (x)] 〉|x ∈ U} , where l A (x) and r A (x) are the left (lower) and right (upper) ends of membership interval, and satisfy 0 ≤ l A (x) ≤ r A (x) ≤1. The greatest IVF-set is IVF universe set 1 U = {〈x, [1, 1] 〉|x ∈ U} and the least IVF-set is IVF empty set 0 U = {〈x, [0, 0] 〉|x ∈ U}. A family of all IVF-sets in U will be denoted by .
Note that IVF-sets are called grey sets by Deng [8].
The intersection and union of two IVF-sets are defined as,
The complement of IVF-set A is defined as,
thicksimA = {〈x, [1 - r A (x) , 1 - l A (x)] 〉|x ∈ U} .
Obviously, an ordinary F-set A can be written as an AIF-set A = {〈x, μ A (x) , 1 - μ A (x) 〉|x ∈ U} , or an IVF-set A = {〈x, [μ A (x) , μ A (x)] 〉|x ∈ U} . An AIF-set A = {〈x, μ A (x) , ν A (x) 〉|x ∈ U} can be represented by means of an IVF-set A = {〈x, [μ A (x) , 1 - ν A (x)] 〉|x ∈ U} and vice versa. In Glad Deschrijver’s opinion [9], there exists an isomorphism between and , so IVF-sets theory is equivalent to AIF-sets theory, and the generalization of F-sets theory. All results presented in the follows can be straightforwardly transformed to AIF-set and F-set.
Ranking all objects by using the dominance relation in an interval-valued information system
In many data analysis applications, information and knowledge are stored and represented in a table and it provides a convenient way to describe a finite set of objects within a universe by a finite set of attributes [19].
An interval-valued information system (IVIS) is an order tuple S = (U, AT, V, f), where U is a finite non-empty set of objects, AT is a finite non-empty set of attributes, V = ⋃ a∈ATV a and V a is a range of a ∈ AT, f : U × AT → V is called an information function [22] satisfying f (x, a) ∈ V a and denoted by f (x, a) = [a l (x) , a r (x)] ⊆ [0, 1] , where a l (x) and a r (x) are the least and greatest grade of object x owning attribute a, respectively. Obviously, an ordinary fuzzy information system satisfies f (x, a) = a l (x) = a r (x) for all x ∈ U and a ∈ AT, we regard it as a special form of an IVIS.
In [21], Qian, Liang and Dang defined a concept of interval ordered information system, in which all attributes are criteria, that is, the range of each attribute is ordered according to their preference. In an IVIS S = (U, AT, V, f), for all objects x i , x j ∈ U and attribute a k ∈ AT, x j ≽ a k x i means that x j is at least as good as x i with respect to a k . For any attribute subset A ⊆ AT, we can divide A into two subsets A inc and A dec , where A inc is according to increasing preference and A dec is according to decreasing preference, A = A inc ∪ A dec . For simplicity, we denote by [l ik , r ik ]. For any object x j ∈ U and attribute a k ∈ A, if a k ∈ A inc then x j ≽ a k x i ⇔ l ik ≤ l jk and r ik ≤ r jk , else if a k ∈ A dec then x j ≽ a k x i ⇔ l jk ≤ l ik and r jk ≤ r ik . Furthemore, we define x j ≽ A x i ⇔ x j ≽ a k x i , for any a k ∈ A (A ⊆ AT). The dominance relation is defined by According to the dominance relation , the dominance class can be induced as follows
One important problem in intelligent decision-making is to rank the objects. In [40], Zhang and Qiu defined a concept of dominance degree for ranking all objects in classical ordered information systems. In [21], Qian, Liang and Dang developed the method in interval orderd information systems. In their studies, the following measure
Firstly, we obtain the dominance classes of all objects as follows
Then, we get the dominance degree of all objects by Equation (1), and list them in a matrix as
According to the values above, we can get the ranking result as
In Qian’s methods, one can rank all objects in an ordering information system from the perspective of dominance classification. However, this method is so strict that it will be influenced by noise easily. From Example 2.1, we obtained the ranking result by the entire dominance degree Equation (2). Considering the object x5, its interval-values are greater than all the others’ in U with respect to all attributes in AT except a4, but the ranking result shows that x5 is not the best one. Obviously, ranking by dominance relation approach in a multi-attribute information system, some few attributes values (which may be seen as noise) can influence the whole ranking results, and the true best object can not be picked out. To avoid such problem and get an objective ranking result, we will make ranking by cardinality function and inclusion measure from a global point.
In mathematics, the cardinality of a set is a measure of the “number of elements of the set”. There are two approaches to cardinality—-one compares sets directly using bijections and injections, and the other is to use cardinal numbers. The cardinality of a fuzzy set is also called its power, when no confusion with other notions of power is possible. As a notion of power, the cardinality can be used to measure a fuzzy set, and compare the superiority and inferiority of fuzzy sets.
Wygralak [29] developed an axiomatic theory of scalar cardinality of fuzzy sets which covers all previous approaches. In the sense of Wygralak’s theories, we would like to introduce a notion of the cardinality of IVF-sets and discuss its applications for ranking in an IVIS.
Let [a, b]/u denote an IVF-set such that ([a, b]/u) (u) = [a, b] and ([a, b]/u) (v) = [0, 0] for each v ≠ u, where [a, b] ⊆ [0, 1] and u, v ∈ U. So each IVF-set A is a union of singletons as A = ⋃ u∈UA (u)/u.
In an IVIS S = (U, AT, V, f), for any x i ∈ U, all attributes values of x i with respect to (w.r.t.) AT form an IVF-set on AT and are called object-attribute IVF-set, denoted by , where for any a k ∈ AT. As the same, for any a k ∈ AT, the attribute values of all objects w.r.t. a k also form an IVF-set on U and are called attribute-object IVF-set, denoted by , where for any x i ∈ U. Obviously, for all x i ∈ U, a k ∈ AT, . In this paper, we only need to consider the object-attribute IVF-sets for ranking objects. In many real life cases, the significance of attributes in an IVIS are often not equal to each other. Considering the weighting of each attribute, we will introduce the weighted standardized cardinality of IVF-sets by average means.
In the following theorem we propose a kind of scalar cardinalities in the sense of Definition 3.1.
where f k : [0, 1] × [0, 1] ⟶ [0, 1] is a binary function for which (C1) f k (0, 0) =0, f k (1, 1) =1 ; (C2) a ≤ b ⇒ f k (a, d) ≤ f k (b, d) and f k (c, a) ≤ f k (c, b), for all [c, a] , [c, b] , [a, d] , [b, d] ⊆ [0, 1] , k = 1, 2, . . . , m.
(1) (P1) is fulfilled following postulate (1) immediately.
(2) In the IVIS (U, AT, V, f), for x i ∈ U, a k , a t ∈ AT, if , then SC for a ≤ b. Similarly, if , then for a ≤ b, i.e. (P2) is satisfied.
(3) Since f k (0, 0) =0, then for any x i , x j ∈ U, and . So ⇒∀a k ∈ AT ([l ik , r ik ] = [0, 0] or [l jk , r jk ] = [0, 0]) . Thus, (P3) is satisfied.
Summarizing all above, we conclude that the function SC is a scalar cardinality of IVF-sets.□
where and f k : [0, 1] × [0, 1] ⟶ [0, 1] is a binary function satisfies the conditions (C1) and (C2) in Theorem 3.1.
WSC in Definition 3.2 is a weighted average and standardized form which developed from Theorem 3.1. It not only reflects the cardinality of an IVF-set with respect to object x i but also considers the weighting of each attribute in AT. Then, we will construct the function f in WSC which will be used in ranking all objects in an IVIS.
(C1) In an IVIS S = (U, AT, V, f), it is easy to verify that and , for any i ∈ {1, 2, . . . , n} , k ∈ {1, 2, . . . , m} and p ∈ (0, ∞).
(C2) For any α ∈ [0, 1] , p ∈ (0, ∞), we can deduce the partial derivative about first parameter of in Equation (3) as and 2p-1p (1 - α) (1 - (1 - α) x - αy) p-1≥ 0 (0.5 <(1 - α) x + αy < 1). Synchronously, and . Thus is a left monotonicity increasing function. Since and . Therefore, is also a right monotonicity increasing function. Thus, (C2) is fulfilled.
Hence, WSC() in Equation (4) is a weighted standardized cardinality. □
In Equation (3), p and α are controlling parameters. Figure. 1 shows the surfaces of binary function with 0 ≤ x ≤ y ≤ 1 when p and α are assigned different values.
Therefore, according to the weighted standardized cardinality, we can get the ranking result as
From Example 3.1, a complete rank of objects can be obtained by using the weighted standardized cardinality. In the ranking result, objects x4 and x6 which are put into the same place according to the entire dominance degree in Example 2.1 have been further ranked, and x5 has become the best one to correspond with human perceptions. Simultaneously, the property of rank preservation is still remained, that is to say, the ordering of other objects (i.e. x1, x2 and x3) is unaltered. Obviously, as a ranking approach, the weighted standardized cardinality in an IVIS is effective. This approach only depends on the interval-values but not dominance relation which is disturbed by noise easily. Analogously, another ranking approach depending on inclusion indicator will be proposed in next section.
In classical set theory, a binary discrimination between two sets is one being or not being a subset of another. To relax the rigidity, inclusion indicator was proposed and many researches about inclusion indicators were discussed in the theories of F-sets, IVF-sets and AIF-sets.
Inclusion indicators of IVF-sets
Now let us consider the problem of measuring the inclusion degree of IVF-sets. In [3], Bunstince introduced an axiomatic definition for the inclusion indicator of IVF-sets and applied it to approximate reasoning. In [36], Zeng and Guo extended the inclusion measure and introduced an axiomatic definition of inclusion indicator of IVF-sets which is different from Bustince’s.
In an IVIS S = (U, AT, V, f), an inclusion indictor between two object-attribute IVF-sets can reflect one object being or not being dominance than the other. By this opinion, we will make ranking in an IVIS by inclusion indictors of IVF-sets.
(I1) Since S is regular, for all i, j ∈ {1, 2, . . . , n}, is not a universal IVF-set and is not an empty IVF-set. By logical reasoning theory, Equation (5) satisfies condition (I1) in Definition 4.1.
(I2) It is obvious that , when . Conversely, if , then .
By Equation (4), we have
,
. Since l ik ∧ l jk ≤ l ik , r ik ∧ r jk ≤ r ik , and from the proof of Theorem 3.2 we know that is left and right monotonicity increasing, so . S is regular, i.e. 0 < α k < 1, so is left and right strict monotonicity increasing. Assume that , then there exists k0 ∈ {1, 2, . . . , m} which satisfies l jk 0 < l ik 0 or r jk 0 < r ik 0 , and then . It follows that , which contradict with . Thus, .
(I3) In the IVIS, for all i, j, s ∈ {1, 2, . . . , n}, if , then , , so . By the monotonicity increasing property of , we have that , Therefore, . implies that , which means that . It follows that .
Summarizing (I1),(I2) and (I3) above, we conclude that Inc denoted by Equation (5) is an inclusion indicator of object-attribute IVF-sets in S. □
Therefore, according to the entire dominance degree, we can get the ranking result as
In Definition 4.1, if IVF-sets A and B are singletons, i.e. A = A (u)/u and B = B (u)/u, where A (u) , B (u) ⊆ [0, 1] and A (v) = B (v) = [0, 0] for each v ≠ u, then Inc (A, B) will be simplified to the single interval inclusion indicator between A (u) and B (u). We will present the equivalent definition as follows.
,
, where is a family of all intervals in [0, 1].
(I1) In a regular IVIS S, the proof of postulate (I1) is similar to the proof of (I1) in Theorem 4.1.
(I2) For any x i , x j ∈ U, if , then l ik ≤ l jk and r ik ≤ r jk for all a k ∈ AT, so , thus . Conversely, if , then for all a k ∈ AT follows . By postulate (D2), we have l ik ≤ l jk and r ik ≤ r jk for all a k ∈ AT, i.e. .
(I3)In the IVIS, for all i, j, s ∈ {1, 2, . . . , n}, if , then l ik ≤ l jk ≤ l sk and r ik ≤ r jk ≤ r sk for all a k ∈ AT. By postulate (D2), we have and , for all a k ∈ AT. So and , i.e. and .
Thus, we complete the proof of Theorem 4.2. □
In Theorem 4.2, the inclusion indicator constructed by single interval inclusion can also be used to make ranking.
Therefore, according to the entire dominance degree according to a single interval inclusion indicator, we can get the ranking result as
Example 4.1 and Example 4.2 show that the entire dominance degree based on inclusion indicator can not only get a complete rank of objects but also overcome the interference of noise.
Experiments
In Example 3.1, Example 4.1 and Example 4.2, we only give one case, i.e. the controlling parameter p = 1, every optimism degree α i gets the median of [0,1] and the weighting value w i are equal (i = 1, 2, 3, 4, 5). To demonstrate the effectiveness of the proposed ranking approaches, we perform experiments on the hybrid weighted IVIS which is shown in Table 1.
For each ranking approach, we divide the controlling parameter p into three instances: (1) 0 < p < 1 (let p = 0.5); (2) p = 1; (3) 1 < p (let p = 2).
The weighting vector
We calculate the weighted standardized cardinality WSC and entire dominance degree and of each object, which are list in Tables 2–4, where denotes the entire dominance degree with respect to inclusion indicator and denotes the entire dominance degree with respect to single interval inclusion indicator. Table 5 shows the ranking results in all cases.
This case study examines how the scalar cardinality and inclusion indicator are used in measuring the dominance of objects in an IVIS. The results show that the two ranking approaches can both obtain complete ranking results of the objects and suppress the interference of noise. In the case study, a function family defined by Equation (3) is introduced into a hybrid weighted IVIS for computing the scalar cardinality, the inclusion indicator and the single interval inclusion indicator. The ranking results may be different if the weight factors w, α and controlling parameter p are changed.
The dominance classes of all objects are
According to Equations.(1) and (2), the entire dominance degree of each object can be computed that D
A
(x1) = D
A
(x2) = D
A
(x3) = D
A
(x4) = D
A
(x5) = D
A
(x6) = D
A
(x7) = D
A
(x8) = D
A
(x9) = D
A
(x10) =0.9. In Table 6, there are some data especially smaller or larger than the other data according to the same second grade index. For instance, l2,2 = 0.01, l1,7 = 0.07, r2,9 = 0.85, l4,8 = 0.05, r5,4 = 0.98, l7,9 = 0.07 etc. The abnormality of these data may be due to recording error or some technological reasons and will influent the information system as noise. Obviously, under the influence of noise, all the water-samples can
Therefore, we can get that the ranking result according to the weighted standardized cardinality WSC and entire dominance degree are the same that x3 ≽ x9 ≽ x4 ≽ x7 ≽ x1 ≽ x5 ≽ x10 ≽ x6 ≽ x2 ≽ x8, and the ranking result according to the entire dominance degree is x3 ≽ x4 ≽ x9 ≽ x7 ≽ x1 ≽ x5 ≽ x10 ≽ x2 ≽ x6 ≽ x8. Although local differences occur in these two ranking results, their overall trends are consistent.
Conclusions
Weighted standardized cardinality and inclusion indicator of IVF-sets are two kinds of measuring functions, the former can reflect the power of an IVF-set and the latter can measure the degree of an IVF-set being a subset of another. In our study, they are used to compare the dominance of objects from absolute and relative points of view, respectively. The novelties of the approach are mainly represented from two aspects: completely ranking and overcoming the interference of noise with the objectivity of ranking result. The hybrid weighted IVIS is a generalization of classical information system, so the ranking methods based on weighted standardized cardinality and inclusion indicator are also used in classical information system. The controlling parameter p in the function family provide us with multiple choices for weighted standardized cardinality and inclusion indicator in practical application.
Acknowledgments
This work is supported by the National Natural Science Foundation of China (Nos. 61170107, 61300121), the Natural Science Foundation of Hebei Province of PR China (Nos. A2014205157,A2013208175), the Science Foundation of Hebei Education Department of PR China (No. Q2012093), and Training Program for Leading Talents of Innovation Teams in the Universities of Hebei Province (LJRC022).
