Abstract
Data Envelopment Analysis (DEA) is a widely applied approach for measuring the relative efficiencies of a set of Decision Making Units (DMUs), which use multiple inputs to produce multiple outputs. In real world problems the data available may be imprecise. With fuzzy inputs and fuzzy outputs, the optimality conditions for the crisp DEA Models need to be clarified and generalized. The corresponding fuzzy linear programming problem is usually solved using some ranking methods for fuzzy sets. The methods of solving fuzzy DEA problems can be categorized into four distinct approaches: tolerance approach, defuzzification approach, α-level based approach, and fuzzy ranking approach In this paper, we introduce a new α-level based approach and a numerical method for ranking DMUs with fuzzy data.
Keywords
Introduction
Since the pioneering work of Charnes et al. [1], data envelopment analysis (DEA) has been extensively used for evaluating the performance of many activities. DEA evaluates the relative efficiency of a set of homogeneous decision making units (DMUs) by using a ratio of the weighted sum of outputs to the weighted sum of inputs.
In recent years, fuzzy set theory has been proposed as a way to quantify imprecise and vague data in DEA models. The DEA models with fuzzy data (“fuzzy DEA” models) can more realistically represent real-world problems than the conventional DEA models.
Hatami-Marbini et al. [14] presented four groups of fuzzy DEA approaches: tolerance approach, defuzzification approach, α - level based approach, and fuzzy ranking approach.
Many researches focused on fuzzy data envelopment analysis and they provided structures to measure the relative efficiency of Decision Making Units [15, 16, 15, 16].These methods are mostly based on transforming the fuzzy model into a linear or parametric linear models or defuzzification and the obtained efficiency scores by these models have crisp values. But when the inputs and outputs of the DMUs are fuzzy numbers it is expected that the efficiency score has also fuzziness. In this paper we consider the efficiency score as a fuzzy set and then we prove that this fuzzy set has the properties of fuzzy number. The proposed fuzzy efficiency score is used to develop a ranking order of the DMUs. This ranking model is based on efficiency concept and is appropriate only for fuzzy efficiency score and is not suitable for other fuzzy numbers.
This paper is organized as follows: A DEA model and its use in interval data are introduced in section 2. Section 3 presents an approach to solving the afore-mentioned fuzzy DEA model by the α-level method and an algorithm for ranking DMUs. Also the approach is illustrated by solving an example. The conclusion is provided in Section 4.
Interval data in DEA
Consider n decision making units DMU j ,j = 1,...,n, where each DMU consumes input levels x ij , i = 1,...,m, to produce output levels y rj , r = 1,...,s. Let J = {1, . . . , n} and suppose that X j = (x1j, . . . , x mj ) T and Y j = (y1j, . . . , y sj ) T are the vectors of input and output values, respectively, for DMU j , in which it has been assumed that X j ≥ 0, X j ≠ 0 and Y j ≥ 0, Y j ≠ 0. The relative efficiency score of DMU o , o ∈ {1, . . . , n}, is obtained from the following model which is called the input-oriented CCR envelopment model
It can be proven that 0 < θ* ≤ 1 and DMU o is (technically) efficient in the CCR model if and only if θ* = 1. Otherwise, the DMU o is inefficient [1].
Unlike the original CCR model, let us assume here that the levels of inputs and outputs are not known exactly, but the true input and output data are known to lie within bounded intervals, i.e., and .
The worstcase of DMU o is defined as (i = 1,...,m), and (r = 1,...,s). Similarly, the bestcase of DMU o is defined as (i = 1,...,m), and (r = 1,...,s).
According to above explanations the efficiency of a DMU can be defined as an interval that the upper limit of this interval is obtained from the optimistic viewpoint and the lower limit is obtained from the pessimistic viewpoint. The following model provides such an upper bound for DMU
o
[13,14, 13,14]:
We denote by and the efficiency score attained by DMU o in (1) and (2) and name them the best case and the worst case of efficiency score, respectively.
We show that . Suppose that and are the feasible regions in relation to (1) and (2), respectively, and let (λ1, . . . , λ
n
, θ) be an optimal solution for (1). Because of optimality, at least for one i, (i=1,...,m), the constraints below;
is binding, such as,
Models (1) and (2) provide a bounded interval that contains the true efficiency score for each DMU and is called efficiency value interval.
On the basis of the above efficiency value intervals, DMUs can be classified in three subsets as follows:
Suppose that the exact level of x
ij
lies in or , such that
In this case, the efficiency value interval of DMU o can be attained by two intervals and , where and are the optimal solutions of (1) and (2), respectively, when x ij and y rj are in and , respectively, (k=1,2). It can be easily shown that the efficiency value intervals do not enlarge when data interval shrink.
Since we have
Hence when all interval data of inputs and outputs are shrinking for all DMUs, then the efficiency value interval of DMU o is not enlarging. Therefore
- If a DMU is in E++ for an interval data, then it remains in E++ after the shrinking of the data intervals of all DMUs.
- If a DMU is in E- for an interval data, then it remains in E- after the shrinking of the data intervals of all DMUs.
Fuzzy DEA
is called the membership function of x in . Range of the membership function is a subset of the nonnegative real numbers whose supremum is finite. If , the fuzzy set is called normal. A nonempty fuzzy set can always be normalized by dividing by . As a matter of convenience, we will generally assume that fuzzy sets are normalized and elements with a zero degree of membership are normally not listed.
Alternatively, a fuzzy set is convex if all α - level sets are convex.
There exists exactly one with (unimodal). is piecewise continuous.
DEA with fuzzy data
Assume we have n DMUs where DMU j (j = 1, . . . , n) consumes input levels to produce output levels , where all and are convex bounded fuzzy numbers.
As the input and the output levels of DMUs are fuzzy numbers, we expect the relative efficiency score of DMU o (o ∈ {1, . . . , n}) also to be a fuzzy number.
One can evaluate the relative efficiency score of DMU
o
by the CCR model in the output-oriented form, as follows:
A target of DEA models is the relative efficiency score evaluation for each DMU that is obtained by computing a numerical level θ. When the i-th input level of the j-th decision making unit is a fuzzy number such as a triangular fuzzy number , it means DMU j has, in an uncertain manner, used an imprecise level of the i-th input and m is one of them with a complete degree of membership 1. Any number that is less than m - δ and greater than m + β cannot be the value for x ij . All numbers in interval [m - δ, m + β] can be a value for x ij with some degree of membership.
With the assumption of fuzzy numbers and being convex and bounded, from [12] it can be derived that each α-level of and is a bounded interval as follows:
By any choice of α, a set is obtained that consists of n DMUs with interval data, and we can compute its efficiency interval by using (1), (2). Therefore, for any α, an efficiency interval is obtained. As previously mentioned, the efficiency interval of DMU
o
for an α-level is denoted by , where
and from Theorem 1, the proof is completed.□
From the above theorem, the efficiency value interval shrinks as α increases from 0 to 1. From properties of nested intervals the efficiency score can be introduced by a fuzzy set.
Figure. 1. represents an example for fuzzy efficiency score . In the following part, we show that fuzzy set satisfies fuzzy number conditions [12]. But, first, we present some important results. The following theorem states that the efficiency interval obtained from a special α-level data is equal to the α-level , that is denoted by .
Let ; therefore from Definition 5 we concluded and so . Conversely, let , then . By considering
we get , and from Theorem 3 we have,
Unimodality is another property of function which is necessary for a fuzzy number.
Hereafter, we assume .
From the above theorems, the function is maximized at point . In the next theorem, we show that this function is increasing and decreasing before and after , respectively.
From the above assumption and Definition 5, we have , therefore, , which contracts the initial assumption. In a similar manner, it can be shown that the function is decreasing on interval if .□
The following theorem shows that is a piecewise continuous function on the interval (0, 1].
b) If, thenis either continuous or left continuous on the interval.
and from Theorem 4, we also have
We know that belongs to all above sets and
Since , it results in .
There are two cases for the relationship between and , as follows:
In case i, we show that the function is continuous at , because by choosing
for any x, , and also from Theorem 6 and Definition 5, we have:
Similar to case (ii), the result also follows for α = 0 and the proof of (b) is similar to (a) and is left to the readers.□
From the above theorems, we conclude that if the data for the DMUs are fuzzy numbers, then the relative efficiency scores of the DMUs are also fuzzy numbers. Below, we define the concept of an efficient DMU in fuzzy DEA.
The above definition has a very strong condition for a DMU to be efficient (see Fig. 2). Even it may happen that none of DMUs is efficient. Because it is possible that E++ is empty when all DMUs have interval data. In the next section, we present a numerical method for the ranking of DMUs with fuzzy data.
Definitions 5 and 6 are fundamentally theoretical criteria with respect to the efficiency score and the efficient DMUs, but in practice, it is not possible that or efficient DMUs are exactly determined by these definitions.
Here we present a numerical ranking method to determine the rank order of DMUs approximately. Consider the following k real numbers
So from Theorems 3, 4, we have
Since each α-level set of is a closed interval, as mentioned previously, the following statements can be considered. If for some l′ (l′ < k):
–DMU o ∈ E++, then DMU o ∈ E++ for all l where l > l′.
–DMU o ∈ E-, then DMU o ∈ E- for all l where l > l′.
Moreover, for α k = 1, we know the inputs and outputs are crisp data and . At this level, if DMU o is efficient (θ o = 1), then DMU o is in E++ or E+ for all α l < 1, and otherwise (θ o < 1) then DMU o is in E+ or E- for all α1< 1.
It must be considered that the nearer α1 is to zero, the more exactly the efficient units are assessed. This does not mean that at least one unit will be certainly efficient. Also, after studying the ranking method below, it will be clear that the higher the number of , with the distances between them selected equally, the more fairly and precisely the ranking of the DMUs is carried out.
Therefore, suppose α1> 0 is sufficiently small and the other α1 are selected as follows:
Suppose p = 1 and J = {DMU
l
, . . . , DMU
n
}. There are two cases for as:
If , then one can ask which DMUs are more efficient, the DMUs in or in ? We claim the DMUs in are more efficient because these DMUs are the DMUs in that join for φ2-level data. Therefore, when , we turn to .
With the assumption, there are also two cases for . If , similary, we turn to and this continues until for some l′ (l′ = 1, . . . , k). There is certainly such an l′, because the end.
In this case, let
Now again we form and (l=1,...,k) for the reduced set J and compute E p similarly. This process continues until J = φ. It can be at most repeated for n iterations. In the end, we will obtain E1, . . . , E p (p ≤ n) from the above process. Clearly the DMUs in E1 have higher efficiency scores than the ones in E2, . . . , E p and also the DMUs in E2 have the same relation with other DMUs in E3, . . . , E p , and so on. Therefore, by arranging DMUs on the basis of these sets we can rank them. Figure. 3 shows a DMU with fuzzy efficiency score is more efficient than the DMU with fuzzy efficiency score .
It is possible that E1, . . . , E p have more than one member, so some DMUs may have the same rank. This problem can be removed by the following change to determine E1, . . . , E p . In each iteration, if E p has more than one member, then by going back to one level before , i.e., , we choose a DMU in E p which has the highest as the member of E p and omit the other members. Even, if E p still has more than one member, we can go back to αl-2,...,α1 and choose E p as a singleton. In the end, if E p is not a singleton yet, then we can conclude that all members of E p have the same rank order. In the following figure we can see that DMU with fuzzy efficiency score is more efficient than the DMU with fuzzy efficiency score when E p is not singleton.
Algorithm
Application in a practical example
The performance of teachers in an Iranian educational department is evaluated based on the indices introduced in the evaluation questionnaires which are filled out by students.
Assume that the fuzzy triangular number is the i-th input value, i = 1, . . . , m, and the fuzzy triangular number is the r-th out value, r = 1, . . . , s, of the j-th DMU, j = 1, . . . , n, where , , , are all non-negative numbers.
If , are positive numbers, then has the following membership function in :
One can discuss the membership function of in a similar manner.
By previously mentioned, for α ∈ (0, 1], the α-level of and will be as follows
So, for the fuzzy triangular numbers and , we have
Now, for α ∈ [0, 1], the efficiency bounds of DMU o resulting from an α-level, , can be obtained as follows, regarding models (5-6)
For each teacher, one of the qualitative expressions: Excellent, Good, Average, Poor, and Very Poor is assigned to each index by the students.
A suitable choice of fuzzy numbers for the students’ responses is to use fuzzy triangular numbers. Table 2 shows five typical triangular numbers in the interval [0, 1], corresponding to the responses.
The above numbers are depicted in Fig. 5.
To calculate the score of a certain teacher for each index, we perform the fuzzy addition on all the responses to that certain index and then divide the sum by the number of respondents. This operation, called normalizing the data, causes the resulting scores to lie in the interval [0, 1]. Normalizing data are independent of the unit of measurement and the number of respondents. Therefore, the effect of the number of the respondents not being equal from one teacher to the other is removed.
The indices for the evaluation of units are always divided into two categories, inputs and outputs,which are represented in Table 1.
Therefore, in the model employed for evaluating the teachers, they are considered as DMUs, such that have 11 outputs (s = 11) but no inputs (m = 0). The outputs of the j-th teacher are denoted by y1j to y11j in order of appearance in Table 1. The CCR model for this situation is as follows.
where n is the number of the teachers who are compared together. Thus, models (9-10) to determine the efficiency bounds of DMU o (o ∈ {1, . . . , 11}) for a certain α-level will be as follows.
and
Replacing model (5-6) by models (12-13) in the algorithm in the previous section makes it possible to rank the teachers.
Now, consider a ranking to be carried out for five teachers, the overall data of which is presented in Tables 3–7, according to the 11 questions in Table 1. In the last column, the score for each index is obtain for each teacher as a fuzzy triangular number, using the fuzzy valuing method in Table 2. Note that the number of respondents varies from one teacher to another, a problem which has been solved by normalizing the data.
By employing the ranking algorithm, all the teachers belong to the class E+ for α < 0.4, and teacher number 4 belongs to the class E++ for α = 0.4, therefore, teacher number 4 has the first rank. By omitting and employing the ranking algorithm again, teachers number 1 and 5 belong to the class E++ for α = 0.2, by using the algorithm E p , teacher number 5 has the second rank. Now, we also omit teacher 5, and employ algorithm again, teacher number 1 has the third rank for α = 0.2, and after omitting teacher number 1, teacher number 2 has the fourth rank for α = 0.1, and in the end, we will obtain that teacher number 3 has the fifth rank.
In real world problems the data available may be imprecise or in fuzzy format. This type of data makes the calculation of efficiency score more complicated. In this paper inputs and outputs of the decision making units are assumed fuzzy numbers. In such a condition it is expected that the efficiency score is an imprecise number. we assumed the efficiency score as a fuzzy number and we proposed a method based on α-cuts and interval data envelopment analysis to show the fuzzy structure of efficiency score. The properties of this fuzzy efficiency score allowed us to introduce a pattern for ranking decision making units. What distinguishes this model from others is proposing an algorithm for ranking fuzzy efficiency scores which is based on efficiency concept and is not valid for ranking other fuzzy numbers.
