Ranking units in Data Envelopment Analysis with fuzzy data

Abstract

Data Envelopment Analysis (DEA) is a widely applied approach for measuring the relative efficiencies of a set of Decision Making Units (DMUs), which use multiple inputs to produce multiple outputs. In real-world problems the available data may be imprecise. With fuzzy inputs and fuzzy outputs, the optimality conditions for the crisp DEA models need to be clarified and generalized. The corresponding fuzzy linear programming problem is usually solved using some ranking method for fuzzy sets. The methods for solving fuzzy DEA problems can be categorized into four distinct approaches: the tolerance approach, the defuzzification approach, the α-level based approach, and the fuzzy ranking approach. In this paper, we introduce a new α-level based approach and a numerical method for ranking DMUs with fuzzy data.

Keywords

Data envelopment analysis (DEA); fuzzy mathematical programming; efficiency; interval data; fuzzy data

1 Introduction

Since the pioneering work of Charnes et al. [1], data envelopment analysis (DEA) has been extensively used for evaluating the performance of many activities. DEA evaluates the relative efficiency of a set of homogeneous decision making units (DMUs) by using a ratio of the weighted sum of outputs to the weighted sum of inputs.

In recent years, fuzzy set theory has been proposed as a way to quantify imprecise and vague data in DEA models. The DEA models with fuzzy data (“fuzzy DEA” models) can more realistically represent real-world problems than the conventional DEA models.

Hatami-Marbini et al. [14] presented four groups of fuzzy DEA approaches: tolerance approach, defuzzification approach, α - level based approach, and fuzzy ranking approach.

Many researchers have focused on fuzzy data envelopment analysis and have provided structures to measure the relative efficiency of decision making units [15, 16]. These methods are mostly based on transforming the fuzzy model into a linear or parametric linear model, or on defuzzification, and the efficiency scores obtained from these models are crisp values. But when the inputs and outputs of the DMUs are fuzzy numbers, the efficiency score is expected to be fuzzy as well. In this paper we consider the efficiency score as a fuzzy set and then prove that this fuzzy set has the properties of a fuzzy number. The proposed fuzzy efficiency score is used to develop a ranking order of the DMUs. This ranking model is based on the efficiency concept; it is appropriate only for fuzzy efficiency scores and is not suitable for other fuzzy numbers.

This paper is organized as follows. Section 2 introduces a DEA model and its use with interval data. Section 3 presents an approach to solving the aforementioned fuzzy DEA model by the α-level method, together with an algorithm for ranking DMUs, and illustrates the approach with an example. The conclusion is provided in Section 4.

2 Interval data in DEA

Consider n decision making units DMU_j,j = 1,...,n, where each DMU consumes input levels x_ij, i = 1,...,m, to produce output levels y_rj, r = 1,...,s. Let J = {1, . . . , n} and suppose that X_j = (x_1j, . . . , x_mj) ^T and Y_j = (y_1j, . . . , y_sj) ^T are the vectors of input and output values, respectively, for DMU_j, in which it has been assumed that X_j ≥ 0, X_j ≠ 0 and Y_j ≥ 0, Y_j ≠ 0. The relative efficiency score of DMU_o, o ∈ {1, . . . , n}, is obtained from the following model which is called the input-oriented CCR envelopment model

$\begin{array}{l} θ^{*} = \min θ \\ \text{s.t.} \; \sum_{j = 1}^{n} λ_{j} x_{i j} \leq θ x_{i o}, \; i = 1, ..., m \\ \sum_{j = 1}^{n} λ_{j} y_{r j} \geq y_{r o}, \; r = 1, ..., s \\ λ_{j} \geq 0, \; j = 1, ..., n . \end{array}$

It can be proven that 0 < θ^* ≤ 1 and DMU_o is (technically) efficient in the CCR model if and only if θ^* = 1. Otherwise, the DMU_o is inefficient [1].

Unlike the original CCR model, let us assume here that the levels of inputs and outputs are not known exactly, but the true input and output data are known to lie within bounded intervals, i.e., $x_{ij} \in [x_{ij}^{L}, x_{ij}^{U}]$ and $y_{rj} \in [y_{rj}^{L}, y_{rj}^{U}]$ .

The worst case of DMU_o is defined as $x_{io} = x_{io}^{U}$ (i = 1,...,m), and $y_{ro} = y_{ro}^{L}$ (r = 1,...,s). Similarly, the best case of DMU_o is defined as $x_{io} = x_{io}^{L}$ (i = 1,...,m), and $y_{ro} = y_{ro}^{U}$ (r = 1,...,s).

According to the above explanations, the efficiency of a DMU can be defined as an interval whose upper limit is obtained from the optimistic viewpoint and whose lower limit is obtained from the pessimistic viewpoint. The following model provides such an upper bound for DMU_o [13, 14]:

$\begin{array}{l} h_{o}^{U} = \min θ \\ \text{s.t.} \; \sum_{j \neq o} λ_{j} x_{i j}^{U} + λ_{o} x_{i o}^{L} \leq θ x_{i o}^{L}, \; i = 1, ..., m \\ \sum_{j \neq o} λ_{j} y_{r j}^{L} + λ_{o} y_{r o}^{U} \geq y_{r o}^{U}, \; r = 1, ..., s \\ λ_{j} \geq 0, \; j = 1, ..., n . \end{array}$ (1)

The model below provides a lower bound of the efficiency score for DMU_o:

$\begin{array}{l} h_{o}^{L} = \min θ \\ \text{s.t.} \; \sum_{j \neq o} λ_{j} x_{i j}^{L} + λ_{o} x_{i o}^{U} \leq θ x_{i o}^{U}, \; i = 1, ..., m \\ \sum_{j \neq o} λ_{j} y_{r j}^{U} + λ_{o} y_{r o}^{L} \geq y_{r o}^{L}, \; r = 1, ..., s \\ λ_{j} \geq 0, \; j = 1, ..., n . \end{array}$ (2)

We denote by $h_{o}^{U}$ and $h_{o}^{L}$ the efficiency score attained by DMU_o in (1) and (2) and name them the best case and the worst case of efficiency score, respectively.
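As an illustration only, consider the special case of one input and one output (m = s = 1) with strictly positive data. In that case the CCR score reduces to the ratio of DMU_o's productivity y_o/x_o to the best productivity in the set, so the bounds in (1) and (2) can be evaluated without an LP solver. The sketch below relies on that assumption; the function name `interval_efficiency` and the sample data are hypothetical and are not taken from the paper. For general m and s, Models (1) and (2) have to be solved as linear programs.

```python
from typing import List, Tuple

Interval = Tuple[float, float]  # (lower, upper)


def interval_efficiency(x: List[Interval], y: List[Interval], o: int) -> Tuple[float, float]:
    """Return (h_L, h_U) for DMU o, assuming one input, one output, positive data."""
    others = [j for j in range(len(x)) if j != o]

    # Model (1): DMU_o at its best case, all other DMUs at their worst case.
    best_o = y[o][1] / x[o][0]
    worst_others = max((y[j][0] / x[j][1] for j in others), default=0.0)
    h_upper = best_o / max(best_o, worst_others)

    # Model (2): DMU_o at its worst case, all other DMUs at their best case.
    worst_o = y[o][0] / x[o][1]
    best_others = max((y[j][1] / x[j][0] for j in others), default=0.0)
    h_lower = worst_o / max(worst_o, best_others)

    return h_lower, h_upper


# Hypothetical interval data for three DMUs (input intervals, output intervals).
x = [(2.0, 3.0), (4.0, 5.0), (3.0, 3.0)]
y = [(4.0, 6.0), (4.0, 5.0), (3.0, 3.0)]
bounds = [interval_efficiency(x, y, o) for o in range(3)]
```

With these made-up numbers the first DMU attains 1 in both cases, while the other two have upper bounds below 1.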

We show that $h_{o}^{L} \leq h_{o}^{U}$ . Suppose that $S_{o}^{U}$ and $S_{o}^{L}$ are the feasible regions of (1) and (2), respectively, and let (λ₁, . . . , λ_n, θ) be an optimal solution of (1). Because of optimality, the constraint

$\sum_{j \neq o} λ_{j} x_{ij}^{U} + λ_{o} x_{io}^{L} \leq θ x_{io}^{L}$

is binding for at least one i (i = 1,...,m), that is,

$\sum_{j \neq o} λ_{j} x_{ij}^{U} = (θ - λ_{o}) x_{io}^{L},$ and since λ_j, $x_{ij}^{U}$ and $x_{io}^{L}$ are nonnegative, θ - λ_o ≥ 0 and λ_o ≤ θ ≤ 1. Therefore, for all i and r,

$\sum_{j \neq o} λ_{j} x_{ij}^{L} \leq \sum_{j \neq o} λ_{j} x_{ij}^{U} \leq (θ - λ_{o}) x_{io}^{L} \leq (θ - λ_{o}) x_{io}^{U},$ $\sum_{j \neq o} λ_{j} y_{rj}^{U} \geq \sum_{j \neq o} λ_{j} y_{rj}^{L} \geq (1 - λ_{o}) y_{ro}^{U} \geq (1 - λ_{o}) y_{ro}^{L},$ so (λ₁, . . . , λ_n, θ) belongs to $S_{o}^{L}$ , i.e., it is a feasible solution of (2), and therefore $h_{o}^{L} \leq h_{o}^{U}$ .

Models (1) and (2) provide a bounded interval $[h_{o}^{L}, h_{o}^{U}]$ that contains the true efficiency score for each DMU and is called efficiency value interval.

On the basis of the above efficiency value intervals, DMUs can be classified into three subsets as follows:

$\begin{array}{l} E^{+ +} = \{ j \in J \mid h_{j}^{L} = 1 \}, \\ E^{+} = \{ j \in J \mid h_{j}^{L} < 1, \; h_{j}^{U} = 1 \}, \\ E^{-} = \{ j \in J \mid h_{j}^{U} < 1 \} . \end{array}$ (3)
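The classification in (3) can be sketched in a few lines once the bounds are available. The function name `classify`, the tolerance `tol` (used because computed bounds are floating-point numbers) and the sample bounds are assumptions made for this illustration.

```python
from typing import Dict, List, Tuple


def classify(bounds: Dict[str, Tuple[float, float]], tol: float = 1e-9) -> Dict[str, List[str]]:
    """Split DMUs into E++, E+ and E- from their (h_L, h_U) bounds, as in (3)."""
    classes: Dict[str, List[str]] = {"E++": [], "E+": [], "E-": []}
    for name, (h_lower, h_upper) in bounds.items():
        if h_lower >= 1.0 - tol:
            classes["E++"].append(name)   # efficient even in the worst case
        elif h_upper >= 1.0 - tol:
            classes["E+"].append(name)    # efficient only in the best case
        else:
            classes["E-"].append(name)    # inefficient even in the best case
    return classes


# Hypothetical efficiency value intervals.
example = {"A": (1.0, 1.0), "B": (0.8, 1.0), "C": (0.5, 0.9)}
result = classify(example)
```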

2.1 Nested interval data

Suppose that the exact level of x_ij lies in $[x_{ij}^{1, L}, x_{ij}^{1, U}]$ or $[x_{ij}^{2, L}, x_{ij}^{2, U}]$ , such that $[x_{ij}^{1, L}, x_{ij}^{1, U}] \subseteq [x_{ij}^{2, L}, x_{ij}^{2, U}], i = 1, . . ., m, j = 1, . . ., n$ and similarly, the exact level of y_rj lies in $[y_{rj}^{1, L}, y_{rj}^{1, U}]$ or $[y_{rj}^{2, L}, y_{rj}^{2, U}]$ , such that $[y_{rj}^{1, L}, y_{rj}^{1, U}] \subseteq [y_{rj}^{2, L}, y_{rj}^{2, U}], r = 1, . . ., s, j = 1, . . ., n .$

In this case, the efficiency value interval of DMU_o can be attained by two intervals $[h_{o}^{1, L}, h_{o}^{1, U}]$ and $[h_{o}^{2, L}, h_{o}^{2, U}]$ , where $h_{o}^{k, L}$ and $h_{o}^{k, U}$ are the optimal values of (2) and (1), respectively, when x_ij and y_rj are in $[x_{ij}^{k, L}, x_{ij}^{k, U}]$ and $[y_{rj}^{k, L}, y_{rj}^{k, U}]$ , respectively, (k=1,2). It can be easily shown that the efficiency value intervals do not enlarge when the data intervals shrink.

Theorem 1. With the above assumptions, $[h_{o}^{1, L}, h_{o}^{1, U}] \subseteq [h_{o}^{2, L}, h_{o}^{2, U}] .$

Proof. It is sufficient to show $h_{o}^{2, L} \leq h_{o}^{1, L}$ and $h_{o}^{1, U} \leq h_{o}^{2, U}$ . Suppose that (λ₁, . . . , λ_n, θ) is an optimal solution for (2) according to $[x_{ij}^{1, L}, x_{ij}^{1, U}]$ and $[y_{rj}^{1, L}, y_{rj}^{1, U}]$ interval data. As mentioned previously $θ - λ_{o} \geq 0, 1 - λ_{o} \geq 0 .$

Since we have $x_{ij}^{2, L} \leq x_{ij}^{1, L}, x_{ij}^{1, U} \leq x_{ij}^{2, U}, y_{rj}^{2, L} \leq y_{rj}^{1, L}, y_{rj}^{1, U} \leq y_{rj}^{2, U}$ for j = 1, . . . , n, i = 1, . . . , m and r = 1, . . . , s, we can obtain $\sum_{j \neq o} λ_{j} x_{ij}^{2, L} \leq \sum_{j \neq o} λ_{j} x_{ij}^{1, L} \leq (θ - λ_{o}) x_{io}^{1, U} \leq (θ - λ_{o}) x_{io}^{2, U}$ and $\sum_{j \neq o} λ_{j} y_{rj}^{2, U} \geq \sum_{j \neq o} λ_{j} y_{rj}^{1, U} \geq (1 - λ_{o}) y_{ro}^{1, L} \geq (1 - λ_{o}) y_{ro}^{2, L} .$ Hence (λ₁, . . . , λ_n, θ) is a feasible solution for (2) according to $[x_{ij}^{2, L}, x_{ij}^{2, U}]$ and $[y_{rj}^{2, L}, y_{rj}^{2, U}]$ interval data, therefore $h_{o}^{2, L} \leq h_{o}^{1, L}$ . Similarly, it can be shown that $h_{o}^{1, U} \leq h_{o}^{2, U}$ and the proof is completed.□

Hence, when the data intervals of the inputs and outputs shrink for all DMUs, the efficiency value interval of DMU_o does not enlarge. Therefore:

- If a DMU is in E⁺⁺ for given interval data, then it remains in E⁺⁺ after the data intervals of all DMUs shrink.

- If a DMU is in E^- for given interval data, then it remains in E^- after the data intervals of all DMUs shrink.

3 Fuzzy DEA

Definition 1. If X is a collection of objects denoted generically by x, then a fuzzy set $\tilde{A}$ in X is a set of ordered pairs: $\tilde{A} = \{ (x, μ_{\tilde{A}} (x)) \mid x \in X \} .$

$μ_{\tilde{A}} (x)$ is called the membership function of x in $\tilde{A}$ . The range of the membership function is a subset of the nonnegative real numbers whose supremum is finite. If $\sup_{x} μ_{\tilde{A}} (x) = 1$ , the fuzzy set $\tilde{A}$ is called normal. A nonempty fuzzy set $\tilde{A}$ can always be normalized by dividing $μ_{\tilde{A}} (x)$ by $\sup_{x} μ_{\tilde{A}} (x)$ . As a matter of convenience, we will generally assume that fuzzy sets are normalized, and elements with a zero degree of membership are normally not listed.

Definition 2. The (crisp) set of elements that belong to the fuzzy set $\tilde{A}$ at least to the degree α (α > 0) is called the α-level set: $A_{α} = \{ x \in X \mid μ_{\tilde{A}} (x) \geq α \}$

Theorem 2. If $\tilde{A}$ is a fuzzy set and 0 < α₁ < α₂, then $A_{α_{2}} \subseteq A_{α_{1}}$ .
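A small sketch may help to make Definition 2 and the nesting property of Theorem 2 concrete. It assumes a fuzzy set on a finite universe stored as a dictionary from elements to membership degrees; the name `alpha_level_set` and the sample set are hypothetical.

```python
from typing import Dict, Set


def alpha_level_set(membership: Dict[float, float], alpha: float) -> Set[float]:
    """Return the crisp set of elements whose membership degree is at least alpha."""
    if not 0.0 < alpha <= 1.0:
        raise ValueError("alpha must lie in (0, 1]")
    return {x for x, mu in membership.items() if mu >= alpha}


# Hypothetical normalized fuzzy set on the universe {1, 2, 3, 4}.
fuzzy_a = {1.0: 0.2, 2.0: 0.6, 3.0: 1.0, 4.0: 0.5}
level_low = alpha_level_set(fuzzy_a, 0.3)
level_high = alpha_level_set(fuzzy_a, 0.6)
```

Raising α from 0.3 to 0.6 drops the element 4, so the higher level set is contained in the lower one.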

Definition 3. A fuzzy set $\tilde{A}$ is convex if for each x₁, x₂ ∈ X and λ ∈ [0, 1] $μ_{\tilde{A}} (λ x_{1} + (1 - λ) x_{2}) \geq \min \{ μ_{\tilde{A}} (x_{1}), μ_{\tilde{A}} (x_{2}) \}$

Alternatively, a fuzzy set is convex if all α - level sets are convex.

Definition 4. A fuzzy number $\tilde{M}$ is a convex normalized fuzzy set of the real line $ℝ$ such that

(i) There exists exactly one $x_{0} \in ℝ$ with $μ_{\tilde{M}} (x_{0}) = 1$ (unimodal).

(ii) $μ_{\tilde{M}} (x)$ is piecewise continuous.

3.1 DEA with fuzzy data

Assume we have n DMUs where DMU_j (j = 1, . . . , n) consumes input levels ${\tilde{x}}_{ij} (i = 1, . . ., m)$ to produce output levels ${\tilde{y}}_{rj} (r = 1, . . ., s)$ , where all ${\tilde{x}}_{ij}$ and ${\tilde{y}}_{rj}$ are convex bounded fuzzy numbers.

As the input and the output levels of DMUs are fuzzy numbers, we expect the relative efficiency score of DMU_o (o ∈ {1, . . . , n}) also to be a fuzzy number.

One can evaluate the relative efficiency score of DMU_o by the CCR model in the input-oriented form, as follows:

$\begin{array}{l} {\tilde{θ}}_{o}^{*} = \min \tilde{θ} \\ \text{s.t.} \; \sum_{j \neq o} λ_{j} {\tilde{x}}_{i j} \leq \tilde{θ} {\tilde{x}}_{i o}, \; i = 1, ..., m \\ \sum_{j \neq o} λ_{j} {\tilde{y}}_{r j} \geq {\tilde{y}}_{r o}, \; r = 1, ..., s \\ λ_{j} \geq 0, \; j = 1, ..., n, \end{array}$ (4)

where we assume that λ_j is a real variable. In the above model, we use the algebraic sum and product as defined in [12]. The inequalities in Model (4) must be interpreted as fuzzy relations. Therefore, Model (4) is not a linear programming problem and cannot be solved directly with linear programming techniques. In the following, a method for solving such models, with a review of the efficiency concept, is presented.

A target of DEA models is to evaluate the relative efficiency score of each DMU, which is obtained by computing a numerical level θ. When the i-th input level of the j-th decision making unit is a fuzzy number, such as a triangular fuzzy number ${\tilde{x}}_{ij} = (m, δ, β)$ , it means that DMU_j has used an imprecisely known level of the i-th input, and m is the value with the full degree of membership 1. Any number that is less than m - δ or greater than m + β cannot be the value of x_ij. All numbers in the interval [m - δ, m + β] can be a value of x_ij with some degree of membership.

With the assumption of fuzzy numbers ${\tilde{x}}_{ij}$ and ${\tilde{y}}_{rj}$ being convex and bounded, from [12] it can be derived that each α-level of ${\tilde{x}}_{ij}$ and ${\tilde{y}}_{rj}$ is a bounded interval as follows: $[{\tilde{x}}_{ij}]_{α} = [x_{ij}^{α, L}, x_{ij}^{α, U}], [{\tilde{y}}_{rj}]_{α} = [y_{rj}^{α, L}, y_{rj}^{α, U}],$ $α \in (0, 1], j = 1, . . ., n, i = 1, . . ., m, r = 1, . . ., s .$

For any choice of α, a set is obtained that consists of n DMUs with interval data, and we can compute its efficiency interval by using (1) and (2). Therefore, for any α, an efficiency interval is obtained. The efficiency interval of DMU_o for an α-level is denoted by $[h_{o}^{α, L}, h_{o}^{α, U}]$ , where

$\begin{array}{l} h_{o}^{α, L} = \min θ \\ \text{s.t.} \; \sum_{j \neq o} λ_{j} x_{i j}^{α, L} + λ_{o} x_{i o}^{α, U} \leq θ x_{i o}^{α, U}, \; i = 1, ..., m \\ \sum_{j \neq o} λ_{j} y_{r j}^{α, U} + λ_{o} y_{r o}^{α, L} \geq y_{r o}^{α, L}, \; r = 1, ..., s \\ λ_{j} \geq 0, \; j = 1, ..., n \end{array}$ (5)

and

$\begin{array}{l} h_{o}^{α, U} = \min θ \\ \text{s.t.} \; \sum_{j \neq o} λ_{j} x_{i j}^{α, U} + λ_{o} x_{i o}^{α, L} \leq θ x_{i o}^{α, L}, \; i = 1, ..., m \\ \sum_{j \neq o} λ_{j} y_{r j}^{α, L} + λ_{o} y_{r o}^{α, U} \geq y_{r o}^{α, U}, \; r = 1, ..., s \\ λ_{j} \geq 0, \; j = 1, ..., n \end{array}$ (6)

with $h_{o}^{α, L}$ and $h_{o}^{α, U}$ sometimes being equal.

Theorem 3. If 0 < α₁ < α₂ ≤ 1, then $[h_{o}^{α_{2}, L}, h_{o}^{α_{2}, U}] \subseteq [h_{o}^{α_{1}, L}, h_{o}^{α_{1}, U}] .$

Proof. From α-level definition, it can be inferred that for any i, j and r, we have:

$[x_{ij}^{α_{2}, L}, x_{ij}^{α_{2}, U}] \subseteq [x_{ij}^{α_{1}, L}, x_{ij}^{α_{1}, U}]$

$[y_{rj}^{α_{2}, L}, y_{rj}^{α_{2}, U}] \subseteq [y_{rj}^{α_{1}, L}, y_{rj}^{α_{1}, U}]$

and from Theorem 1, the proof is completed.□

From the above theorem, the efficiency value interval shrinks as α increases from 0 to 1. From properties of nested intervals the efficiency score can be introduced by a fuzzy set.

Definition 5. With the above assumptions, the relative efficiency score of DMU_o is the fuzzy set ${\tilde{θ}}_{o}$ on the interval (0, 1] with the membership function ${\tilde{θ}}_{o} (x) = \sup \{ t \mid 0 < t \leq 1, \; x \in [h_{o}^{t, L}, h_{o}^{t, U}] \}$ if there is some t such that $x \in [h_{o}^{t, L}, h_{o}^{t, U}]$ , and ${\tilde{θ}}_{o} (x) = 0$ otherwise.
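In practice the supremum in Definition 5 can only be approximated on finitely many levels. The following sketch assumes that the efficiency intervals have already been computed on a finite grid of t values; the name `efficiency_membership` and the nested sample intervals are hypothetical.

```python
from typing import Dict, Tuple


def efficiency_membership(x: float, levels: Dict[float, Tuple[float, float]]) -> float:
    """Approximate the membership degree of x on a finite grid of levels.

    `levels` maps a level t to the efficiency interval (h_L, h_U) obtained at t.
    The result is the largest grid level whose interval contains x, or 0.0 if
    no interval contains x.
    """
    containing = [t for t, (lo, hi) in levels.items() if lo <= x <= hi]
    return max(containing, default=0.0)


# Hypothetical nested efficiency intervals, shrinking as the level increases.
levels = {
    0.25: (0.50, 1.00),
    0.50: (0.60, 0.90),
    0.75: (0.70, 0.85),
    1.00: (0.80, 0.80),
}
peak = efficiency_membership(0.80, levels)
shoulder = efficiency_membership(0.65, levels)
outside = efficiency_membership(0.40, levels)
```

The value is a lower estimate of the true membership degree, since levels between grid points are not examined.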

Figure 1 shows an example of a fuzzy efficiency score ${\tilde{θ}}_{o}$ . In the following part, we show that the fuzzy set ${\tilde{θ}}_{o}$ satisfies the fuzzy number conditions [12]. But first, we present some important results. The following theorem states that the efficiency interval obtained from the α-level data is equal to the α-level set of ${\tilde{θ}}_{o}$ , which is denoted by $[{\tilde{θ}}_{o}]_{α}$ .

Theorem 4. If 0 < α ≤ 1, then $[{\tilde{θ}}_{o}]_{α} = [h_{o}^{α, L}, h_{o}^{α, U}]$ .

Proof. From Definition 5, we have $[{\tilde{θ}}_{o}]_{α} = \{ x \in (0, 1] \mid {\tilde{θ}}_{o} (x) \geq α \} .$

Let $x \in [h_{o}^{α, L}, h_{o}^{α, U}]$ ; then from Definition 5 we conclude ${\tilde{θ}}_{o} (x) \geq α$ and so $x \in [{\tilde{θ}}_{o}]_{α}$ . Conversely, let $x \in [{\tilde{θ}}_{o}]_{α}$ , then ${\tilde{θ}}_{o} (x) \geq α$ . By considering

${\tilde{θ}}_{o} (x) = \sup \{ t \mid x \in [h_{o}^{t, L}, h_{o}^{t, U}] \} = \bar{α}$

we get $\bar{α} \geq α$ , and from Theorem 3 we have $[h_{o}^{\bar{α}, L}, h_{o}^{\bar{α}, U}] \subseteq [h_{o}^{α, L}, h_{o}^{α, U}]$ ; hence $x \in [h_{o}^{α, L}, h_{o}^{α, U}]$ and the proof is completed.□

Conclusion 1. ${\tilde{θ}}_{o}$ is a convex bounded fuzzy set.

Unimodality is another property of function ${\tilde{θ}}_{o} (x)$ which is necessary for a fuzzy number.

Theorem 5. The function ${\tilde{θ}}_{o} (x)$ is unimodal.

Proof. Since ${\tilde{x}}_{ij}$ and ${\tilde{y}}_{rj}$ are fuzzy numbers, they are unimodal. By choosing α = 1, Models (5) and (6) are converted to a linear programming problem (with crisp data), which is exactly the CCR model and $h_{o}^{1, L} = h_{o}^{1, U}$ . Therefore $[{\tilde{θ}}_{o}]_{1} = \{ \bar{x} \}$ where $0 < \bar{x} = h_{o}^{1, L} = h_{o}^{1, U} \leq 1$ .□

Hereafter, we assume $[{\tilde{θ}}_{o}]_{1} = \{ {\bar{x}}_{o} \}$ .

Conclusion 2. If $[{\tilde{θ}}_{o}]_{α} = \{ x \}$ for any 0 < α < 1, then $x = {\bar{x}}_{o}$ .

From the above theorems, the function ${\tilde{θ}}_{o} (x)$ is maximized at point ${\bar{x}}_{o}$ . In the next theorem, we show that this function is increasing and decreasing before and after ${\bar{x}}_{o}$ , respectively.

Theorem 6. The function ${\tilde{θ}}_{o} (x)$ is increasing on the interval $(0, {\bar{x}}_{o}]$ and if ${\bar{x}}_{o} < 1$ , then it is decreasing on the interval $[{\bar{x}}_{o}, 1]$ .

Proof. Suppose that $0 < x_{1} < x_{2} \leq {\bar{x}}_{o}$ and ${\tilde{θ}}_{o} (x_{1}) = α_{1}, {\tilde{θ}}_{o} (x_{2}) = α_{2} .$ To complete the proof, we need to show that α₁ ≤ α₂. By contradiction, suppose α₂ < α₁. Then by Theorems 3 and 4 we have $[{\tilde{θ}}_{o}]_{α_{1}} \subseteq [{\tilde{θ}}_{o}]_{α_{2}}$ .

From the above assumption and Definition 5, we have $x_{2} \notin [{\tilde{θ}}_{o}]_{α_{1}}$ . Since $[{\tilde{θ}}_{o}]_{α_{1}}$ is an interval that contains both $x_{1}$ and ${\bar{x}}_{o}$ , and $x_{2} \leq {\bar{x}}_{o}$ , it follows that $0 < x_{2} < x_{1} \leq {\bar{x}}_{o}$ , which contradicts the initial assumption. In a similar manner, it can be shown that the function ${\tilde{θ}}_{o} (x)$ is decreasing on the interval $[{\bar{x}}_{o}, 1]$ if ${\bar{x}}_{o} < 1$ .□

The following theorem shows that ${\tilde{θ}}_{o} (x)$ is a piecewise continuous function on the interval (0, 1].

Theorem 7. a) The function ${\tilde{θ}}_{o} (x)$ is either continuous or right continuous on the interval $(0, {\bar{x}}_{o})$ .

b) If ${\bar{x}}_{o} < 1$ , then ${\tilde{θ}}_{o} (x)$ is either continuous or left continuous on the interval $({\bar{x}}_{o}, 1]$ .

Proof. (a) Suppose $\hat{x} \in (0, {\bar{x}}_{o})$ and ${\tilde{θ}}_{o} (\hat{x}) = \hat{α}$ , therefore, $0 \leq \hat{α} < 1$ . Consider an arbitrary number ɛ > 0 and first let $\hat{α} > 0$ . If $\hat{α} - ɛ \not> 0$ , then without loss of generality it can be supposed that $ɛ < \frac{\hat{α}}{2}$ and similarly, if $\hat{α} + ɛ \not< 1$ , then it can be supposed that $ɛ < \frac{1 - \hat{α}}{2}$ , hence, $0 < \hat{α} - ɛ < \hat{α} < \hat{α} + ɛ < 1$ . From Theorems 3 and 4 we have $[{\tilde{θ}}_{o}]_{\hat{α} + ɛ} \subseteq [{\tilde{θ}}_{o}]_{\hat{α}} \subseteq [{\tilde{θ}}_{o}]_{\hat{α} - ɛ}$

and from Theorem 4, we also have $[{\tilde{θ}}_{o}]_{\hat{α} - ɛ} = [h_{o}^{\hat{α} - ɛ, L}, h_{o}^{\hat{α} - ɛ, U}]$ $[{\tilde{θ}}_{o}]_{\hat{α}} = [h_{o}^{\hat{α}, L}, h_{o}^{\hat{α}, U}]$ $[{\tilde{θ}}_{o}]_{\hat{α} + ɛ} = [h_{o}^{\hat{α} + ɛ, L}, h_{o}^{\hat{α} + ɛ, U}] .$

We know that ${\bar{x}}_{o}$ belongs to all above sets and $0 < h_{o}^{\hat{α} - ɛ, L} \leq h_{o}^{\hat{α}, L} \leq h_{o}^{\hat{α} + ɛ, L} \leq {\bar{x}}_{o} .$

Since $\hat{x} \notin [{\tilde{θ}}_{o}]_{\hat{α} + ɛ}$ , it results in $h_{o}^{\hat{α}, L} < h_{o}^{\hat{α} + ɛ, L}$ .

There are two cases for the relationship between $h_{o}^{\hat{α}, L}$ and $h_{o}^{\hat{α} - ɛ, L}$ , as follows:

(i) $h_{o}^{\hat{α} - ɛ, L} < h_{o}^{\hat{α}, L}$

(ii) $h_{o}^{\hat{α} - ɛ, L} = h_{o}^{\hat{α}, L}$

In case (i), the function ${\tilde{θ}}_{o} (x)$ is continuous at $\hat{x}$ , because by choosing $0 < δ < \min \{ (h_{o}^{\hat{α} + ɛ, L} - h_{o}^{\hat{α}, L}), (h_{o}^{\hat{α}, L} - h_{o}^{\hat{α} - ɛ, L}) \}$ , for any x with $\hat{x} - δ < x < \hat{x}$ , Theorem 6 and Definition 5 give ${\tilde{θ}}_{o} (\hat{x}) - ɛ = \hat{α} - ɛ < {\tilde{θ}}_{o} (x) < {\tilde{θ}}_{o} (\hat{x}) = \hat{α}$ and for any x with $\hat{x} < x < \hat{x} + δ$ , we have $\hat{α} = {\tilde{θ}}_{o} (\hat{x}) < {\tilde{θ}}_{o} (x) < {\tilde{θ}}_{o} (\hat{x}) + ɛ = \hat{α} + ɛ;$ therefore, the result is obtained.

In case (ii), the function ${\tilde{θ}}_{o} (x)$ is right continuous at $\hat{x}$ . By choosing $0 < δ < h_{o}^{\hat{α} + ɛ, L} - h_{o}^{\hat{α}, L}$ , for any x with $\hat{x} < x < \hat{x} + δ$ , we have $\hat{α} = {\tilde{θ}}_{o} (\hat{x}) < {\tilde{θ}}_{o} (x) < {\tilde{θ}}_{o} (\hat{x}) + ɛ = \hat{α} + ɛ$ and the result is obtained.

Similar to case (ii), the result also follows for $\hat{α} = 0$ . The proof of (b) is similar to (a) and is left to the reader.□

From the above theorems, we conclude that if the data for the DMUs are fuzzy numbers, then the relative efficiency scores of the DMUs are also fuzzy numbers. Below, we define the concept of an efficient DMU in fuzzy DEA.

Definition 6. DMU_o is efficient if and only if $[{\tilde{θ}}_{o}]_{α} = \{ 1 \}$ for any 0 < α ≤ 1.

The above definition imposes a very strong condition for a DMU to be efficient (see Fig. 2). It may even happen that none of the DMUs is efficient, because E⁺⁺ can be empty when all DMUs have interval data. In the next section, we present a numerical method for the ranking of DMUs with fuzzy data.

3.2 Ranking

Definitions 5 and 6 are fundamentally theoretical criteria with respect to the efficiency score ${\tilde{θ}}_{o}$ and the efficient DMUs, but in practice ${\tilde{θ}}_{o}$ and the efficient DMUs cannot be determined exactly from these definitions.

Here we present a numerical ranking method to determine the rank order of DMUs approximately. Consider the following k real numbers

$0 < α_{1} < ... < α_{k} = 1 .$

These numbers form k nested intervals of inputs and outputs as follows:

$[{\tilde{x}}_{ij}]_{α_{l + 1}} \subseteq [{\tilde{x}}_{ij}]_{α_{l}},$

$[{\tilde{y}}_{rj}]_{α_{l + 1}} \subseteq [{\tilde{y}}_{rj}]_{α_{l}},$

$i = 1, . . ., m, r = 1, . . ., s, j = 1, . . ., n, l = 1, . . ., k - 1 .$

So from Theorems 3 and 4, we have $[{\tilde{θ}}_{o}]_{α_{l + 1}} \subseteq [{\tilde{θ}}_{o}]_{α_{l}}, l = 1, . . ., k - 1 .$

Since each α-level set of ${\tilde{θ}}_{o}$ is a closed interval, as mentioned previously, the following statements hold. If for some l′ (l′ < k):

- DMU_o ∈ E⁺⁺ for the α_{l′}-level data, then DMU_o ∈ E⁺⁺ for all levels α_l with l > l′.

- DMU_o ∈ E^- for the α_{l′}-level data, then DMU_o ∈ E^- for all levels α_l with l > l′.

Moreover, for α_k = 1, the inputs and outputs are crisp data and $[{\tilde{θ}}_{o}]_{1} = \{ {\bar{x}}_{o} \}$ with θ_o = ${\bar{x}}_{o}$ . At this level, if DMU_o is efficient (θ_o = 1), then DMU_o is in E⁺⁺ or E⁺ for all α_l < 1; otherwise (θ_o < 1), DMU_o is in E⁺ or E^- for all α_l < 1.

It must be considered that the nearer α₁ is to zero, the more exactly the efficient units are assessed. This does not mean that at least one unit will certainly be efficient. Also, after studying the ranking method below, it will be clear that the larger the number of levels α_l, with equal distances between them, the more fairly and precisely the DMUs are ranked.

Therefore, suppose α₁ > 0 is sufficiently small and the other α_l are selected as follows: $α_{l} = α_{1} + \frac{(1 - α_{1}) (l - 1)}{k - 1}, l = 2, ..., k$ hence α_k = 1.
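The equally spaced levels can be generated as in the following sketch. The function name `alpha_grid` is hypothetical, and the last level is set to exactly 1 rather than computed, an implementation choice made here to avoid floating-point round-off at α_k.

```python
from typing import List


def alpha_grid(alpha_1: float, k: int) -> List[float]:
    """Return [alpha_1, ..., alpha_k] with equal spacing and alpha_k = 1."""
    if not 0.0 < alpha_1 < 1.0:
        raise ValueError("alpha_1 must lie strictly between 0 and 1")
    if k < 2:
        raise ValueError("at least two levels are required")
    step = (1.0 - alpha_1) / (k - 1)
    # alpha_l = alpha_1 + (1 - alpha_1)(l - 1)/(k - 1) for l = 1, ..., k - 1
    grid = [alpha_1 + step * (l - 1) for l in range(1, k)]
    grid.append(1.0)  # alpha_k = 1 exactly
    return grid


grid = alpha_grid(0.2, 5)  # approximately [0.2, 0.4, 0.6, 0.8, 1.0]
```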

Definition 7. The set of DMUs that are in E⁺⁺ for the α_l-level data is denoted by $E_{α_{l}}^{++}$ ; that is, $E_{α_{l}}^{++} = \{ {DMU}_{o} \mid h_{o}^{α_{l}, L} = 1 \}, l = 1, ..., k,$ and the set of DMUs that are in E⁺ for the α_l-level data is denoted by $E_{α_{l}}^{+}$ ; that is, $E_{α_{l}}^{+} = \{ {DMU}_{o} \mid h_{o}^{α_{l}, L} < 1, \; h_{o}^{α_{l}, U} = 1 \}, l = 1, ..., k .$ Obviously $E_{α_{k}}^{++} = E_{1}^{++} \neq φ$ , and the DMUs in $E_{α_{1}}^{++}$ have the highest efficiency score. If a DMU is efficient as defined in Definition 6, then it must be in $E_{α_{1}}^{++}$ , but the reverse does not necessarily hold true.

Suppose p = 1 and J = {DMU₁, . . . , DMU_n}. There are two cases for $E_{α_{1}}^{++}$ : either $E_{α_{1}}^{++} \neq φ$ or $E_{α_{1}}^{++} = φ$ .

If $E_{α_{1}}^{++} = φ$ , then one can ask which DMUs are more efficient, the DMUs in $E_{α_{1}}^{+}$ or those in $E_{α_{2}}^{++}$ ? We claim the DMUs in $E_{α_{2}}^{++}$ are more efficient because these are the DMUs in $E_{α_{1}}^{+}$ that join $E_{α_{2}}^{++}$ for the α₂-level data. Therefore, when $E_{α_{1}}^{++} = φ$ , we turn to $E_{α_{2}}^{++}$ .

With the $E_{α_{1}}^{++} = φ$ assumption, there are also two cases for $E_{α_{2}}^{++}$ . If $E_{α_{2}}^{++} = φ$ , similarly, we turn to $E_{α_{3}}^{++}$ , and this continues until $E_{α_{l'}}^{++} \neq φ$ for some l′ (l′ = 1, . . . , k). There is certainly such an l′, because $E_{α_{k}}^{++} = E_{1}^{++} \neq φ$ in the end.

In this case, let

$E_{p} = E_{α_{l'}}^{++}, \quad J = J - E_{p}, \quad p = p + 1 .$

Now again we form $E_{α_{l}}^{++}$ and $E_{α_{l}}^{+}$ (l = 1,...,k) for the reduced set J and compute E_p similarly. This process continues until J = φ. It is repeated at most n times. In the end, we obtain E₁, . . . , E_p (p ≤ n) from the above process. Clearly the DMUs in E₁ have higher efficiency scores than the ones in E₂, . . . , E_p, the DMUs in E₂ have the same relation with the DMUs in E₃, . . . , E_p, and so on. Therefore, by arranging the DMUs on the basis of these sets we can rank them. Figure 3 shows that a DMU with fuzzy efficiency score ${\tilde{θ}}_{1}$ is more efficient than the DMU with fuzzy efficiency score ${\tilde{θ}}_{2}$ .

It is possible that E₁, . . . , E_p have more than one member, so some DMUs may have the same rank. This can be avoided by the following change in determining E₁, . . . , E_p. In each iteration, if E_p has more than one member, then by going back one level before $α_{k}$ , i.e., to $α_{k - 1}$ , we choose the DMU in E_p which has the highest $h_{j}^{α_{k - 1}, L}$ as the member of E_p and omit the other members. If E_p still has more than one member, we can go back to $α_{k - 2}, ..., α_{1}$ and choose E_p as a singleton. In the end, if E_p is still not a singleton, then we conclude that all members of E_p have the same rank order. In the following figure we can see that the DMU with fuzzy efficiency score ${\tilde{θ}}_{1}$ is more efficient than the DMU with fuzzy efficiency score ${\tilde{θ}}_{2}$ when E_p is not a singleton.

Definition 8. q denotes the rank order of a DMU when the DMU ∈ E_q, q = 1, . . . , p, after using the ranking method above.

3.3 Algorithm

Main Algorithm

Step 1- Let J = {DMU₁, . . . , DMU_n}, p = 1. Choose the first level α₁ > 0 and choose k as the number of levels and let $α_{l} = α_{1} + \frac{(1 - α_{1}) (l - 1)}{k - 1}, l = 2, . . ., k .$

Step 2- Let t = 1 and form $E_{α_{l}}^{+ +}$ by Models (5), (6) and Definition 7 for any α_l (l = 1,...,k).

Step 3- If $E_{α_{t}}^{+ +} = φ$ , then go to step 4, otherwise go to step 5.

Step 4- Let t = t + 1 and go to step 3.

Step 5- Let $E_{p} = E_{α_{t}}^{+ +}$ and go to subalgorithm E_p.

Step 6- Let J = J - E_p. If J = φ then go to step 7, otherwise let p = p + 1 and go to step 2.

Step 7- Set q as the rank order of each DMU in E_q (q = 1, . . . , p) and stop.

Subalgorithm E_p

Step 1- Let c = 0 and go to step 2.

Step 2- If E_p is a singleton then return to the main algorithm (step 6), otherwise let c = c + 1, F = E_p and go to step 3.

Step 3- If c > k - 1 then return to the main algorithm (step 6), otherwise let $E_{p} = \{ {DMU}_{j'} \mid h_{j'}^{α_{k - c}, L} = \max_{j \in F} h_{j}^{α_{k - c}, L} \}$ and go to step 2.
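The main algorithm and Subalgorithm E_p can be sketched as follows. This is an illustrative reading of the steps above, not a reference implementation. It assumes that the caller supplies a function `lower_bound(o, J, alpha)` returning $h_{o}^{α, L}$ from Model (5) relative to the DMUs currently in J, and that the tie-breaking values are also computed relative to the current J. The names `rank_dmus`, `demo_lower_bound` and `cut`, the tolerance `tol`, and the data are hypothetical; the demo bound uses the closed form that holds only for one input and one output.

```python
from typing import Callable, Dict, Sequence, Set, Tuple

LowerBound = Callable[[int, Set[int], float], float]


def rank_dmus(n: int, alphas: Sequence[float], lower_bound: LowerBound,
              tol: float = 1e-9) -> Dict[int, int]:
    """Return {DMU index: rank q} following the main algorithm and Subalgorithm E_p."""
    k = len(alphas)
    remaining: Set[int] = set(range(n))   # the set J
    rank: Dict[int, int] = {}
    p = 1
    while remaining:
        # Steps 2-5: find the first level whose E++ set is non-empty.
        for t in range(k):
            e_p = {o for o in remaining
                   if lower_bound(o, remaining, alphas[t]) >= 1.0 - tol}
            if e_p:
                break
        else:
            raise RuntimeError("E++ is empty at every level; alphas[-1] should be 1")
        # Subalgorithm E_p: break ties at alpha_{k-1}, alpha_{k-2}, ..., alpha_1.
        c = 0
        while len(e_p) > 1 and c < k - 1:
            c += 1
            level = alphas[k - c - 1]     # alpha_{k-c} in 1-based indexing
            scores = {j: lower_bound(j, remaining, level) for j in e_p}
            top = max(scores.values())
            e_p = {j for j, h in scores.items() if h >= top - tol}
        # Steps 6-7: record rank p and remove E_p from J.
        for j in e_p:
            rank[j] = p
        remaining -= e_p
        p += 1
    return rank


# Hypothetical single-input, single-output triangular data (m, delta, beta).
X = [(2.0, 0.0, 0.0), (2.0, 0.5, 0.5), (4.0, 1.0, 1.0)]
Y = [(4.0, 0.0, 0.0), (4.0, 0.0, 0.0), (4.0, 1.0, 1.0)]


def cut(tri: Tuple[float, float, float], alpha: float) -> Tuple[float, float]:
    """Alpha-level interval of a triangular number, as in (7) and (8)."""
    m, delta, beta = tri
    return m - delta * (1 - alpha), m + beta * (1 - alpha)


def demo_lower_bound(o: int, J: Set[int], alpha: float) -> float:
    """Closed form of Model (5), valid only for one input and one output."""
    worst_o = cut(Y[o], alpha)[0] / cut(X[o], alpha)[1]
    best_others = max((cut(Y[j], alpha)[1] / cut(X[j], alpha)[0]
                       for j in J if j != o), default=0.0)
    return worst_o / max(worst_o, best_others)


ranking = rank_dmus(3, [0.2, 0.4, 0.6, 0.8, 1.0], demo_lower_bound)
```

In this made-up example the first two DMUs tie at α = 1 and are separated at α = 0.8 by the subalgorithm; with real data, `lower_bound` would solve Model (5) with an LP solver.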

3.4 Application in a practical example

The performance of teachers in an Iranian educational department is evaluated based on the indices introduced in the evaluation questionnaires which are filled out by students.

Assume that the fuzzy triangular number ${\tilde{x}}_{ij} = (x_{ij}, δ_{ij}^{x}, β_{ij}^{x})$ is the i-th input value, i = 1, . . . , m, and the fuzzy triangular number ${\tilde{y}}_{rj} = (y_{rj}, δ_{rj}^{y}, β_{rj}^{y})$ is the r-th output value, r = 1, . . . , s, of the j-th DMU, j = 1, . . . , n, where $δ_{ij}^{x}$ , $δ_{rj}^{y}$ , $β_{ij}^{x}$ , $β_{rj}^{y}$ are all non-negative numbers.

If $δ_{ij}^{x}$ , $β_{ij}^{x}$ are positive numbers, then ${\tilde{x}}_{ij}$ has the following membership function in $ℝ$ :

${\tilde{x}}_{ij} (t) = \begin{cases} \frac{t - x_{ij} + δ_{ij}^{x}}{δ_{ij}^{x}} & x_{ij} - δ_{ij}^{x} \leq t < x_{ij} \\ \frac{x_{ij} + β_{ij}^{x} - t}{β_{ij}^{x}} & x_{ij} \leq t < x_{ij} + β_{ij}^{x} \\ 0 & \text{otherwise}, \end{cases}$

and if $δ_{ij}^{x} = 0$ and $β_{ij}^{x}$ is positive, then ${\tilde{x}}_{ij} = (x_{ij}, 0, β_{ij}^{x})$ has the following membership function in $ℝ$ :

${\tilde{x}}_{ij} (t) = \begin{cases} \frac{x_{ij} + β_{ij}^{x} - t}{β_{ij}^{x}} & x_{ij} \leq t < x_{ij} + β_{ij}^{x} \\ 0 & \text{otherwise}, \end{cases}$

and if $β_{ij}^{x} = 0$ and $δ_{ij}^{x}$ is positive, then ${\tilde{x}}_{ij} = (x_{ij}, δ_{ij}^{x}, 0)$ has the following membership function in $ℝ$ :

${\tilde{x}}_{ij} (t) = \begin{cases} \frac{t - x_{ij} + δ_{ij}^{x}}{δ_{ij}^{x}} & x_{ij} - δ_{ij}^{x} \leq t \leq x_{ij} \\ 0 & \text{otherwise} . \end{cases}$

One can discuss the membership function of ${\tilde{y}}_{rj}$ in a similar manner.

As previously mentioned, for α ∈ (0, 1], the α-levels of ${\tilde{x}}_{ij}$ and ${\tilde{y}}_{rj}$ are as follows:

$({\tilde{x}}_{ij})_{α} = (x_{ij}, δ_{ij}^{x}, β_{ij}^{x})_{α} = [x_{ij} - δ_{ij}^{x} (1 - α), x_{ij} + β_{ij}^{x} (1 - α)]$ (7)

$({\tilde{y}}_{rj})_{α} = (y_{rj}, δ_{rj}^{y}, β_{rj}^{y})_{α} = [y_{rj} - δ_{rj}^{y} (1 - α), y_{rj} + β_{rj}^{y} (1 - α)] .$ (8)

An important advantage of using fuzzy triangular numbers is that an α-level forming a convex bounded interval can also be defined for α = 0, since the 0-level of the fuzzy triangular number N = (m, δ, β) is $N_{0} = (m - δ, m + β) = supp (N) .$

So, for the fuzzy triangular numbers ${\tilde{x}}_{ij}$ and ${\tilde{y}}_{rj}$ , we have $({\tilde{x}}_{ij})_{0} = (x_{ij} - δ_{ij}^{x}, x_{ij} + β_{ij}^{x})$ $({\tilde{y}}_{rj})_{0} = (y_{rj} - δ_{rj}^{y}, y_{rj} + β_{rj}^{y}),$ which can also be obtained by setting α = 0 in (7-8).
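The α-level formulas (7) and (8) translate directly into code. In the sketch below the function name `alpha_cut` is hypothetical; it returns only the two endpoints, so for α = 0 it gives the endpoints of the support without distinguishing an open from a closed interval.

```python
from typing import Tuple


def alpha_cut(m: float, delta: float, beta: float, alpha: float) -> Tuple[float, float]:
    """Endpoints of the alpha-level of the triangular number (m, delta, beta)."""
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    if delta < 0.0 or beta < 0.0:
        raise ValueError("delta and beta must be non-negative")
    return m - delta * (1.0 - alpha), m + beta * (1.0 - alpha)


core = alpha_cut(5.0, 2.0, 1.0, 1.0)      # alpha = 1: the single point m
half = alpha_cut(5.0, 2.0, 1.0, 0.5)
support = alpha_cut(5.0, 2.0, 1.0, 0.0)   # alpha = 0: endpoints of the support
```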

Now, for α ∈ [0, 1], the efficiency bounds of DMU_o resulting from an α-level, $[h_{o}^{α, L}, h_{o}^{α, U}]$ , can be obtained as follows, regarding models (5-6)

$\begin{array}{l} h_{o}^{α, L} = \max \sum_{r = 1}^{s} u_{r} y_{r o} - (1 - α) \sum_{r = 1}^{s} u_{r} δ_{r o}^{y} \\ \text{s.t.} \; \sum_{i = 1}^{m} v_{i} x_{i o} + (1 - α) \sum_{i = 1}^{m} v_{i} β_{i o}^{x} = 1, \\ \sum_{r = 1}^{s} u_{r} y_{r j} - \sum_{i = 1}^{m} v_{i} x_{i j} + (1 - α) (\sum_{r = 1}^{s} u_{r} β_{r j}^{y} + \sum_{i = 1}^{m} v_{i} δ_{i j}^{x}) \leq 0, \; j = 1, \dots, n, \; j \neq o, \\ \sum_{r = 1}^{s} u_{r} y_{r o} - \sum_{i = 1}^{m} v_{i} x_{i o} - (1 - α) (\sum_{r = 1}^{s} u_{r} δ_{r o}^{y} + \sum_{i = 1}^{m} v_{i} β_{i o}^{x}) \leq 0, \\ v_{i}, u_{r} \geq ε, \; \forall i, r . \end{array}$ (9)

and

$\begin{array}{l} h_{o}^{α, U} = \max \sum_{r = 1}^{s} u_{r} y_{r o} + (1 - α) \sum_{r = 1}^{s} u_{r} β_{r o}^{y} \\ \text{s.t.} \; \sum_{i = 1}^{m} v_{i} x_{i o} - (1 - α) \sum_{i = 1}^{m} v_{i} δ_{i o}^{x} = 1, \\ \sum_{r = 1}^{s} u_{r} y_{r j} - \sum_{i = 1}^{m} v_{i} x_{i j} - (1 - α) (\sum_{r = 1}^{s} u_{r} δ_{r j}^{y} + \sum_{i = 1}^{m} v_{i} β_{i j}^{x}) \leq 0, \; j = 1, \dots, n, \; j \neq o, \\ \sum_{r = 1}^{s} u_{r} y_{r o} - \sum_{i = 1}^{m} v_{i} x_{i o} + (1 - α) (\sum_{r = 1}^{s} u_{r} β_{r o}^{y} + \sum_{i = 1}^{m} v_{i} δ_{i o}^{x}) \leq 0, \\ v_{i}, u_{r} \geq ε, \; \forall i, r . \end{array}$ (10)

The indices by which the students were asked to evaluate teachers are presented in Table 1.

For each teacher, one of the qualitative expressions: Excellent, Good, Average, Poor, and Very Poor is assigned to each index by the students.

A suitable choice of fuzzy numbers for the students’ responses is to use fuzzy triangular numbers. Table 2 shows five typical triangular numbers in the interval [0, 1], corresponding to the responses.

The above numbers are depicted in Fig. 5.

To calculate the score of a given teacher for each index, we perform fuzzy addition on all the responses to that index and then divide the sum by the number of respondents. This operation, called normalizing the data, causes the resulting scores to lie in the interval [0, 1]. The normalized data are independent of the unit of measurement and of the number of respondents. Therefore, the effect of the number of respondents differing from one teacher to another is removed.
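The normalization step can be sketched in Python as follows. This is only an illustration under stated assumptions: each response is stored as a hypothetical (center, left spread, right spread) triple, fuzzy addition is taken to add centers and spreads componentwise, and the sample values are made up rather than taken from Table 2.

```python
def fuzzy_mean(responses):
    """Average a list of triangular fuzzy numbers.

    Each response is an assumed (center, left_spread, right_spread) triple.
    Fuzzy addition sums the three components; dividing by the positive
    number of respondents scales each component by the same factor.
    """
    n = len(responses)
    if n == 0:
        raise ValueError("at least one response is required")
    center = sum(r[0] for r in responses) / n
    left = sum(r[1] for r in responses) / n
    right = sum(r[2] for r in responses) / n
    return center, left, right


# Illustrative responses to one index (not the values of Table 2).
score = fuzzy_mean([(0.5, 0.25, 0.25), (1.0, 0.25, 0.0)])
```

Because every response lies in [0, 1], the averaged score also lies in [0, 1], whatever the number of respondents.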

The indices for the evaluation of units are always divided into two categories, inputs and outputs, which are represented in Table 1.

Therefore, in the model employed for evaluating the teachers, the teachers are considered as DMUs with 11 outputs (s = 11) but no inputs (m = 0). The outputs of the j-th teacher are denoted by $y_{1j}$ to $y_{11j}$, in order of appearance in Table 1. The CCR model for this situation is as follows.

$\begin{array}{l} θ = \max \sum_{r = 1}^{11} u_{r} y_{r o} \\ \text{s.t.} \quad \sum_{r = 1}^{11} u_{r} y_{r j} \leq 1, \quad j = 1, \dots, n, \\ u_{r} \geq ε, \quad r = 1, \dots, 11, \end{array}$ (11)

where n is the number of teachers being compared. Thus, models (9-10), which determine the efficiency bounds of DMU_o (o ∈ {1, ..., n}) for a given α-level, take the following form.

$\begin{array}{l} h_{o}^{α, U} = \max \sum_{r = 1}^{11} u_{r} y_{r o} + (1 - α) \sum_{r = 1}^{11} u_{r} β_{r o}^{y} \\ \text{s.t.} \quad \sum_{r = 1}^{11} u_{r} y_{r j} - (1 - α) \sum_{r = 1}^{11} u_{r} δ_{r j}^{y} \leq 1, \quad j = 1, \dots, n, \ j \neq o, \\ \sum_{r = 1}^{11} u_{r} y_{r o} + (1 - α) \sum_{r = 1}^{11} u_{r} β_{r o}^{y} \leq 1, \\ u_{r} \geq ε, \quad r = 1, \dots, 11, \end{array}$ (12)

and

$\begin{array}{l} h_{o}^{α, L} = \max \sum_{r = 1}^{11} u_{r} y_{r o} - (1 - α) \sum_{r = 1}^{11} u_{r} δ_{r o}^{y} \\ \text{s.t.} \quad \sum_{r = 1}^{11} u_{r} y_{r j} + (1 - α) \sum_{r = 1}^{11} u_{r} β_{r j}^{y} \leq 1, \quad j = 1, \dots, n, \ j \neq o, \\ \sum_{r = 1}^{11} u_{r} y_{r o} - (1 - α) \sum_{r = 1}^{11} u_{r} δ_{r o}^{y} \leq 1, \\ u_{r} \geq ε, \quad r = 1, \dots, 11. \end{array}$ (13)

Replacing models (5-6) by models (12-13) in the algorithm of the previous section makes it possible to rank the teachers.
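For readers who wish to experiment, models (12-13) are ordinary linear programs and can be solved with any LP solver. The sketch below uses SciPy's `linprog` with the HiGHS method; this is one possible setup, not the implementation used in the paper. The function name `efficiency_bounds`, the array layout (one row per DMU, one column per output), the default value of ε, and the sample data are all assumptions made for illustration.

```python
import numpy as np
from scipy.optimize import linprog


def efficiency_bounds(y, delta, beta, o, alpha, eps=1e-6):
    """Solve the output-only models (13) and (12) for DMU `o`.

    y, delta, beta: arrays of shape (n, s) holding the centers, left
    spreads and right spreads of the fuzzy outputs (assumed layout).
    Returns (h_lower, h_upper) at the given alpha-level.
    """
    y, delta, beta = (np.asarray(a, dtype=float) for a in (y, delta, beta))
    lo = y - (1.0 - alpha) * delta  # lower endpoints of the alpha-cuts
    hi = y + (1.0 - alpha) * beta   # upper endpoints of the alpha-cuts
    weight_bounds = [(eps, None)] * y.shape[1]  # u_r >= eps

    results = []
    # Lower bound: DMU o at its worst, the others at their best;
    # upper bound: DMU o at its best, the others at their worst.
    for own, others in ((lo, hi), (hi, lo)):
        A = others.copy()
        A[o] = own[o]  # constraint row of the evaluated DMU
        res = linprog(-own[o], A_ub=A, b_ub=np.ones(len(A)),
                      bounds=weight_bounds, method="highs")
        if not res.success:
            raise RuntimeError(res.message)
        results.append(-res.fun)  # linprog minimizes, so negate
    return results[0], results[1]


# Illustrative data: two DMUs, one fuzzy output each (made-up values).
y = [[0.5], [1.0]]
delta = [[0.25], [0.25]]
beta = [[0.25], [0.0]]
h_low, h_up = efficiency_bounds(y, delta, beta, o=0, alpha=0.0)
```

Calling the function over a grid of α values for each DMU gives the interval scores that the ranking algorithm of the previous section works with.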

Now, consider a ranking to be carried out for five teachers, whose overall data are presented in Tables 3–7, according to the 11 questions in Table 1. In the last column, the score for each index is obtained for each teacher as a fuzzy triangular number, using the fuzzy valuing method in Table 2. Note that the number of respondents varies from one teacher to another, a problem which has been solved by normalizing the data.

By employing the ranking algorithm, all the teachers belong to the class E⁺ for α < 0.4, and teacher number 4 belongs to the class E⁺⁺ for α = 0.4; therefore, teacher number 4 has the first rank. Omitting teacher number 4 and employing the ranking algorithm again, teachers number 1 and 5 belong to the class E⁺⁺ for α = 0.2; by using the algorithm E_p, teacher number 5 has the second rank. Omitting teacher number 5 as well and employing the algorithm again, teacher number 1 has the third rank for α = 0.2. After omitting teacher number 1, teacher number 2 has the fourth rank for α = 0.1, and finally teacher number 3 has the fifth rank.

4 Conclusion

In real-world problems the available data may be imprecise or in fuzzy format. This type of data makes the calculation of the efficiency score more complicated. In this paper, the inputs and outputs of the decision making units are assumed to be fuzzy numbers. Under this assumption the efficiency score is itself expected to be imprecise. We treated the efficiency score as a fuzzy number and proposed a method based on α-cuts and interval data envelopment analysis to show the fuzzy structure of the efficiency score. The properties of this fuzzy efficiency score allowed us to introduce a pattern for ranking decision making units. What distinguishes this model from others is that it proposes an algorithm for ranking fuzzy efficiency scores which is based on the efficiency concept and is not valid for ranking other fuzzy numbers.

References

1. Charnes, Cooper W.W. and Rhodes, Measuring the efficiency of decision making units, European Journal of Operational Research 2 (1978), 429–444.

2. Charnes, Cooper W.W., Lewin A.Y. and Seiford L.M., Data Envelopment Analysis: Theory, Methodology, and Application, Kluwer Academic Publishers, London, 1994.

3. Dubois and Prade, Possibility Theory: An Approach to Computerized Processing of Uncertainty, Plenum Press, New York, 1988.

4. Dubois and Prade, Systems of linear fuzzy constraints, Fuzzy Sets and Systems (1980), 37–48.

5. Guo and Tanaka, Fuzzy DEA: A perceptual evaluation method, Fuzzy Sets and Systems (2001), 149–160.

6. Kahraman and Tolga, Data envelopment analysis using fuzzy concept, 28th Internat Symp On Multiple-Valued Logic (1998), 338–343.

7. Leon, Liern, Ruiz J.L. and Sirvent, A fuzzy mathematical programming approach to the assessment of efficiency with DEA models, Fuzzy Sets and Systems 139 (2003), 407–419.

8. Lertworasirikul, Fuzzy Data Envelopment Analysis for Supply Chain Modeling and Analysis, Dissertation proposal in Industrial Engineering, North Carolina State University, 2001.

9. Lertworasirikul, Fang S.C., Joines J.A. and Nuttle H.L.W., Fuzzy data envelopment analysis (DEA): A possibility approach, Fuzzy Sets and Systems 139 (2003), 379–394.

10. Sengupta J.K., A fuzzy systems approach in data envelopment analysis, Comput Math Appl 24 (1992), 259–266.

11. Zadeh L.A., Fuzzy sets as a basis for a theory of possibility, Fuzzy Sets and Systems 1 (1978), 3–28.

12. Zimmermann H.J., Fuzzy Set Theory and Its Application, Kluwer Academic Publishers, London, 1996.

13. Jahanshahloo G.R., Hosseinzadeh Lotfi and Moradi M., Sensitivity and stability analysis in DEA with interval data, Applied Mathematics and Computation (2003), 1–15.

14. Hatami-Marbini, Emrouznejad and Tavana, A taxonomy and review of the fuzzy data envelopment analysis literature: Two decades in the making, European Journal of Operational Research 214(3) (2011), 457–472.

15. Kao and Liu S.T., Data envelopment analysis with imprecise data: An application of Taiwan machinery firms, International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems 13(2) (2005), 225–240.

16. Kao and Liu S.T., A mathematical programming approach to fuzzy efficiency ranking, International Journal of Production Economics 86 (2005), 145–154.