Abstract
Efficiency assessment by using data envelopment analysis (DEA) in interval environment is studied. Two parameters with regard to the input and output are introduced to characterize the variability of the production possibility sets. The extended production facets with different production possibility sets are determined. The inclusion relation between different extended production facets is discussed, and self-evaluation models are constructed to calculate the interval efficiency of the decision-making units (DMUs) with the optimal production facet. By setting self-evaluation as a target, the aggressive and benevolent cross-efficiency models are established based on the likelihood between the values of self-evaluation and peer evaluation. The analysis of the models yields the interval cross-efficiency matrices and the weight allocation method that is more advantageous to the DMU for aggregating the interval cross-efficiency matrices. An example is used to illustrate the applications of the models.
Introduction
Management efficiency is widely used by organizations for making amendments. Charnes et al. [1] developed data envelopment analysis (DEA) to measure the relative efficiency of decision-making units (DMUs) with inputs and outputs. Owing to the rapid development of theory, DEA has recently found extensive applications [2–25].
In conventional DEA models, such as the Charnes-Cooper-Rhodes (CCR) and Banker-Chanes-Cooper (BCC) models, each DMU was evaluated by choosing the most favorable weights to itself. With this self-evaluation, each DMU was allowed to consider its own circumstances only. The results classified the DMUs into two groups: efficient and inefficient based on the Pareto facets. In general, several efficient DMUs lead to no inferior-to-superior relationship among them. Considering this limitation, Sexton et al. [2] originally proposed cross-efficiency evaluation. Unlike the conventional models, the DMU was evaluated not only by self-evaluation but also by peer evaluation. In most studies, the overall efficiency of each DMU was calculated with the optimal weights of all DMUs by the CCR model. Compared with the conventional models, cross-efficiency evaluation can effectively distinguish the priority and rank the DMUs in a unique order. Because the optimal solutions of the CCR model are not unique, it is necessary to set a secondary goal to select an appropriate solution. The benevolent and aggressive models were proposed by Doyle and Green [3]. In the former, each DMU considered the other DMUs partners and selected the weights to simultaneously maximize their efficiency and maintain its own efficiency. In the latter, each DMU maintained its own efficiency but minimized the efficiency of the other DMUs, that is, the other DMUs were considered competitors. Liang et al. [4] introduced game theory to explain the competition and cooperation among the DMUs and obtained the Nash equilibrium solution of the model. Liang et al. extended Doyle and Green’s models by considering deviations from the ideal efficiency [5]. Moreover, since not every DMU was on the Pareto facet, Wang and Chin [6] improved the models by considering deviations from the CCR efficiency instead of ideal efficiency. Wu et al. [7] extended secondary goal models for weight selection. In their approach, each DMU had desirable and undesirable cross-efficiency targets. Different models were proposed to ensure that the results cloud be simultaneously close to the desirable target and far from the undesirable target. Moreover, some secondary goal function was developed to decrease the number of zero weights [8–10]. Wu et al. [11] made a Pareto improvement model for cross-efficiency evaluation.
Owing to the development of soft computing technologies, an increasingly large number of decision-making tools have been developed to cope with the problem of uncertain data, such as interval and fuzzy data. In addition to being an important method of multi-attribute decision-making, DEA has greatly contributed in the uncertain decision-making area since Cooper et al. [12] proposed the imprecise DEA (IDEA). With the development of uncertainty theory, IDEA has gradually developed as well. In accordance with the principles of interval linear programming, Despotis and Simirlis [13] computed the interval efficiency by introducing parameters to transform the nonlinear model to a linear model, and developed a model for handling interval and fuzzy data. Wang et al. [14] used the unified production facet to measure the interval efficiency of DMU with interval data. Lee et al. [15] extended IDEA to the two-stage additive model. Based on the CCR model, Kao [16] constructed a pair of bi-level mathematical programming problems, which was transformed into an ordinary linear programming problem by productivity concepts and variable substitution tools. The objective function value of the model was the interval efficiency of DMU. Chen et al. [17] computed the interval cross efficiency with interval and fuzzy data. In particular, the fuzzy data was converted into interval data by the tolerance and α-level approaches. Mariagrazia Dotoli et al. [18] considered triangular fuzzy numbers in DEA, and the model used fuzzy triangular efficiencies to assess DMUs. This retained as much fuzzy information as possible until the final ranking stage. Hossein Azizi et al. [19] formulated the ISBM model using the same production frontier from different viewpoints. It assessed the overall efficiency of DMUs in the presence of interval data with slack values. Hatami-Marbini, Adel et al. [20] proposed a new four-step fuzzy DEA method to re-shape weakly efficient facets along with revisiting the efficiency score of DMUs in terms of perturbing the weakly efficient facets. Wei and Wang [21] extended the robust efficiency analysis procedure to the situation where precise information on some input and output data is unavailable.
In existing works about IDEA, there are two problems worth considering. Namely, most IDEA models were formulated by different constraint sets to measure the interval efficiency of each DMU with uncertain data [13, 17]. This implies that each DMU is evaluated in a different production possibility set. Therefore, different production facets are used in the process of efficiency evaluation. This is unfavorable for comparing efficiencies. In uncertain environments, production possibility sets would vary across the inputs and outputs of imprecise data. The selection of the production facets would affect efficiency assessment. Moreover, to the best of our knowledge, cross-efficiency DEA has seldom been introduced in uncertain environments. In order to obtain an overall evaluation of DMUs, the assessments should combine self-evaluation and peer evaluation. However, the original models of cross-efficiency evaluation with crisp data cannot be directly used to handle interval data. Uncertainty theory is required in order to establish a reasonable model and ranking scheme. For example, Muren et al. [20] considered imprecise and vague information by typical fuzzy logic to retain the information about uncertainty. Chen et al. [17] ranked the DMUs by minimizing the maximum regret value.
In this paper, inputs and outputs of DMUs with interval data will be considered. Two parameters are introduced to represent the different production sets, and the DMUs achieving Pareto optimal according to these production sets contribute to the production facets. It provides variable reference facets for decision makers. Moreover, the inclusion relation among them will be analyzed. The inclusion relation between different extended production facets provides a theoretical basis for self-evaluation with the optimal production facet, which was proposed by Wang et al. [14] and Azizi et al. [19]. The cross-efficiency evaluation models with interval data will be formulated based on possibility degree theory. The relation between different evaluations, which is described by the upper and lower bounds of the interval efficiencies with the definition of likelihood, makes the cross-efficiency models more reasonable, simple and easy to be used than the models in [17]. The operation (max) of two intervals is introduced to ensure that the results are consistent with conventional results by crisp data. Then, the likelihood method is utilized to aggregate the cross-efficiency matrix. The results of the aggregation will be ranked by that method.
This paper is organized as follows: In Section 2, definitions and operations of interval numbers are introduced. In Section 3, the production possibility sets and production facets are defined. In Section 4, the cross-evaluation models with interval data under the maximum production possibility set are formulated. In Section 5, the cross-evaluation matrixes are aggregated and the DMUs are ranked. Section 6 contains numerical illustrations. Section 7 concludes the paper.
Preliminaries on interval data
Let
Moreover,
To compare two interval numbers, we introduce the likelihood to express the priority between them.
It is obvious that
Some useful properties of the likelihood are as follows [24, 26]: Let
Properties (a) and (b) indicate that the two interval numbers do not overlap. Property (c) shows the relationship between the likelihood and the midpoint of the interval number. Property (d) is the transitivity of likelihood.
Mn×m (I (R)) denotes the set of all n × m matrices over I (R). In particular, let M
n
(I (R)) = Mn×n (I (R)). For A ∈ M
n
(I (R)),
It is assumed that there is a set of n DMUs with m inputs X
ij
(i = 1, ⋯ , m) and s outputs Y
rj
(r = 1, ⋯ , s). Due to the uncertainty, it is assumed that the data of the inputs and outputs are interval numbers. Hence, X
ij
and Y
ij
can be replaced with
The representative points of DMUs constitute a reference set
S (γ, δ) generates the convex cone C (S (γ, δ)), which can be expressed as follows:
Obviously, if (X, Y) ∈ C (S (γ, δ)), then ω0TX - μ0TY ≥ 0.
As is known, in DEA models, the production facet is used to reflect the optimal production status of a decision-making unit. In traditional DEA with crisp data, it is obvious that the boundary points of a production possibility set are defined as the efficient production facet, since the DMUs on the boundary of the production possibility set achieve the Pareto Optimal. However, the effective production facets of S (γ, δ) are not the optimal production state of all DMUs. In general, it is deemed necessary to determine the efficient DMU by considering the efficiency of its optimal point. Furthermore, the production reference sets are uncertain when the input and output values of DMUs are interval numbers. Consequently, it is necessary to consider whether the optimal point of DMU0 achieves the Pareto Optimal with respect to the different production reference sets.
The linear programming problems based on the S (γ, δ) are now constructed as follows:
In model (4), the point (X0 (0) , Y0 (1)) is not always included in S (γ, δ), and the constraint conditions ω
T
X
j
(γ) - μ
T
Y
j
(δ) ≥0 are only for S (γ, δ). Thus, the optimal value of model (4) will be larger than one. If
As C (S (γ, δ)) is finitely multidimensional, it can form the data envelopment of S (γ, δ). Then, the production possibility set according to S (γ, δ) is defined as follows:
Therefore, (X, Y) ∈ T (γ1, δ1), that is, T (γ1, δ1) ⊇ T (γ2, δ2). This completes the proof.
Since (X0 (0) , Y0 (1)) is not in T (γ, δ), T/ (γ, δ) can be represented as follows:
In particular, if γ = 0, δ = 1, then T (0, 1) = T/ (0, 1).
The multi-objective programming problem
Furthermore,
It is now proved that if (X, Y) < (X0 (0) , Y0 (1)), then 0 ≤ λ0 < 1. It is assumed that λ0 ≥ 1. Since
(2) As mentioned earlier, when γ = 0, δ = 1, the problem is consistent with the traditional CCR model, and the proof may be found in [28–30]. This completes the proof.
The case γ ≠ 0, δ ≠ 1 is first considered. By the proof of Theorem 4, it follows that if the optimal points of DMUs are on the positive side of the production facet, then the optimal solution of model (4) is greater than 1 and achieves the Pareto Optimal of VP(γ, δ). E (γ, δ) denotes the extended production facet composed of the DMUs that achieve the Pareto Optimality. Thus, there are different extended productions facets corresponding to different production possibilities sets. In particular, if γ = 0, δ = 1, then the definition of production facet E (0, 1) is the same as in the CCR model.
It is noted that
By the proof of Theorem 3, this shows that (X0 (0) , Y0 (1)) is in E (γ2, δ2), that is, E (γ1, δ1) ⊆ E (γ2, δ2). This completes the proof.
By Corollary 2 and Theorem 3, it holds that the DMUs in E (0, 1) are also in E (γ, δ), for all 0 ≤ γ ≤ 1, 0 ≤ δ ≤ 1. E (0, 1) represents the optimal configuration of DMUs in the current optimal production state. Thus, E (0, 1) is defined to be the best production facet.
According to Wang et al. [14], the efficiencies of DMUs are not comparable in different production possibility sets, as different production facets are adopted in the assessment. According to the conclusion of Section 3, T (0, 1) is the maximum production possibility sets, and E (0, 1) is the best production facet with T (0, 1). That is, all points of the DMUs will be included in T (0, 1), and all DMUs in E (0, 1) achieve Pareto Optimality under any condition. Therefore, for the unified and comprehensive evaluation, interval DEA models with the same production possibility set will be developed.
Self-evaluation models
Let
We formulate a pair of linear programming (LP) models to measure the interval efficiency of DMUd under T (0, 1) as follows:
The optimal values of models (10) and (11) are denoted by
In conventional peer-evaluation, each DMU always evaluates other DMUs by ensuring its best relative efficiency. The cross-efficiency of a given DMUj, j = 1, ⋯ , n, obtained with the weights of DMUd, is as follows:
However, an adjustment is required for DMUs with interval data. In that case, the best relative efficiency is the interval efficiency, unlike in the case of DMUs with crisp data. Therefore, the cross efficiency of the given DMU should be interval data. Let
In the existing cross-efficiency model with crisp data, the peer-evaluation of a DMU is equal to its self-evaluation. Theorem 5 will show the consistency between the interval data and the crisp data.
It is obvious that
Similarly,
That is,
By Theorem 6, the position relationship of Position relationship of 
Then
Obviously, the above equations show that if
Theorem 7 may be used to explain the value of the likelihood
By the definition of cross-efficiency, secondary goal models must be developed to solve the non-unique optimal weights problem in DEA self-efficiency assessment. Since Doyle and Green [3] proposed secondary goal models (benevolent and aggressive), these models have been extended by various methods [4–11]. Liang et al. [5], Wang [6], and Wu et al. [7] proposed target efficiency and compared it with cross-efficiency, and the maximization or minimization of the gap between them was used to set up the secondary goal programming. Even though the quadratic programming models mentioned above were used for crisp data, the ideas of these models can be used to handle interval data as well. In this paper, the CCR interval efficiencies of the DMUs are set as their target efficiencies. According to Theorem 7, the likelihood
The constraint conditions
To a certain extent, the results of models (14) and (15) can be used to indicate the prior relationship between cross efficiency and CCR interval efficiency. Based on the prior relationship, a model will be formulated for aggregating the cross-efficiency matrix.
As a result of the above models, there are n cross-efficiency scores for each DMU, which form cross-efficiency matrixes. In traditional models, the average cross-efficiency score for each DMU as
Let
The optimal solution of model (17) is
By property (b) of the likelihood, τ
d
= 0 indicates that
Let w1, w2 ⋯ , w n be the aggregation weights computed by model (17). The cross-efficiency value for each DMU by aggregating can be expressed as follows:
By aggregating the cross-efficiency matrix, we obtain the efficiency vector
The steps of the cross-efficiency method based on likelihood are as follows.
In this section, the feasibility of the above methods is proved through the analysis of a numerical example from Kao and Liu [32]. The example is about 24 banks in Taiwan. The commercial banks could be regarded as financial intermediaries whose main business is to borrow funds from depositors and lend to others. There are three input indicators and three output indicators, as follows:
x1= Total deposit;
x2= Interest expense;
x3= Non-interest expense;
y1= Total loans;
y2= Interest income;
y3= Non-interest income.
The data set for the 24 banks is shown in Table 1.
Interval forecasts of financial data for the 24 commercial banks in Taiwan
Interval forecasts of financial data for the 24 commercial banks in Taiwan
In Table 1, “L” is the lower limit of the interval number, and “U” is the upper limit of the interval number.
The three inputs and three outputs are expressed in intervals rather than single values due to their uncertain nature. In order to illustrate the change of extended production facets, four different production possibility sets are chosen. The best relative efficiency of the optimal points of the 24 DMUs are shown in Table 2. The third, fifth, seventh, and ninth columns indicate whether the DMU attains the Pareto Optimal under the different production possibility sets. It follows that E (1, 0) consists of all DMUs, which implies that each DMU has reached the Pareto optimal by taking the worst points of all DMUs as a reference. It is clear that the results of this evaluation are meaningless. Similarly, it is noted that E (0.5, 0.5) is composed of DMU1, DMU2, DMU3, DMU4, DMU5, DMU6, DMU9, DMU10, DMU11, DMU12, DMU13, DMU14, DMU16, DMU17, DMU18, DMU19, DMU20, DMU22, and DMU24; E (0.1, 0.8) is composed of DMU1, DMU3, DMU4, DMU6, DMU9, DMU10, DMU12, DMU13, DMU14, DMU16, DMU17, DMU18, DMU19, DMU20, DMU22, and DMU24; E (0, 1) is composed of DMU3, DMU4, DMU6, DMU9, DMU10, DMU12, DMU13, DMU14, DMU16, DMU17, DMU19, DMU20, and DMU22. Therefore, E (1, 0) ⊇ E (0.5, 0.5) ⊇ E (0.1, 0.8) ⊇ E (0, 1), and E (0, 1) is the best production facet.
Relative efficiency under different production reference sets
In this paper, self-evaluation of DMU is based on the best production facet. Based on self-evaluation, we introduce benevolent and aggressive models to select the appropriate weights for evaluating other DMUs. The evaluation results of the 24 banks are presented in matrix form: the benevolent cross-evaluation matrix and aggressive cross-evaluation matrix. More weight will be distributed to the DMU whose peer evaluation is closer to the self-evaluation. Thus, the cross evaluation matrices are aggregated by model (17), and the results are shown in the second and the fourth columns of Table 3. Moreover, we compute the average score of the cross-evaluation matrix. The data in the third column is the average vector of the benevolent cross-evaluation matrix. The average vector of the aggressive cross evaluation matrix is shown in the fifth column of Table 3. As shown in Table 3, the results of aggregation and the average are not significantly different in the benevolent cross-evaluation matrix. However, the difference between the fourth and the fifth columns is greater. This is due to the fact that the DMU regards peers as competitors and may provide poor peer evaluation. For example, τ2 is 10.836 and τ19 is 1.172, which implies that the peer evaluation given by DMU2 is far lower than self-evaluation, whereas DMU19 provides the peer evaluation that is closest to self-evaluation. Hence, DMU19 should have the largest aggregation weight, in contrast with DMU2, whose cross-efficiencies should be given the lowest aggregation weight.
Results of different aggregates of cross-efficiency matrices
After effective aggregation, the cross-efficiency matrix is transformed into vector form. Notably, the self-evaluation efficiency vector is measured by the maximum production possibility set. Table 4 shows the various efficiency vectors and sorting of the 24 banks by likelihood ranking methods. As is known, when the input and output of DMUs is real data, the results of the cross-efficiency models will satisfy the following properties: (I) Self-efficiency is higher than cross-efficiency for every DMU; (II) the diagonal elements of the cross-efficiency matrix are the n self-efficiencies. These two properties reflect that self-evaluation efficiency is higher than peer evaluation, and the results of self-evaluation remain unchanged in peer evaluation. These are the basic principles of peer evaluation. Therefore, the cross-efficiency values are best to satisfy these two properties by interval numbers. In this paper, Theorem 6 shows that the upper and lower bounds of interval efficiency of each DMU are smaller than the values obtained by the self-evaluation model. This conclusion is consistent with property (I). Moreover, Theorem 5 shows that the diagonal elements of both the benevolent and aggressive cross-efficiency matrices are the self-efficiency vector. This conclusion is in accordance with property (II). Therefore, the cross-efficiency matrix of both the aggressive and benevolent models satisfy the two properties. Furthermore, it follows from Table 4 that the upper and lower limits of interval cross-efficiencies of DMUs computed by benevolent models are larger than those obtained from aggressive models. This shows that the benevolent (resp., aggressive) model is appropriate for maximizing (resp., minimizing) the interval cross efficiency of the other DMUs. There is large difference in ranking orders between the benevolent and aggressive models. For example, even though little difference is found in the interval efficiencies of DMU13 between the benevolent and aggressive models, DMU13 is ranked 14th by the aggressive model, whereas it is ranked 21st by the benevolent model. This shows that different evaluation strategies will result in different ranking orders. Therefore, the decision-makers have more flexibility in choosing their models according to their decision preferences.
Comparison of efficiency in different models
There is a connection between the α-level based approach of fuzzy DEA category and interval DEA category since α-level based approach generally converts the fuzzy DEA model into a pair of parametric models for calculating the lower and upper bounds of the efficiency at a given α-lever, that is to say, interval DEA models can be a special case of the α-level based approach of fuzzy DEA models for a given α-level. So the results obtained from the method in this paper are compared with the results obtained by Chen’s method [17]. The conclusions are as follows: In this paper, the models are proposed from both aggressive and benevolent perspectives. Compared with Chen’s model, our evaluation methods are more comprehensive. For example, as shown in Table 4, DMU13, DMU14, DMU18, and DMU20, are differently ranked by the aggressive and benevolent models. On the contrary, the results in Chen’s model do not reflect this difference. From Table 4, it follows that the interval cross efficiencies calculated by Chen’s model do not fully satisfy property (I). For instance, the lower bounds of interval efficiencies of DMU1, DMU2, DMU3, DMU4, DMU10, DMU13, DMU16, DMU18, DMU19, DMU22, and DMU24 are higher than the corresponding results of the self-evaluation. Moreover, the diagonal elements of the cross-efficiency matrix calculated by Chen’s model do not correspond to the self-efficiency vector. This implies that the results of Chen’s model do not conform with property (II). On the contrary, in this paper, the interval cross efficiency has been redefined and the models have been constructed by considering properties (I) and (II). This ensures that the models do not violate the basic principles of peer evaluation.
From the above it follows that the maximum production possibility set contains all points of DMUs, and the corresponding production facet contains the least effective DMUs. Therefore, it is more fair and reasonable to evaluate the self-efficiencies of DMUs under the maximum production possibility set. The aggressive and benevolent cross-efficiency models, which are set up by making the self-evaluation efficiencies as targets, can be consistent with the properties of the traditional cross efficiency model. Moreover, they can effectively discriminate the DMUs and evaluate the DMUs from different perspectives.
Many DEA models have been developed to evaluate the performance of DMUs with interval inputs and outputs. In general, when the input and output of DMUs are interval data, the production conditions are variable, that is, the production possibility sets are variable. In the conventional setting with the production possibility set in a fixed manner, the models are unable to reflect the efficiency changes and characterize the facets of DMUs. When the production possibility set is no longer well-defined, the results are paradoxical and self-evaluation and peer evaluation of DMUs are not well conducted. Moreover, the conventional models for interval data have only been proposed to evaluate DMUs by themselves, and seldom consider peer evaluations.
In this paper, the theoretical framework for interval DEA models is enriched as follows: (1) The input and output parameters are introduced to characterize the variability of production possibility set, and extended production facets are determined according to the different sets. Furthermore, the relationship of the different facets is discussed. It follows that as the number of points in the production possibility set increases, fewer effective units are evaluated. Therefore, it is more reasonable to evaluate the decision units in such a facet according to the production possibility set with maximum number of points. Thus, the interval self-efficiencies of DMUs are calculated under the maximum production possible set. (2) Aggressive and benevolent cross-efficiency models are established by making the self-evaluation efficiencies as targets. According to the interval efficiency value, the likelihood is used to reflect the difference between the self-evaluation and the peer evaluation. The operation (max) of two intervals is introduced to ensure that the results are consistent with conventional results by crisp data. The pairs of cross-efficiency models constructed from both maximization and minimization of the difference values between the self-evaluation and the peer evaluation can provide more comprehensive and reasonable evaluation for decision makers. Moreover, the weight allocation method, which is more advantageous to DMU, is used to aggregate the interval cross-efficiency matrices.
An example is used to illustrate the models. The results not only explain the effect of the parameter selection on the extended production facets, but also demonstrate the applicability of our cross-efficiency models, which provides more choices for the decision-makers. Therefore, the benevolent and aggressive models with interval data in this paper can be seen as improvements and extensions of the traditional cross-efficiency models with real data. This makes them meaningful contributions to uncertain DEA cross-efficiency evaluation.
Finally, interval efficiency provides a more comprehensive assessment of DMU than the traditional DEA efficiency. Therefore, it is expected to be more widely applied in even more diverse contexts. Notably, the variability of production possibility sets can be considered with the returns to scale. The model can also be extended to a stochastic model by considering that the distribution of the points in the interval is random. This is omitted due to space limitations.
Footnotes
Acknowledgments
This research is supported by National Natural Science Foundation of China (Nos. 71371053 and 71471125), Humanities and Social Science Foundation of the Ministry of Education (No. 14YJC630056) and Fujian Provincial Education Department (No. Jb14001).
