Abstract
Data Envelopment Analysis (DEA) is the most popular mathematical approach to assess efficiency of decision-making units (DMUs). In complex organizations, DMUs face a heterogeneous condition regarding environmental factors which affect their efficiencies. When there are a large number of objects, non-homogeneity of DMUs significantly influences their efficiency scores that leads to unfair ranking of DMUs. The aim of this study is to deal with non-homogeneous DMUs by implementing a clustering technique for further efficiency analysis. This paper proposes a common set of weights (CSW) model with ideal point method to develop an identical weight vector for all DMUs. This study proposes a framework to measuring efficiency of complex organizations, such as banks, that have several operational styles or various objectives. The proposed framework helps managers and decision makers (1) to identify environmental components influencing the efficiency of DMUs, (2) to use a fuzzy equivalence relation approach proposed here to cluster the DMUs to homogenized groups, (3) to produce a common set of weights (CSWs) for all DMUs with the model developed here that considers fuzzy data within each cluster, and finally (4) to calculate the efficiency score and overall ranking of DMUs within each cluster.
Introduction
Banking system is the pillar of the economy in most countries and has a significant role in developing the economic and financial systems by mobilizing small deposits of people in the household sector and directing them towards productive uses in the industrial sectors. For example, growth and development of the banking industry and financial markets pave the way for many organizations to be more innovative by receiving sufficient finance from the banking sector and improving the operational efficiency [1]. Therefore, evaluating the performance of the banking system helps to achieve higher economic growth [2]. There is wide ranging research available that has looked at the efficiency of the commercial banking industry [3, 4]. Banks need to boost their efficiency to remain competitive. Analyzing efficiency stimulates the banks to move forward by looking at the best performing branches [5]. A citation-based systematic literature review on banking sector performance by Ahmad et al. [6] demonstrates that studies in this realm widely have employed stochastic frontier analysis (SFA) as a parametric approach, whereas data envelopment analysis (DEA) is prevalent in nonparametric approach to measure the banking sector performance and efficiency. For example, recent studies such as Tavana et al. [7], Barros and Wanke [5] and Kiani Mavi et al. [8] have focused on the efficiency analysis using DEA models.
There are basic assumptions in DEA such as measuring inputs and outputs with real values and definite factors, while in practice the values of inputs and outputs might include missing data, judgmental data, forecasted data, or generally imprecise data [9]. Attributing a crisp value for a judgmental variable is difficult because of ambiguity, limitations, and complexity of them. To deal with this shortcoming, fuzzy DEA models have been developed in which data of inputs/ outputs are expressed by fuzzy linguistic variables [10]. Hatami-Marbini et al. [11] presented a taxonomy and review of the fuzzy DEA methods and presented a classification scheme in this regard.
On the other hand, conventional models of DEA split decision-making units (DMUs) into two categories of efficient DMUs with the efficiency score of 1 and inefficient DMUs with the efficiency score of a value between [0,1). Thus, conventional models do not distinct efficient units and cannot rank them. Several methods including common set of weights (CSW) have been developed to completely rank the efficient DMUs in DEA. The flexibility in determining the weights by each DMU to maximize its efficiency score makes it difficult or even sometimes impossible to rank all DMUs based on the efficiency scores especially when many DMUs obtain the efficiency score of one or very close to one which leads to low discrimination among DMUs 12]. Thus, CSW procedure diminishes the disbandment of the optimal weights allocated to the input and output variables by each DMU. The CSW approach essentially obtains a common compromised set of weights in lieu of different sets of weights for assessing the efficiency of DMUs, so evaluates the efficiency of DMUs with a common set of weights. Therefore, DMUs are not allowed to allocate different weights to their inputs and outputs. Common weight models make it possible to compare and rank all DMUs with the same optimal weights [13, 14].
Conventional DEA models accept that all DMUs are homogenous and operate in a similar operating environment [8]. However, in real-world situations, different environmental variables should be considered in the assessment process. While environmental variables challenge homogeneity of DMUs, DEA studies have paid relatively less attention to them. Athanassopoulos [15] points out that while bank branches are similar in the operations, they meet different environments in the market. Therefore, not considering the factors outside the control of branch manager will reduce their efficiency score in the performance evaluation plan. So, factors that make branches different should be taken into account and similar branches should be clustered to achieve homogeneous sets. Then, performance evaluation of DMUs should take place to the homogeneous set of DMUs. In addition, to deal with the high number of decision makers and performance criteria which cause higher complexity and vagueness of decision-making process, clustering of DMUs is essential [16]. Overall, clustering and DEA are integrated to each other via two different approaches. First, the results of clustering are integrated to the classification resulted from DEA to create several reference subsets from the initial reference set of DMUs. Second, the efficiency score of a DMU is not defined by the peer group, but it is defined by an efficient subset of the same peer subset. Subsequently, this method reaches multiple homogeneous subsets by measuring similarity among units, thus, every DMU is compared with its peers in the given subset [17].
Considering the above-mentioned issues, this study contributes to the existent literature by providing a new mathematical model to resolve them. Therefore, major contributions of this paper are as follows: This paper proposes a novel common weight model based on ideal point method (CSW-IP) under the fuzzy environment to overcome the problems of ranking efficient DMUs and fuzzy data altogether. The proposed approach can easily define the best performing DMU while traditional DEA models are not able to do so. This study identifies the environmental factors that influence the efficiency of bank branches. Then, the efficiency of bank branches in each cluster is measured utilizing the proposed model to provide more realistic results.
The remainder of this paper is organized as follows. Section 2 reviews the literature of DEA and clustering method. Section 3 presents the proposed CSW-F-IP model. Section 4 illustrates a real case to show the applicability of the proposed model. Section 5 discusses the advantages of the proposed approach. Finally, conclusion and direction for future studies are drawn in Section 6.
Literature review
Non-homogeneity among DMUs in efficiency analysis
Data envelopment analysis (DEA) is a technique to evaluate the relative efficiency of a set of homogeneous decision-making units (DMUs). Homogeneity means that all DMUs utilize the same inputs to generate the same outputs in different amounts from one DMU to another [18]. For example, manufacturing plants in the same industry as DMUs might not all utilize similar inputs to produce the same products. Fair comparison of DMUs in terms of efficiency analysis in the absence of homogeneity is an issue for real world situations [19]. By homogeneity, all DMUs must obey three conditions: first, DMUs should execute the same processes; second, their efficiency should be evaluated by the same input and output variables means that DEA model applies equally to all DMUs; third, all DMUs operate within the same environment under the same conditions [20]. However, in real world practice, these hypothetical assumptions are not always applied. Undoubtedly, to compare performance of two bank branches in different cities, decision makers will face environmental heterogeneity that should be considered as an environmental variable, which reveals economic, societal, political, environmental, legal, and cultural differences among different environments. For example, Dyson et al. [21] and Athanassopoulos [15] have emphasized on the environmental variables especially at service sectors in which identification, definition, and evaluation of these variables are far more difficult. Heterogenous preferences in group decision making problems may also stem from decision makers. Chen et al. [22] reviewed three types of fusion approaches addressed the integrating and aggregating the heterogenous preference structures. Despite successfully identification of heterogenous variables in terms of environmental differences among DMUs, accuracy of the inputs and outputs and efficiency scores is not guaranteed. This issue is more important when decision makers should express their opinion on variables, which adds a new heterogeneity to the model [22].
A variety of DEA-based techniques have been developed to handle different environmental conditions. Haas and Murphy [23] mentioned two approaches to manage non-homogeneity among DMUs. The first approach is to partition DMUs into homogeneous clusters. The second approach is to accommodate for non-homogeneity. Saen et al. [24] used analytic hierarchy process (AHP) chance-constrained DEA for calculating the relative efficiency of slightly non-homogeneous DMUs. Cook et al. [25] proposed hierarchical DEA to handle the non-homogeneity of DMUs by measuring efficiency at multiple levels. They delivered a technique for modifying the ranking of DMUs at one level to account for the rankings obtained by the groups (which these DMUs belong to) at a higher level. It should be noted that they classified non-homogeneous DMUs into different groups and evaluated them in different models. A two-step process is commonly used to manage the non-homogeneity issue in DEA [26, 23]. DEA efficiency values are obtained in the first step, then in the second step, other sources of non-homogeneity that were not considered in the original modeling are taken into account to regress the DEA efficiency scores. When environmental factors cause non-homogeneity, they are considered in a single model as non-discretionary inputs. Therefore, different reference sets are defined to discriminate DMUs in different environments [28]. In DEA, the approaches used to deal with non-homogeneity, are classified as the (i) frontier separation, (ii) all in-one, (iii) two-stage, (iv) environmental harshness index approaches [29]. Although DEA models with non-discretionary factors such as distinguishing the internal and externa non-discretionary variables [30] have been developed, there is no generally accepted approach for reflecting non-discretionary variables in DEA models. Therefore, this study classifies DMUs based on non-discretionary variables then evaluate them with DEA.
Clustering
Clustering is a subset of multivariate statistical techniques that finds hidden patterns in a set of data. It divides a set of data (operating units) into different subsets during the process of grouping them according to pre-defined criteria so that similar objects are grouped in a distinct cluster. Clustering is usually done by maximizing the similarity of objects within each cluster [31, 32].
Generally, clustering methods are divided into the following groups: hierarchical clustering, mixture-model clustering, learning network clustering, objective-function-based clustering, partition clustering, distance-based clustering, spectral clustering, kernel-based clustering, and graph-based clustering [17, 32]. Distance-based algorithms are the most common algorithms that minimize the overall dissimilarity of objects. They make an initial cluster and then use a repeating movement technique to maximize the overall similarity of objects within clusters or minimize the overall dissimilarity by transferring the objects from one cluster to the other cluster. The most common methods used for this type of clustering includes K-means, fuzzy C-means (FCM), Possibilistic C-means (PCM) and fuzzy equivalence relation (FEC) [17, 32] which C-means algorithm is the most practical approach. In this algorithm, the number of clusters should be pre-identified and the results of classification significantly rely on the selection of the threshold values. The issue for decision makers is the difficulty of judgment about the classification when they have limited information. Identifying the number of intended clusters in FCM method is difficult. To decrease the complication of calculations, fuzzy clustering method is more appropriate when uses equivalence relation for dividing DMUs of a large set into multiple smaller sets. Due the existence of ambiguity and limitations of data or complexity of decision-making factors, doing relative preferences with crisp numerical values might lead to inaccurate results. In contrast, modelling linguistic variables with fuzzy sets can flexibly express the relative preferences of decision makers. Therefore, clustering with fuzzy equivalence relation (FEC) has been considered as a method which provides more appropriate classification results, and also solves the problem of clusters’ overlapping by applying max-min transitive closure approach [28]. Samoilenko and Osei-Bryson [23] and Paradi et al. [34] integrated DEA with clustering including.
Fuzzy equivalence relation (FEC) algorithm
Fuzzy equivalence relation method supposes that a system comprises a group of criteria C = (C1, C2, …, Cm), and especial mutual relations are identified to modelling with mathematical relations. We consider a group of experts (executives or managers) who have a substantial role in establishing strategy map for an organization and are knowledgeable about criteria, alternatives, and their influence on the goal, in order to evaluate alternatives against criteria. After gathering experts’ evaluations, all matrices are aggregated as an average matrix. Therefore, the solution procedure of Wang [33] is extended with this point of view:
Step 1. Determine decision objective. Identify the relevant criteria to evaluate the pros and cons of alternatives to ensure the goal is achieved. A group of experts, K, should be formed to evaluate alternatives against the criteria to achieve the goal.
Step 2. Generate the initial direct influence matrix. Measuring the influence of each alternative against the criteria requires designing a comparison scale, for example, seven levels of influence as 1 (very low), 2 (low), 3 (between low and medium), 4 (medium), 5 (between medium and high), 6 (high), and 7 (very high) influence. An initial direct influence matrix A is a matrix constructed by the pairwise comparisons done by the expert k, in which a ji is the extent to which the alternative j is evaluated against the criterion (variable) i in order to achieve the goal, i.e. A k = [a ji ] n×m.
Step 3. Convert the linguistic terms into fuzzy linguistic scale and transform them into fuzzy numbers as shown in Table 1.
The linguistic terms and corresponding fuzzy numbers
The linguistic terms and corresponding fuzzy numbers
Step 4. Completing step 3 leads to K fuzzy initial direct influence matrices shown by
Here, a preliminary of mathematics of fuzzy numbers is presented [33]:
Definition 1. A triangular fuzzy number (TFN) is described as a triplet (a
l
, am, a
r
). If à is a TFN, its membership function is defined as (2).
Assume à and
Step 5. Let
Step 6. Considering the data matrix D, the fuzzy binary relation between any two fuzzy sequences is defined as follows:
Assume
Step 7. Binary relation between fuzzy sequences
where
and
Because R meets reflexive and symmetric conditions, i.e. μ R (i, j) = 1 and μ R (i, j) = μ R (j, i), respectively, therefore, the fuzzy binary relation R will be fuzzy compatible relation.
Definition 2. In the n × n fuzzy relation matrix = [r
α
(i, j)] n×n, 0 ⩽ r
α
(i, j) ⩽ 1 represents the relationship between two sequences i and j, and 0 ⩽ i, j ⩽ n. Matrix A is reflexive, if r
α
(i, j) = 1 for 0 ⩽ i ⩽ n, and it is symmetric, if r
α
(i, j) = r
α
(j, i). The matrix A is named max-min transitive, if
Definition 3. When a fuzzy relation matrix satisfies both reflexive and symmetric conditions, then it will be a fuzzy compatible matrix.
Definition 4. A fuzzy equivalence matrix is a fuzzy compatible matrix which meets max-min transitive.
Step 8. R will be reflexive and symmetric but might not be transitive. Therefore, R is just a fuzzy compatible relation, not a fuzzy equivalence relation. When fuzzy compatible relation is used to clustering, fuzzy data sequences might have some overlaps. To remove the overlap, the compatible relation must be converted into a fuzzy equivalence relation by an approach such as max–min transitive closure. the max–min transitive closure R* of R is presented as (12-13):
and
Integrating above-mentioned relations, leads to the revised R* and R n :
μ
R
n
(i, j) ⩾ μ
R
*
(i, j); so that, R* ⊆ R
n
derived by following definitions if
Definition 5. An n × n fuzzy relation matrix B = [r β (i, j)] n×n contains the matrix A, indicated as A ⊆ B, where r α (i, j) ⩽ r β (i, j) for all i, j.
Definition 6. When a fuzzy equivalence matrix B contains a fuzzy compatible matrix A and is included by any fuzzy equivalence matrix that contains A, then it is called the equivalence closure.
Step 9. In addition, R*, based on R and meeting transitive condition, is a fuzzy equivalence relation. With R* and a threshold value, λ, linguistic data sequences are clustered into clusters. Therefore, after obtaining R*, the appropriate threshold value λ should be identified. Furthermore, R n can replace for R* in clustering.
Definition 7. Suppose that
According to definition 7, R* ={ (i, j) |μ
R
*
(i, j) ⩾ λ } and 0 ⩽ λ ⩽ 1. By
If μ R * (i, j) ⩾ λ, then ith sequence and jth sequence belong to a same cluster; otherwise, the two sequences will be members of two separate clusters.
Because R* is a fuzzy equivalence relation, then
Definition 8. A cluster of S shows a group of separate subsets, like {S1, S2, … , S n }, in a way that the union of these subsets comprises the entire set S. In other words, S1 ∪ S2 ∪ … ∪ S n = S and S i ∪ S j = ∅ , ∀ i ≠ j.
Definition 9. Assume that P R λ is a cluster of S resulted from an equivalence relation R λ . Define the elements x, y ∈ S to be in a subset S i of S as (x, y) ∈ R λ , and S i ∈ P R λ .
Step 10. According to the validation index of partition
where
Partitioning data results in inter-cluster (between clusters) and intra-cluster (within cluster) relations. Normally, a specific clustering is preferred to the other one when it has a low inter-cluster relation and a high intra-cluster relation [33]. It is clear that when the number of data rises, the number of clusters increases which brings more inter-cluster and intra-cluster relations. Validation index considers these two relations and a higher validation index shows a better partitioning which means a higher intra-cluster relation between objects and a lower inter-cluster relation between objects. There is no upper bound for validation indices and they rise by the number of objects.
Data Envelopment Analysis (DEA) is a non-parametric fractional mathematical programming approach which divides the weighted sum of outputs by the weighted sum of inputs to measure the relative efficiency of a homogeneous group of decision-making units. Primarily, DEA is run to compare the relative efficiency of a group of homogeneous set of decision-making units and implemented in many contexts such as airports, hospitals, university departments, schools, industries and sectors, banks, products and services, and technologies [36, 8].
The CCR model
Consider a group of n DMUs where each DMU
j
, (j = 1, … . . , n) generates s different outputs utilizing m different inputs that are denoted by y
rj
, (r = 1, …, s) and x
ij
, (i = 1, … . . , m) respectively. We assume x
ij
and y
rj
are all positive [37]. For every DMU
j
under evaluation, the efficiency score E
j
can be calculated by the CCR input oriented multiplier model (16):
Subject to: (16)
j= Number of decision-making units (DMUs)
i= Number of inputs used by the DMUs
r= Number of outputs generated by the DMUs
DMU j =jth decision-making unit
E j = Efficiency score of DMU j
y rj = Amount of output r generated by DMU j
x ij = Amount of input i used by DMU j
u r = Weight assigned to the output r
v i = Weight assigned to the input i
The conventional DEA models such as CCR, need exact (crisp) values for inputs and outputs; however, crisp data might not always be attainable in real-world situations. Sometimes in real settings, inputs and outputs contain missing data, judgmental data, or predictive data or imprecise and vague data. One way to manage uncertain input and output data is to use fuzzy numbers to extend DEA models [38–40].
As CCR model calculates different set of weights for each DMU to maximize its relative efficiency, thus ranking efficient DMUs is not possible with this model. Scholars solved this problem using various methods [41] and it is noteworthy that one of the most popular techniques for assessing and ranking efficient DMUs is common set of weights (CSW) technique [42]. Different common weight models are developed including maximizing the efficiency score of the set of efficient DMUs using CWA-methodology [41], utilizing an auxiliary vector to transform the multi-objective fractional linear programming (MOFP) [43], regression analysis and nonlinear regression models [14], considering ideal and anti-ideal DMUs ([7, 44]), goal programming (GP) [45–47], compromise programming [48], and displaced ideal methodology [49].
Ideal point method
Definition 10. The (virtual) ideal DMU is a DMU that its inputs are at the minimum level and its outputs are at the maximum level among all of the DMUs. The ideal DMU is shown with
Subject to: (17)
From among different types of fuzzy numbers, triangular fuzzy numbers (TFNs) are more popular. In the sequel, the values of inputs and outputs of DMUs will be triangular fuzzy numbers.
In the simplest situation where there are one input and one output, the efficiency frontier of fuzzy CCR model will be as Fig. 1. In Fig. 1a, the DMU consumes fuzzy input

Efficiency frontier of fuzzy CCR.
In Fig. 2, the vertical axis shows the weighted sum of s outputs and the horizontal axis depicts the weighted sum of m inputs. Line “

Gap analysis for DMUs below the virtual ideal DMU.
The objective is to calculate an optimal set of weights
Therefore, the common weight model using ideal point method with fuzzy inputs and outputs (CSW-F-IP) is proposed as (18):
Subject to: (18)
Where, ‘
Subject to: (19)
Model (19) is a fuzzy multiplier linear programming and cannot be solved by standard linear programming solvers because of the fuzzy numbers. Four categories of approaches to solve fuzzy DEA are (1) the tolerance technique, (2) the α-cut technique, (3) the fuzzy ranking technique, and (4) the possibility technique. The α-cut technique is more commonly used to solve fuzzy DEA model [40, 51]. While more linear programs should be solved choosing the α-cut approach, we selected this technique because of two main reasons: (1) this approach allows modelling crisp values of inputs and outputs as triangular fuzzy numbers if any of those data are found to be crisp, and (2) owing to the extension principle, this approach transforms the fuzzy DEA model into a crisp DEA model which is easily solved as a linear program [52].
Thus, this method is applied to solve (19). First, introduce α-cuts of the objective function and constraints in (19). Then Model (19) transforms into (20):
Subject to:
All the coefficients in (20) are intervals which imply that Model (20) is a distant problem. The following steps are taken to solve (20):
Step 1. Consider the variables
Step 2. Substituting these variables in (20) then:
Subject to: (21)
Model (21) is a crisp nonlinear programming problem where α ∈ (0, 1] is a parameter and the optimal solution for different values of α can be found.
Definition 11. A DMU
j
is efficient at given α ∈ (0, 1] if
Nonlinear programming (21) is converted into linear programming problem (LP).
Step 3. Substitute the variables
Subject to: (22)
Model (22) is a crisp linear programming problem where α ∈ (0, 1] is a parameter and we find an optimal solution for each value of α. The variable transformations in step (3),
Since the available real-world data of inputs and outputs are uncertain in most cases, therefore, the proposed common set of weights approach is more appropriate in such situations. Parameter α can adopt different values between [0,1]. When the value of α is smaller, the support of the membership function becomes wider. The wider the support of the membership function, the higher the uncertainty. That is, α = 1 shows the most certain efficiency measurement while α = 0 shows the most uncertain efficiency measurement. On the other hand, efficiency values obtained at α = 1 would equal the efficiency score of DMUs when their inputs and outputs are crisp values. From risk management perspective, risk averse decision makers prefer higher values of α to have the higher certainty while risk takers prefer lower values of α.
Data and variables illustration
To follow proposed method for clustering and evaluating efficiency using DEA, several environmental criteria have been considered. The identification and classification of these criteria have been conducted based on both previous research such as Yang [53], Shyu and Chiang [54], Paradi et al. [55], Paradi and Zhu [56], Frederick [57], Katircioglu et al. [58], Al Shaher et al. [59] and experts’ opinions. Effective environmental criteria are shown in Table 2.
Effective environmental criteria on bank branch efficiency
Effective environmental criteria on bank branch efficiency
*Note: BTW: Between low and medium; BMH: Between medium and high; BAH: Between average and high; BAG: Between average and good; BAL: Between average and large; BBA: Between bad and average; BSA: Between small and average; BLM: Between low and medium; BFA: Between few and average; M: Medium; A: Average; VL: Very low; VH: Very high; VB: Very bad; S: Small; VS: Very small; L: Large; VG: Very good; LV: Very large; B: Bad; G: Good; LO: Low; MA: Many; SM: So many; F: Few; H: VF: Very few; High; P: Public; N-P: None-public; E: Employee; B: Business; I: Industrial; G: Government; IN: International
Selecting inputs and outputs is a critical part of performance analysis, as it includes evaluating different aspects of bank branches performance. Certainly, since banks deliver several products and services, it is too difficult to consider the whole range of banking activities. The European banking industry considers the intermediation approach which acknowledges intermediation as the core activity. It means that banks do not produce loan and deposit services, instead, deposits are dealt with as inputs while, loans and investments are considered as outputs. Therefore, to get a clear understanding of the proposed methodology, this paper evaluates efficiency of 100 DMUs (Melli bank branches in the province of Isfahan) with three fuzzy inputs and two fuzzy outputs. The data of fuzzy inputs and outputs are positive TFNs as shown in Table 3.
Input and output variables for evaluating efficiency of bank branches
Clustering algorithm has been formulated in MS Excel 2016, and all DEA computation were done by MATLAB version R2016b. A group of five experts who are the senior managers in the Melli bank of province of Isfahan participated in this study. They evaluated 100 branches of Melli bank against the 31 criteria to cluster them. Table 4 shows the score of alternatives against the criteria which are attributed by the decision maker No.1. For example, alternative A1 has the score of “Very High” in C1. Table 5 shows the equivalent triangular fuzzy numbers of scores in Table 4. For example, the score of “Very High” for A1 against C1 in Table 4, is equivalent to (0.8,1,1) in Table 5.
The linguistic assessments of alternatives against criteria
The linguistic assessments of alternatives against criteria
Assessments of alternatives against criteria with TFNs
Table 6 reports details about the maximum distance between upper and lower bounds of qth fuzzy number of all sequences.
Distance between upper and lower boundaries of qth fuzzy number of all sequences
*Note: C: Criteria; V: Value.
Table 7 shows the initial relation matrix of all sequences and Table 8 shows the fuzzy equivalence relations of the 100 branches’ preferences against all criteria that are obtained by the max–min transitive closure.
Binary relation between X iq sequences
The max-min transitive closure R*
In Table 8, the entries can be partitioned into nine intervals. Therefore, the clustering of the 100 branches is shown in Table 9 according to the fuzzy equivalence matrix
The partitioning of the 100 branches on different intervals ofλ s
*Note: VI: Validation index.
Results show that nine sets of clusters have been obtained in which clustering set No.5 with highest validation index (6316.10) provides the best clustering. In this case, 100 branches of Melli bank are clustered in 5 distinct clusters which include 26, 14, 18, 24, and 18 DMUs respectively.
Now, each cluster contains homogenous DMUs which can be evaluated by the proposed CSW-F-IP model (22). Data of inputs and outputs for efficiency analysis are presented in Table 10.
Input and output variables (10 Rials)
Efficiency score of DMUs using the proposed common weight model (22) at different levels of α is shown in Table 11.
Efficiency values for 100 bank branches
The efficiency of branches in each cluster are measured with the developed CSW-F-IP model and all 100 branches are ranked at different levels of α. Results show that a branch with high efficiency may produce lower outputs than the branch with lower efficiency when receiving the same inputs or even in some cases more inputs. However, it is ranked higher because of the environmental factors that have made heterogeneous conditions for branches in different clusters.
After introducing the useful clustering methods, the fuzzy equivalence relation method is used due to the following reasons: the existence of uncertain data, the large number of DMUs, the lack of clarity in the number of clusters at the beginning of the work, and solving the problem of overlapping clusters. This method eventually leads to the clustering of decision-making units in homogeneous groups by obtaining more appropriate and accurate answers. Each cluster includes those DMUs that have homogenous operational and environmental conditions which is the fundamental assumption to apply DEA. The proposed approach pays the highest attention to homogenizing the decision-making units and so provides fairer results for the managers and policy makers to make relevant decisions. This research contributes to the theory of DEA by 1) overcoming the heterogeneity of DMUs in terms of operational and environmental conditions through clustering with fuzzy equivalence relation, 2) considering fuzzy data for inputs and outputs which makes the DEA model nonlinear, 3) obtaining more information about the efficiency of DMUs, and 4) developing common set of weights for complete ranking of DMUs. The ideal point method has some merits over other approaches of finding CSW such as feasibility of the proposed model, possibility of defining a DMU as the best decision-making unit, and also its simplicity in contrast to the other methods. This study faces two limitations. First limitation relates to the number of inputs and outputs. Although there are quite a lot of DMUs under consideration, but the number of DMUs in some clusters are very few while for better discrimination among DMUs we need at least three times of the sum of inputs and outputs. To handle this limitation, we selected the most common factors as the proxies of input and output variables. Second limitation relates to the number of participant experts and their thorough knowledge on the operating environment of bank branches. Although there are many senior managers who are professional in banking industry, not all of them have extensive knowledge of environmental conditions of the branches. To overcome this barrier, we found five experts with snowball sampling which is enough for multi criteria decision-making situations.
Conclusion and directions for future research
Conventional models of DEA accept that management of each DMU controls the inputs and outputs at his/her preference. Those models assume that all the DMUs are functioning under the same environmental situations. On the other hand, homogeneity of DMUs is a prerequisite for them. In reality, however, there might exist environmental inputs or outputs which DMU’s management is unable to control them. In many cases, it is essential to consider the environmental factors for performance evaluation. Those factors mainly relate to the operational environment of DMUs, which challenge the homogeneity presumption in DEA. In this study, we investigated the environmental conditions of bank branches to cluster them into homogenous decision-making units for a fairer efficiency analysis. To incorporate the uncertainty of human judgment in efficiency analysis, fuzzy equivalence relation approach is used to cluster DMUs in terms of environmental and operational conditions. On the other hand, to overcome the weight flexibility of conventional DEA models and reach a complete ranking of all DMU, we developed a fuzzy common set of weights model using ideal point method (CSW-F-IP) to measure the efficiency of DMUs in each cluster. So, this paper contributes to knowledge of performance measurement by (a) clustering DMUs into homogeneous groups in terms of their operational and environmental conditions, and (b) improving discrimination among DMUs using the proposed common weight analysis with fuzzy data. Future studies can use multi-criteria decision-making methods to select the most important environmental factors for clustering. Finally, we recommend researchers to develop common weight models with goal programming technique and fuzzy data for non-discretionary factors.
Footnotes
Acknowledgments
The authors would like to thank the editor and two reviewers for their insightful comments, as a result, this paper has been improved substantially.
