Abstract
Uncertain data envelopment analysis (DEA) model make an estimate of the efficiency of decision making unit (DMU) under data uncertainty. The current research on uncertain DEA model is only based on sectional data to calculate DMU’s static efficiency for the DMU’s set in the same period. From this article, we attempt to combine Malmquist productivity index and uncertain DEA model (the uncertain DEA-Malmquist productivity index model) to calculate the dynamic change of DMU’s efficiency over time. Additionally, the impact of technical factors and scale factors on DMU’s efficiency can be further explored and the Malmquist productivity index will be decomposed into pure technical efficiency change, scale efficiency change and technical change. Finally, the article uses the model to analyze the provincial environmental efficiency from 2014 to 2016 in China.
Introduction
Data envelopment analysis (DEA) is an effective method to calculate DMU’s relative efficiency. The DMU is an entity, such as a province or a department. Charnes et al. [1] proposed the first DEA model referred to CCR model in 1978. The CCR model can evaluate DMU’s comprehensive technical efficiency. Later, Banker et al. [2] presented another model (BCC model) to evaluate DMU’s scale efficiency. Here, each DMU is “self-evaluated" based on the DMU set, which is used to evaluate the situation of DMU in the same period. When evaluating DMU data set is panel data, the traditional DEA method is no longer applicable to its premise and needs to be improved. In addition, DEA has many applications. For example, Ganji et al. [3, 4] applied DEA to road safety evaluation.
Malmquist Productivity Index (MPI) is a significantly mainstream way of measuring productivity over time and it was first proposed by Malmquist [5] in 1953. Then, Caves et al. [6] first applied the Malmquist index to the measurement of productivity changes under the DEA in 1982. Later, Ray and Desli [7] put forward the decomposition mode of MPI, which is referred to as RD decomposition mode. The mode can decompose the index value to the technical change (TC) and technical efficiency change (TEC), among which the latter was further decomposed into pure technical efficiency change (PTEC) and scale efficiency change (SC). Since then, the combination of DEA and Malmquist productivity index (the DEA-Malmquist productivity index model) has been widely applied to reflect the dynamic efficiency changes of DMU. Such as Ganji and Rassafi [8] use the DEA-Malmquist productivity index to assess Iranian road safety in 2019, Singh et al. [9] applied DEA-Malmquist productivity index on Health Care System Efficiency in 2021, Bansal et al. [10] applied the network DEA and Malmquist productivity index to banking data in 2022, and so on.
However, the above mentioned studies on traditional MPI require precise input-output data. However, some input or output data are imprecise in reality, such as carbon dioxide emissions. At this time, the traditional MPI is not suitable to measure the dynamic efficiency change of DMU when the evaluation data is imprecise. For this situation, researchers have done a lot of research on how to measure MPI of DMU with imprecise data. In recent ten years, such as Hatami-Marbini et al. [9] applied the MPI to health care in 2012, Oruc [12] proposed a new grey MPI model to measure DMU’s productivity changes with interval data in 2015, Kordrostami and Noveiri [13] proposed a new fuzzy DEA with the expected value method to the multi-period systems, where MPI is used to calculate the efficiency change in two periods through two examples in 2017, Huang et al. [14] combined fuzzy BWM-DEA-AR model with the MPI to analysis China’s energy security from 2008 to 2017 in 2021, Peykani and Seyed Esmaeili [15] extended the possibilistic programming and chance-constrained programming to deal with fuzzy data in 2021, Huang et al. [16] applied the interval MPI to the new patterns in China’s regional green development in 2021, Yang et al. [17] applied the DEA-MPI model to measure the efficiency of scientific and technological innovation in regional industrial enterprises in 2022, and so on.
In this paper, we will start from uncertainty theory [18] and propose a new uncertain DEA-Malmquist productivity index model with imprecise data to measure DMU’s dynamic efficiency. Uncertainty theory is an understandable mathematical structure for dealing with the doubt in data. Wen et al. [19] established the first uncertain DEA model by the uncertain chance constraint method in 2014. Subsequently, Lio and Liu [20] improved the uncertain DEA model studied by Wen et al. [19] to describe DMU’s technical efficiency through the expected value method in 2018. Then, Jiang et al. [21] separated the pure technical efficiency and scale efficiency of the uncertain DEA model based on the research results by Lio and Liu [20]. So far, these uncertain DEA models are focused on sectional data to analysis DMU’s static efficiency value relative to DMU’s set over the same period. In order to explore the dynamic changes of DMU efficiency over different periods and the influence of technical factors and scale factors on DMU’s dynamic efficiency, the paper proposed the new uncertain DEA-Malmquist productivity index model with imprecise data. In the measurement of production efficiency in financial, industrial, medical, environmental, and other sectors, it plays a very important role in measuring productivity changes over time in the case of imprecise panel data. Therefore, the purpose of this study is to establish the uncertain DEA-Malmquist productivity index model to reflect the change of DMU’s efficiency value in different periods when the data is imprecise. In addition, researchers will answer the following research questions: (a) How to deal with imprecise data? (b) What is the specific DEA model involved in the uncertain DEA-Malmquist productivity index model? (c) How to model imprecise data to calculate MPI? (d) How to demonstrate the application of the uncertain DEA-Malmquist productivity index model in practice through examples? To sum up, the main contributions of this paper are as follows: In order to study the change of DMU productivity over time under imprecise data, we use uncertain variables to describe the imprecise data, and then construct the uncertain DEA-Malmquist productivity index model. The specific construction process of this model is shown in Section 3, in which DEA models (1) and (2) are involved. Then, a numerical example is used to demonstrate the practical application of this model.
The rest of this paper is structured as shown below. In Section 2, some basic knowledge of uncertainty theory and two basic uncertain DEA models are introduced. In Section 3, the uncertain DEA-Malmquist productivity index model is proposed to describe the total factor productivity of DMU. Additionally, the Malmquist productivity index model is decomposed. In Section 4, a numerical example is given to empirically analyze each China’s province environmental efficiency from 2014 to 2016. In Section 5, some conclusions are given.
Preliminaries
Uncertainty theory
In this section, we will introduce some knowledge about uncertainty theory that is involved in this paper. Uncertainty theory uses uncertain measure
Moveover, the uncertainty distribution of ξ is
In addition, if the uncertainty distribution Φ (x) is a continuous function and strictly increasing with respect to x at which 0 < Φ (x) <1, and
Uncertain DEA models
With the application and development of the uncertainty theory, DEA has a broader development prospect on account of imprecise data. In 2018, Lio and Liu [20] proposed the uncertain DEA model to measure whether the DMU is technically efficient. The basic symbols of the model are as follows:
DMU i : the ith DMU, i = 1, 2, …, n.
DMU o : the target DMU.
u = (u1, u2, …, u r ): the vector of input weights.
v = (v1, v2, …, v s ): the vector of ouput weights.
There are n DMUs, r inputs, and s outputs. DMU’s relative efficiency evaluation is relative to the overall DMUs. Uncertain DEA model to measure technical efficiency was presented by Lio and Liu [20] under the condition of constant returns to scale is as follows:
In 2018, Lio and Liu [20] upgraded an uncertain DEA model based on the expected method to measure DMU’s technical efficiency. After that, Jiang et al. [21] proposed an uncertain DEA model to measure DMU’s scale efficiency and pure technical efficiency based on Lio and Liu [20]. The existing uncertain DEA models measure static relative efficiency values of DMU in the same period. However, the evaluation data of DMU includes not only sectional data but also panel data. When it is panel data, model (1) and (2) are no longer applicable. In order to further reflect the change of DMU’s efficiency value in different periods, the uncertain DEA-Malmquist productivity index model is presented in this article. The calculation of MPI is based on distance function of period t and period t+1, where the distance function happens to be the inverse of the calculation results of the uncertain DEA model. Taking the construction of the distance function at period t as an example. Suppose there are n DMUs. Each DMU at period t has r inputs and s outputs. The basic symbols are as follows:
Thus, based on constant returns to scale, let
The distance function
The equivalent form of model (6) is drived as (hereafter model IV):
Similarly, based on variable returns to scale, let
Distance function
The equivalent form of model (4) is drived as (hereafter model II):
Then, MPI reflects the DMU’s productivity change. The DMU’s productivity change on account of the technical level at period γ is defined as
According to Ray and Desli [7], the MPI can be divided into technical efficiency change (UTEC) and technology change (UTC). Moreover, UTEC can be further divided into pure technical efficiency change (UPTEC) and scale efficiency change (USEC). Besides, the results of uncertain DEA model include technical efficiency (UTE), pure technical efficiency (UPTE), and scale efficiency (USE). UTE reflects the resourse alllocation efficiency for the DMU, UPTE shows the DMU’s input utilization level determined by the specific production technology level, and USE reflects the DMU’s input utilization level determined by the specific scale efficiency. The model (1) can calculate the value of UTE. The model (2) model can calculate the value of UPTE. In addition, USE is defined as
With the continuous progress of society and the improvement of people’s living standards, environmental pollution has gradually entered people’s vision. How to control environmental pollution and improve the quality of people’s living environment has become a major problem in today’s society. Therefore, it’s practical significance to study province’s environmental efficiency. In this section, we apply the uncertain DEA-Malmquist productivity index model to assess the dynamics of environmental efficiency of China’s provinces from 2014 to 2016.
The paper selects specific statistics from 30 provinces in China from 2014 to 2016. Each province acts as a DMU. Moreover, The statistics of Tibet, Hong Kong, Macau, and Taiwan are incomplete, so this article does not consider them for the time being. Then, we need to fully consider and choose input indicators and output indicators. The existing research’s input indicators with respect to the environmental efficiency are usually selected from three perspectives: capital, labor, and energy (e.g., Du, Chen, and Huang [25], Zhu et al. [26], Choi, Oh, and Zhang [27]). In this paper, we select provincial employed population (EP) as the labor input variable, provincial energy consumption (EC) as energy input variable, and capital stock (CS) of each province as the capital input index according to the practical situation. Among them, CS represents all existing capital resources of a province, which can usually reflect the current province’s production technological level and scale. However, there are no accurate statistics on hand of the CS. Therefore, we can regard CS as uncertain variable by using uncertainty theory [18], and obtain its uncertainty distribution by using expert scoring method. In the selection of output variables, the output variables may be divided into desirable output and undesirable output. Among them, the desirable output is mainly economic variable, and undesirable output is mainly the emission index of related environmental pollutants. In this paper, we select four output variables, gross domestic product (GDP), total CO2 emissions (TCE), total SO2 emissions (TSE) and total wastewater emissions (TWE) according to the practical situation. Table 1 indicates that the selection of input indicators and output indicators. Furthermore, one of the traditional DEA model purposes is to obtain the maximum output. TCE, TSE and TWE are undesirable outputs because they belong to environmental pollutants. Thus, one of the solutions is that we regard these undesirable output variables as input variables (Zhou et al. [28]). Furthermore, under the current technical conditions, we may not get accurate data about the emissions of environmental pollutants, so we may use the uncertainty theory [18] to treat TCE, TSE, and TWE as uncertain variables, and then use the expert scoring method to obtain their uncertainty distribution. Except CS, TCE, TSE, and TWE, the precise statistics of other indicators come from China Energy Statistical Yearbook [29] and the employed population statistics comes from the provincial statistical yearbooks 1 in China.
The selection of indicators
The selection of indicators
Since we can’t obtain specific values of CS, TCE, TSE, and TWE, we have to invite some environmental experts and economic experts to assess people’s level of belief degrees of environmental pollutants and CS, respectively. Take Beijing’s total CO2 emissions in 2014 as an example, the specific consultation procedure was as shown below.
(Q) What’s your opinion about the minimum Beijing’s total CO2 emissions in 2014?
(A) 85 million tons. (We got a set of the expert’s experimental data (85,0), which represents that the experts believe that the minimum Beijing’s CO2 emissions in 2014 is 85 million tons and the belief degree is recorded as 0.)
(Q) What’s your opinion about the maximum Beijing’s total CO2 emissions in 2014?
(A) 94 million tons. (We got a set of the expert’s experimental data (94,1), which represents that the experts believe that the maximum Beijing’s CO2 emissions in 2014 is 94 million tons and the belief degree is recorded as 1.)
Two experimental values from the expert’s evaluation as (85,0) and (94,1) about the value of Beining’s total CO2 emissions in 2014 was obtained. Therefore, the uncertain variable is
Similiary, use the same approach to get Beijing’s CS, TSE and TWE in 2014 as
The 30 provinces’ environental efficiency of China
In Table 3, Beijing and Shanghai performed eximiously from 2014 to 2016. Their environmental efficiencies were achieved 1. Thus, they were efficient. It is worth noting that Guangdong province’s environmental efficiency value in 2014 was 1, which was effective. However, the efficiency values were 0.991 and 0.968 in 2015 and 2016, respectively. It indicates the Guangdong’s environmental efficiency has declined year by year in the past three years. In addition, Ningxia performed the worst since the environmental efficiencies from 2014 to 2016 were 0.438, 0.442 and 0.515. Moreover, its average efficiency over the 3 years was only 0.465. Then, there are 4 provinces with an average environmental efficiency below 0.5, including Liaoning with 0.480, Guangxi with 0.499, Qinghai with 0.457 and Ningxia with 0.465, which is lower than half of the optimal environmental efficiency value. Obviously, the developed provinces are generally more environmentally efficient than less developed provinces. The above results show that the environmental development of China’s provinces is unbalanced.
The specific division of China’s regions
For the purpose of further analyzing the impact of regional differences on each province’s environmental efficiency, we divided the 30 provinces into three categories: the eastern region, the central region, and the western region. Table 3 shows the specific division.
Then, we calculated the average environmental efficiency of the provinces included in each region and plotted it as a line chart as shown in Fig. 1. Observing Fig. 1, we can easily find that the average environmental conditions in the eastern region are the best, followed by the central region and the worst in the western region. Moreover, the average environmental efficiency of the three regions showed a trend of decreasing first and then increasing from 2014 to 2016. Additionally, it is worth noting that the average environmental efficiency in the central region was slightly higher than the whole country environmental level in 2016, while it was significantly lower than the whole country environmental level in 2014 and 2015. It shows that the growth of the central region’s environmental efficiency is faster than other two regions from 2015 to 2016.

Changes in China’s regional environmental efficiency.
To further analyze the dynamic changes of environmental efficiency over time in each province from 2014 to 2016, we calculated the MPI for China’s 30 provinces. First, according to the data in Appendix Tables 6 and 7, we use the method in Example 1 to obtain the inverse uncertainty distribution of each uncertain variable. Then, according to models I, II, III, and IV, we calculate the distance functions (i.e.,
China’s 30 provinces’ average MPIs and its decomposition
The results show that China’s total factor productivity decreased by 0.016, but the average value was 1.002, indicating that the overall province’s environmental efficiency was rising. From the decomposition of the total factor productivity index, UTEC decreases by 0.04, and UTC increases by 0.052. In contrast, UTC plays a major role in the change the total factor productivity index, indicating that the improvement of the environment from 2014 to 2016 mainly comes from technical progress and technical innovation. In addition, USEC increased by 0.034 and UPTEC decreased by 0.079, indicating that the UPTEC has a greater impact on the UTEC. Then, we calculated the average MPI of China’s each province. Table 5 displays the results.
The decomposition of MPI in China’s different provinces
In Table 5, from 2014 to 2016, some provinces’ average MPIs are less than 1, indicating that these provinces’ environmental efficiency exhibited a downward trend. However, the 30 provinces’ average MPI are growing with an average growth rate of 1.5 percent, indicating that China’s overall environmental efficiency is improving. Then, from the perspective of technological progress, except for Tianjin, Hubei, Liaoning and Zhejiang, other provinces’ average UTC values are all exceed 1. It shows that environmental technology innovation is increasingly competitive with China’s rapid economic development, and technological progress has become the key factor to promote environmental efficiency in each province. From the perspective of pure technical efficiency changes, UPTEC values of all provinces fluctuated around 1, and nearly half of 30 provinces had UPTEC values less than 1. In addition, the average UPTEC value of the 30 provinces is 0.989, which is less than 1. These indications suggest that pure technical efficiency is an unimportant factor in promoting provincial environmental efficiency. From the perspective of scale changes, the overall average USEC value of the 30 provinces is 1.011, which is greater than 1. It shows that China’s environmental returns to scale are generally on the rise, and scale is an important factor for affecting the environmental efficiency of Chinese provinces. From these analysis, it can be seen that the growth momentum of environmental total factor productivity in China’s provinces mainly from technological progress and the promotion of returns to scale. How to improve pure technical efficiency is an issue that needs attention in China’s environment.
For the sake of reflecting the environmental total factor productivity’s change of China’s provinces well, we have drawn a distribution map as shown in Fig. 2 according to the size of the data and the geographic location of each province. Intuitively, Hebei, Yunnan and Jilin have the lowest degree of coloration, except for provinces such as Tibet that are inconsiderable. Among them, Hebei and Jinlin are the northern provinces of China, and its economic development and technological level are relatively weak. Yunnan, as one of the China’s southernmost provinces, due to the impact of geographical location and other factors, its economic development and technological level still are relatively weak.

Distribution map of MPI in different provinces of China.
The current research on uncertain DEA models is only based on sectional data to calculate DMU’s static efficiency value. This arcticle introduced the uncertain DEA-Malmquist productivity index model to describe the dynamic change of DMU efficiency. Moveover, the Malmquist productivity index was decomposed to technology change, pure technical efficiency change, and scale efficiency change to explore the impact of technology and scale. Then, we applied this model to measure the environmental efficiency changes in China’s provinces from 2014 to 2016.
The main conclusions were as follows: (1) From 2014 to 2016, Beijing and Shanghai achieved the optimal environmental efficiency and they could serve as benchmarks for inefficient provinces to improve their environmental efficiency. (2) From the regional division, the eastern region of China has the best environmental efficiency, followed by the central region, and the western region has the lowest environmental efficiency. It showed the imbalance of regional environmental development in China. In addition, from 2015 to 2016, the growth of the environmental efficiency in central China was significantly faster than other regions. (3) From the Malmquist productivity index and its decomposition, China’s average environmental total factor productivity has been on the rise during the 3 years. The improvement of China’s environmental total factor productivity primarily from technological progress and the improvement of returns to scale. How to improve pure technical efficiency was an issue that needs more attention.
In this paper, the selection of input and output indicators is based on previous research results and the actual situation. Human subjective factors are relatively large. In future research, Tobit regression model may be introduced into the selection of indicators of the model to make more accurate selection of indicators. In addition, the model proposed in this paper may be extended based on the double-frontier DEA model for empirical analysis.
Footnotes
Acknowledgments
This work was funded by the National Natural Science Foundation of China (Grant Nos. 12061072 and 62162059) and the Xinjiang Key Laboratory of Applied Mathematics (Grant No. XJDX1401).
Appendix
See Tables 6 and 7.
