Abstract
Suspended particulate matter (SPM) emissions from coal-based thermal power plants (CTPPs) cause respiratory illness. However, this has not been given its due importance in the efficiency assessment of CTPPs. This study contributes to the literature by incorporating suspended particulate matter in the benchmarking exercise for Indian CTPPs. In such a study, the theoretical assumptions regarding pollution generating technology or the choice of evaluation tool may impact the ranking of CTPPs. To draw robust inferences, we present a comparative study of two alternative microeconomic approaches (joint and by-production technologies) and two types of data envelopment analysis tools (graph–hyperbolic and directional distance function) applied on two representative samples of Indian CTPPs. Results indicate that Indian CTPPs are moderately inefficient. Choice of technological assumption or data envelopment analysis model does not impact the ranking of CTPPs. Ownership and plant load factor play vital roles in determining inefficiency, and impacts of these factors remain stable across models.
Keywords
Introduction
Coal-based thermal power plants (CTPPs) are the primary source of energy for India. However, the use of coal as the primary input for electricity generation produces various pollutants such as suspended particulate matters (SPM), CO, CO2, nitrogen oxides (NOx) and SO2. In an emerging economy like India, electricity is an essential input for economic growth. Nevertheless, the problem of high local air pollution also needs to be addressed. To understand health costs associated with Indian CTPPs better, interested readers may refer to Mahapatra et al. (2012) and Cropper et al. (2013). 1 The coal used in the Indian power sector has a high ash content, making SPM the key local air pollutant released from CTPPs (Kumar & Rao, 2003; Sengupta, 2007). Sengupta (2007) suggests replacing age-old boilers and introducing ash tax as policy measures to curb SPM emissions from Indian CTPPs.
To address negative externalities related to CTPPs, the Government of India introduced the Environmental Management Award scheme 2 in 2008 and finally set up some new emission regulation standards for power plants in December 2015. An overview of the Indian thermal power sector and current policies to address energy efficiency and sustainability are found in Malik et al. (2015), Shukla and Zia (2016) and Kumar (2017). In this context, regular performance evaluation of Indian CTPPs may help us to identify the plants that require attention from the environmental sustainability perspective and explore the trade-off between increasing electricity generation and reducing local air pollutant emissions. Effective management may reduce emissions and overall energy consumption of CTPPs, which can help to achieve sustainable development of the whole industry. From managerial and policy perspectives, it is essential to determine whether the energy–environment balance is dealt with more efficiently by private or public enterprises. Auxiliary power consumption in Indian CTPPs is on a higher side than CTPPs in other developed countries due to poor plant load factor, the use of poor coal quality, obsolete equipment and aging of equipment (Mandi & Yaragatti, 2014). Thus, auxiliary power consumption and hence emissions can be reduced by improving the plant load factor (Mandi & Yaragatti, 2014).
In this study, we utilize data envelopment analysis (DEA) — a non-parametric method to evaluate the efficiency of Indian CTPPs (decision-making unit or DMU as per DEA terminology) generating both electricity (socially desirable or ‘good’ as per DEA vocabulary) and SPM emissions (socially undesirable or ‘bad’ as per DEA jargon). Under the DEA modelling approach, one compares the actual output levels produced by a DMU with the best achievable output levels of that particular DMU. Scholars have used DEA in assessing both productive and environmental performances of electric utilities or other entities to figure out laggard DMUs (Sueyoshi et al., 2017). In DEA literature, one can accommodate a bad output in one of the three different ways: (a) bad output as input; (b) bad output as an unintended output of production process or joint-production (JP) approach as proposed by Färe et al. (1989); and (c) bad output as a by-product of the production process or by-production (BP) approach as propositioned by Murty et al. (2012). Among these alternative methodologies, JP and BP approaches are most popular in empirical research to accommodate bad output in the production process.
Although the production process assumes a positive correlation between good and bad outputs in both methodologies, they are fundamentally different from each other. Under the JP approach, it is assumed that producing good output will generate bad output, which is unavoidable. So, given an input level, we cannot unilaterally dispose of bad output. The reduction in bad output is always linked with a reduction in good output. On the other hand, the BP approach assumes that bad output arises because of specific input’s use to generate good output. Thus, reduction in bad output is possible only by cutting down the use of that particular input. So, we can say that unlike the JP approach, the BP approach imposes an additional constraint on the use of input also. As DEA is deterministic, introducing such additional constraints may lead to a variation in the efficiency measures. From an empirical perspective, the difference in efficiency measures stemming from the alternative methodologies might create difficulty in practical benchmarking exercises, because the ranking of DMUs may alter across two sets of model assumptions.
Moreover, choosing a particular DEA model can also impact efficiency scores and the ranking of DMUs. Sometimes the ranking of DMUs may also vary across DEA models (for a given technology) due to models’ inherent nature. For instance, suppose a decision-maker tries to increase good output and reduce bad output generation simultaneously. In that case, he/she might choose either the Directional Distance Function (DDF) model or graph–hyperbolic (GH) model to compute the inefficiency and technical efficiency (TE) measures. However, the nature of those two models is significantly different from each other. The DDF is a straight forward linear optimization problem where the GH is a non-linear optimization problem.
This study considers both the economic (electricity generation) and environmental (SPM emissions) aspects of thermal power generation to balance economic and environmental forces while determining the efficiency of Indian CTPPs. We study the sensitivity of efficiency measures and ranking of Indian CTPPs in the presence of SPM as bad output, by using two alternative DEA models (DDF and GH) under two different technological assumptions (JP and BP). This modelling strategy will help us to draw robust inferences regarding the effect of ownership and plant load factor on efficiency. The rest of our paper is organized as follows. The second section provides a literature review followed by a detailed methodological discussion in the third section. The empirical model and data source of the study are mentioned in the fourth section. We present the results from our empirical models in the fifth section and concludes our study in the last section.
Literature Review
The literature on energy efficiency reveals that researchers follow different technological assumptions to incorporate bad output in the production process and adopt different models to evaluate power plants. Numerous studies have been conducted to evaluate the productive and environmental performance of CTPPs in both developed and developing nations (Mardani et al., 2017; Xu et al., 2020). Here, we consider recent empirical studies to provide the reader with a glimpse of various methodologies adopted for efficiency evaluation of Indian CTPPs and main results. Murty et al. (2007) use a parametric DDF model, which captures the effort to increase electricity generation and reduce emissions for five CTPPs under the Andhra Pradesh Power Generation Corporation. To estimate DDF, they use monthly observations on electricity generation and various air pollutants (SPM, SO2, NOX) as outputs, while coal consumption, wage bill and total capital stock of CTPPs as the inputs during the period from 1996-97 to 2003-04. Their DDF model’s result indicates a ‘win–win’ situation as the representative plant can increase its electricity by 6% while cutting down emissions by the same proportion.
Shrivastava et al. (2012) compute Technical Efficiency (TE) of 60 Indian CTPPs for the year 2008-09 applying input-oriented versions of traditional DEA models on a technology set defined over one output (electricity generation) and three inputs (coal consumption, oil use and auxiliary power consumption). The results suggest that TE of small-sized CTPPs are comparatively lower than medium and large-sized CTPPs, and state government-owned CTPPs are more inefficient than central government and privately owned CTPPs. Yadav et al. (2014) evaluate the performance of 65 Indian CTPPs for the year 2008-09. Their model specification includes two outputs (electricity and CO2 emissions) and two inputs (installed capacity and coal consumption). They used an input-oriented DEA model to find out inefficient CTPPs. The study also uses the slack-based model to realize the improvement direction for an inefficient CTPPs. The result shows that state government-owned CTPPs are highly inefficient compared to their counterparts. The slack-based analysis suggests that inefficient CTPPs should reduce their coal consumption to improve operational performance.
Sahoo et al. (2017) examine the energy efficiency for 71 CTPPs in India for 2010. The technology set of their study incorporates two outputs (electricity generation and CO2 emissions) and five inputs (plant capacity, load factor, plant availability, coal consumption and secondary fuel consumption). The study assumes a joint production relation between electricity generation and CO2 emissions and follows Färe et al. (1989) DEA model to compute the energy efficiency scores for CTPPs. Apart from the energy efficiency scores, they have used the slack-based model to calculate the energy savings potential of Indian CTPPs. The results show that market-based targets are much lower than actual targets, and there exists substantial room for improvement in the energy-use system. Jain and Kumar (2018) estimate quadratic directional output distance function using linear programming (LP) approach by using a sample of 56 Indian CTPPs for 2000–2013 to compute TE and shadow prices of CO2 emissions. They find that over time, the inefficiency of the thermal power sector has been increasing.
Murty and Nagpal (2019) measure TE of Indian CTPPs using output-based Färe, Grosskopf, and Lovell efficiency index. The study uses plant-level annual data of 48 CTPPs from 2003 to 2015. The model specification contains two outputs (electricity generation and CO2 emissions) and three inputs (plant capacity, operating availability and fuel use). They follow the BP approach to model CO2 emissions (bad output) in the technology set. They define overall technology as an intersection of two sub-technologies—intended or ‘good’ technology and emission causing or ‘bad’ technology and decompose the Färe, Grosskopf, and Lovell index into TE and environmental efficiency indices. The environmental efficiency index is calculated relative to ‘bad’ technology, while TE is measured relative to ‘good’ technology. The high environmental efficiency score over time indicates that significant improvement in CO2 emission control performance cannot be achieved by reducing the use of emission causing input. The study finds that average TE scores remained more or less the same. However, an increase in the percentage of high-performing plants and a decrease in the percentage of low-performing plants in the inefficiency distribution suggest an improvement in electricity generation performance across the study period.
Jindal and Nilakantan (2021) enumerate TE of 129 Indian CTPPs over the period 2005–2014, applying input-oriented versions of radial DEA model and non-radial slacks-based measure DEA model. They define a technology set over one output (gross electricity produced) and six inputs (coal consumption, fuel oil use, auxiliary power consumption, installed capacity, plant load factor and plant availability factor). They find that average TE declined from 0.847 in 2005 to 0.742 in 2014. The bootstrapped truncated regression results suggest that efficiency increases with plant size, and privately owned CTPPs operate at higher efficiency levels than their public sector counterparts.
At the backdrop of the literature, our study aims to make some humble contributions to the existing body of knowledge. To the best of our knowledge, the impact of plant load factor on TE of Indian CTPPs has not been studied before. This is only the second study that focuses on SPM emissions as bad output, after Kumar and Rao (2003) who have analysed data from the 1990s. Our study tries to answer the following research questions:
Do ownership and plant load factor significantly affect inefficiency and TE of CTPPs across alternative DEA models under different technological assumptions? Given technology assumption, is the ranking of CTPPs robust across alternative DEA models? How does technology assumptions affect the ranking of CTPPs, given a DEA model?
Methodology
Staring from the microeconomic foundations, Ray (2020) provides an overview of the DEA methodology for radial and non-radial measurements of efficiency and alternate ways to combine undesirable ‘bad’ output with the ‘good’ output in DEA models. The interested reader may also refer to Dakpo et al. (2016), who summarize theoretical developments around the inclusion of bad outputs in production technology modelling and related DEA-based models. As mentioned earlier, variation in efficiency/inefficiency measures for a given CTPPs (across technological assumptions or DEA models) may lead to an inconsistency in benchmarking, and drawing robust inference on the role of non-discretionary factors on TE may be difficult. This study tries to overcome these problems by implementing the following strategy.
First, we adopt both JP and BP methodological approaches to accommodate bad outputs and aim to compute TE under both technological assumptions. Second, we use two alternative DEA models (DDF and GH) under both technological assumptions to compute inefficiency and TE measures. Third, we conduct a second-stage regression analysis to identify whether the ownership factor affects TE of Indian CTPPs across alternative technologies and DEA models.
Suppose we have N number of DMUs producing one ‘good’ or desirable output (Y) and one ‘bad’ or undesirable output (B) using a set of inputs (X). A DMU should be given credit for good output generation and penalized for its bad output generation. In this study, we discuss two different methodologies, where both these outputs (i.e., good and bad outputs both) are treated asymmetrically in the performance evaluation.
We start our discussion with the JP approach, introduced by Färe et al. (1989), where it is assumed that both the outputs are jointly produced from a given input set that is freely disposable. However, to model bad output in the production technology set, they have introduced two additional assumptions (i.e., null jointness and weak disposability of bad output with good output) in their production technology. The null jointness assumption says that bad output is produced jointly with good output; so, it is impossible to produce any good output without producing any bad output. The weak disposability assumption implies that any proportional contraction in bad output is possible only if the good output is reduced in the same proportion. In other words, for a given input set, a DMU can reduce its bad output only if the good output is also reduced. The idea of weak disposability assumption suggests that bad output disposal is costly, which restricts good output production. However, Färe et al. (1989) assume that good output is freely disposable, indicating that a DMU can produce less amount of good output with the same amount of bad output at a given input level.
Based on null jointness of outputs, weak disposability of bad output with good output, and free disposability of good output Färe et al. (1989) have constructed the following production possibility set (a.k.a. technology set):
In Equation (1), T(X) defines the technology set followed by the DMU and λj is the intensity vector (or DMU specific non-negative weights). The convex combinations of outputs in the technology set, i.e. and
The JP approach suggests that a DMU cannot reduce its bad output without cutting down its good output, which implies a positive correlation between outputs at a given input level. However, at a given input level, a reduction in both output levels may result from resource diversion from good output production to abatement of bad output. The primary criticism against the JP approach is that it fails to model the abatement activities explicitly in the technology set. To accommodate the abatement activity regarding bad output in the technology set, Murty et al. (2012) propose an alternative modelling framework known as BP approach under which, DMU has two conceptually different production technologies: good output generation and bad output generation. Førsund (2021) also champions the philosophy behind the BP approach as generation of good output is independent of generation of bad output and vice versa. However, these separate technologies for the good and bad outputs are linked through common input use in the production process. The input associated with the good technology is defined as neutral input (X1), while input that causes bad output is called polluting input (X2). To produce good output, DMU requires both types of inputs, while for the generation of bad output, DMU only needs polluting input. According to Murty et al. (2012), the generation of good and bad outputs is independent. They use two different sets of intensity vectors to model both sub-technologies for the DMU. In this case, a DMU is said to be efficient DMU if and only if it is efficient relative to both sub-technologies.
However, Ray et al. (2017) criticize the idea of independent sub-technologies proposed by Murty et al. (2012). They argue that in the presence of input (mainly for polluting input) sharing between two sub-technologies, use of two sets of intensity vectors leads to two different peer groups (for a single inefficient DMU), which is not desirable from an empirical point of view. So, they propose some modifications to the BP approach. They assume that bad output and polluting input are jointly disposable in the corresponding sub-technology, ensuring that reduction in bad output is always linked with a simultaneous reduction in polluting input. Second, they use a single intensity vector (in the production technology) to choose a single peer group for an inefficient DMU.
In this study, we follow a combined approach of Murty et al. (2012) and Ray et al. (2017) to model the bad output in the production possibility set. Like Murty et al. (2012), we assume that overall technology is an intersection of good and bad technology. However, we follow Ray et al. (2017) to use a single intensity vector for both technologies. However, we modify the assumption of joint disposability in Ray et al. (2017) as follows. A DMU cannot substitute its polluting with any other inputs due to its usages in both sub-technologies. So, reducing bad output by cutting down the polluting input use might affect its good output generation. An advantage of the modified joint disposability assumption is that it ensures a balance between both outputs when the DMU mitigates its bad output. We define our BP technology set as
where T1(X) defines good technology while T2(X) is the bad output technology, and the overall technology is defined by TBP(X), which is the intersection of both sub-technologies. The intensity vector λj is same for both sub-technologies. Like Equation (1), we assume VRS in our BP approach as defined in Equation (2). Equality constraints
Case I: DEA-LP for GH Model Under JP Approach
Here the objective is to find the optimal value, up to which we can simultaneously expand good output production and reduce (proportionately) the bad output generation. To obtain the value of θ*
JP
for jth DMU, we solve the following optimization problem:
Subject to
Note that in the this optimization problem (Equation (3)) the bad output restriction involves non-linear equality. To make it linear, we use first-order Taylor’s series approximation for the non-linear constraint. We define
Hence, at θ0 = 1 we get
So, using this approximation (Equation (4)), we can rewrite Equation (3) as
Subject to
Case II: DEA-LP for DDF Model Under JP Approach
Like the GH model, we try to find the optimal scaling factor to expand the production of good output while reducing bad output simultaneously. To obtain the value of δ*
JP
> for jth DMU, we solve the following optimization problem:
Subject to
Note that we set our direction vector to (1, −1), to keep our model simple.
Case III: DEA-LP for GH Model Under BP Approach
Our objective is to get the optimal value to expand good output while reducing the bad output by cutting down the polluting input use in the case of BP approach. To get the optimal value for jth DMU, we solve the following problem:
Subject to
Here also we use the first-order Taylor’s series approximation to make the non-linear restriction linear one. So, based on our approximation result in Equation (4), we rewrite our Equation (7) as follow:
Subject to
Case IV: DEA-LP for DDF Model Under BP Approach
The following DEA–LP objective is to find an optimal strategy to expand the production of good output while reducing bad output by cutting down polluting input use. Hence, our DEA–LP model can be written as:
Subject to
Following standard practice (Ray et al., 2017), we set the direction vector as (1, −1, −1) to make our model simple.
Once we solve all optimization problems, we use regression analysis as a second-stage study to identify factors that cause variation in the performance level. In this study, we have computed four different scaling factors θ*
JP
, δ*
JP
, θ*
BP
and δ*
BP
. So, we estimate four models, one for each scaling factor to recognize non-discretionary factors. We consider the following regression model for jth DMU:
where I (N × 1) vector of calculated inefficiency or efficiency score, Z is (N × K) vector of exogenous variables, β is (K × 1) vector of parameters, ε is an (N × 1) vector of error term of the model and t indicates the time. Many scholars note that TE varies between zero to one and suggests a Tobit model to identify non-discretionary factors. However, McDonald (2009) argues that efficiency scores are fractional data, and the Tobit model may provide inappropriate findings. So, we decide to adopt the pooled regression model with robust standard errors to estimate Equation (10). Figure 1 illustrates the analytical steps used when we begin with the BP technology assumption. 4

Empirical Model and Data Source
This study adopts a technology set involving two inputs and two outputs. In our analysis, net electricity generation and SPM emissions are considered as good output (Y) and bad output (B), respectively. For the input set, we have two inputs: the plant’s net capacity (X1) and total coal use (X2). In the case of BP approach, the capacity variable is considered as neutral input for our study, while coal is used as polluting input. The data that goes into the construction of our empirical models are collected from the Central Electricity Authority’s annual publications (Central Electricity Authority, 2013, 2015). Each annual review offers detailed plant-level data regarding electricity generation, generation capacity, coal use per unit electricity generation, operation availability, electricity consumption for operation purpose, ownership status, capacity utilization and SPM emissions range. These data are voluntarily shared with the Central Electricity Authority, leading to missing information and an unbalanced panel.
We utilize annual data for two different years 2012-13 and 2014-15. We cannot use more recent data as annual reports stop providing data on SPM emissions after 2015. In 2014-15, the annual review covers 167 CTPPs, out of which we get full information on 70 CTPPs. Similarly, for 2012-13 we get information on 70 CTPPs. Out of these CTPPs (i.e., 70 CTPPs in 2012-13 and 70 CTPPs in 2014-15), we have 66 CTPPs contributing data for both years. As the data source does not reveal any information on labour use at CTPPs, we cannot include labour input in our models. However, as CTPPs are highly technology intensive, labour probably plays a minor role in productivity (Sahoo et al., 2017). This study constructs the following variables to measure the inefficiency scores.
Net electricity: We subtract the auxiliary consumption from gross electricity generation to get the net electricity generation (in GWh) of a plant. SPM emissions: We take average of the maximum and the minimum SPM emissions and create SPM emissions (in mg/Nm3) level of each plant. Net capacity: During a year, CTPPs do not operate at their full capacity. To get the actual capacity, we multiply the plant’s total capacity by its operational availability and get a plant’s net capacity (in MW), which is considered neutral input in our analysis. Coal use: We multiply coal use per KW of a plant with its gross electricity generation to get total coal use (in thousand tonnes) by that particular plant.
After DEA calculations, we merge each year’s data set to a single data set and make a balanced panel of CTPPs to identify the possible non-discretionary factors. We utilize inefficiency and efficiency scores (calculated from Equations (5), (6), (8) and (9)) as dependent variables in the second stage regression model. Previous studies on efficiency analysis of CTPPs include type of fuel, plant’s age and size as explanatory variables in the second-stage regression model. However, in this study, we do not consider these variables as explanatory variables. Given the data limitation, plant age determination is complicated and hence ignored. Second, a high correlation between the number of units in a particular plant and net capacity of that plant restricts the use of the units variable in second stage regression.
We use ownership dummies, capacity utilization and year dummy as explanatory variables in the second stage regression model. Indian CTPPs can be classified into three categories based on the ownership: central government-owned, state government-owned, and privately owned CTPPs. The different ownership dummies help us to capture the impact of management on plant’s efficiency level. Similarly, better capacity utilization is expected to improve the performance level of the plant. Again, over time performance of CTPPs may improve due to technological progress. Note that we consider private company owned CTPPs as a base category and compare that with the other two ownership categories in our analysis. Similarly, we consider the year 2012-13 as the base year for our analysis. Definitions of all regressors are as follows.
Central: If the central government owns the plant, the dummy takes value 1, otherwise 0. State: For state government-owned plant, the dummy takes value 1, otherwise 0. Plant load factor: To capture the capacity utilization effect, we use the plant load factor, which is defined as the ratio of actual energy generated by the plant and maximum energy generated by it. Y14: If the data belongs to 2014-15, the Y14 dummy takes value 1.
Descriptive Statistics of Sample.
Results and Discussion
Descriptive Statistics of GH Efficiency and DDF Inefficiency Scores.
Spearman Rank Correlation (ρ) of Matrix Across Technology.
Spearman Rank Correlation (ρ) Matrix Across DEA Models.
Mean and Median GH Efficiency and DDF Inefficiency Scores Across Ownership.
List of Poor-Performing CTPPs Across Technologies and Data Envelopment Analysis Models for 2014–2015.
Second-Stage Pooled Regression.
Conclusion and Policy Discussion
This study aims to contribute to the growing body of literature on the efficiency of Indian CTPPs producing electricity and SPM, a harmful local air pollutant that has not been given due consideration. Previous efficiency studies on Indian CTPPs have derived interesting conclusions based on a specific assumption on pollution generating technology and a particular DEA model. We adopt and empirically compare four modelling options (combinations of two influential approaches to model pollution generating technologies: JP and BP, and two popular DEA tools: GH and DDF) to study their implications for benchmarking. The results from all four models suggest that on average Indian CTPPs are moderately inefficient in joint production of electricity and SPM emissions. Model outcomes suggest that efficiency/inefficiency scores vary somewhat across alternative models, but ranking of CTPPs remains almost the same across two DEA tools and technological assumptions.
In this study, we have covered 22 different electricity producing companies under three different ownership status. Out of these 22 companies, state governments own most of the utilities (15), followed by private sector (5) and the central government (2). The TATA Power Company Limited and Reliance Industries Limited own the most efficient CTPPs across DEA models and technological assumptions. On the other hand, Maharashtra State Power Generation Company Limited turns out to be the most inefficient utility company in our study. Under central government owned utilities, the Damodar Valley Corporation performs poorly across alternative models and technological assumptions. The positive (negative) significant correlation between efficiency (inefficiency) scores and plant load factor across technological assumptions suggests that better capacity utilization by a company is directly linked with its performance level.
The second-stage regression findings reveal that non-discretionary factors are similar across different technological assumptions and DEA models. Regression results confirm that CTPPs owned by private companies have performed better both in terms of efficiency compared to the central or the state government-owned power enterprises. The better utilization of the capacity helps the plant to improve productive performance. This modelling exercise will be of interest to various stakeholders to understand the ‘win–win’ moves, that is, the extent of potential to expand electricity generation and contract SPM emissions if CTPPs operate efficiently.
However, our findings are subject to two major limitations. First, due to the unavailability of labour data, we fail to accommodate the labour input in the technology set. Second, there may be a technological difference between public sector owned CTPPs and private company owned CTPPs. However, our study fails to address this issue due to the insufficient number of privately owned CTPPs. Despite such limitations, our study may provide a robust benchmarking of Indian CTPPs, which may help policymakers develop strategies to improve the overall performance level of the Indian thermal energy sector. The government should either go for public–private participation models for the better managerial experience or upgrade the infrastructure of public-owned CTPPs to generate more electricity with fewer emissions.
Footnotes
Declaration of Conflicting Interests
The authors declared no potential conflicts of interest with respect to the research, authorship and/or publication of this article.
Funding
The authors received no financial support for the research, authorship and/or publication of this article.
