Abstract
While earlier studies have focused excessively on bankruptcy prediction of banks, this study classifies banks based on their financial strength from the perspective of retail depositors who currently do not have an authentic guiding framework that helps them identify banks with higher risk profiles. Using machine learning techniques, we classify 44 Indian banks into distinct categories of financial health based on 12-year data from 2005 to 2017. We first use unsupervised learning to identify a pattern leading to logical groups in terms of financial health and then move to supervised learning for prediction. Using linear discriminant analysis (LDA), Classification and Regression Tree (CART) and Random Forest methods, we predict the cluster membership with the associated explanatory power alongside. We also compare our classification with the credit ratings awarded by rating agencies and highlight certain discrepancies that exist between what is predicted by our models and the credit rating awards.
Introduction
Collapse of banks caused by mismanagement impacts deposit holders, investors, employees and has a negative bearing on the economy as a whole (Huang, Chang, & Liu, 2012). Bank failure could trigger an adverse financial repercussion and generate negative impacts such as a massive bail-out cost for the failing bank and loss of confidence from the investors and depositors (Tung, Quek, & Cheng, 2004). Studies have shown that the growth foregone by the economy due to an overhang of non-performing loans can be in excess of two percentage points annually (Balgova et al., 2016). Weak institutions lacking competence during periods of credit expansion rapidly increases non-performing loans and can deplete bank capital, unless resolved on a timely manner, underpinning the importance of financial supervision (Steigum, 2011). Establishing an early warning system and timely intervention by the central bank can safeguard deposit holders, investors and the economy.
The Reserve Bank of India (RBI) has highlighted the deteriorating stability of Indian banks primarily due to poor capital adequacy, non-performing advances, profitability, liquidity and efficiency (RBI Financial Stability Report, 2018). Non-performing assets are loans which are in default or arrears where the bank has not received interest or principal due for more than 90 days. The stress in the Indian banking sector continues with problems of deteriorating asset quality and declining profitability, exerting pressure on the bank’s regulatory capital ratios (Kadanda & Raj, 2018). Indian banks fall short of the capital adequacy norms stipulated under the global capital risk norms (Basel III). The deteriorating non-performing assets (NPA) position warrants a higher level of capital infusion that the banks are unable to raise from the market; it has been challenging for the government to infuse such a requisition of additional capital (Economic Times, 2018a). This in effect means most of the Indian banks are undercapitalised, exposing the depositors and regulators to a higher level of risk.
India has been lacking a robust risk-based supervisory framework for the monitoring of the scheduled commercial banks. Although oversight has evolved recently, large magnitude frauds like the Punjab National Bank scam of ₹114 billion continue to be reported in the Indian banking system, which deepens the trust deficit between the banks and deposit holders (Firstpost, 2017). Depositor insurance in India is governed by the Deposit Insurance and Credit Guarantee Corporation (DICGC) Act, 1961, which insures deposit holders in the event of a bank failure. In comparison to the developed and developing economies, insured limit of deposits needs to increase at least by 15-fold to cover at least 90 per cent of the fixed deposits in an Indian context (Economic Times, 2018b). Especially given that ranking scheduled commercial banks in terms of risk and stability have been lacking, the interest of deposit holders is not fully secured and bank failures are a major concern for deposit holders. The recent restrictions imposed by the Reserve Bank of India (RBI) in terms of withdrawal caps on deposit holders of Punjab and Maharashtra Cooperative Bank (PMB), one of the top fibre cooperative banks in India, following the ₹65 billion scam, has resulted in a public outcry, further denting the trust deficit (Firstpost, 2019).
Credit rating agencies provide guidance to the deposit holders, to a certain extent. However, virtually every government inquiry into the 2008–2009 global financial crisis has assigned some blame to credit rating agencies for assigning inaccurate ratings and errors (Kashyap & Kovrijnykh, 2015). Recently, India’s credit rating agencies have come under severe scrutiny over their sudden downgrading of one of the country’s largest non-banking financial institution, Infrastructure Finance and Leasing Corporation (IL&FS). In India, the ‘issuer-pays’ model is popular where the rating agencies charge the banks that issue the debt instrument a fee for providing credit rating assessments, resulting in conflict of interest. Deposit holders are thus vulnerable and are exposed to bank instability and potential defaults, with no agency to provide an effective warning system. Predicting bankruptcy of commercial banks has been extensively studied in literature using a multitude of methodologies (Affes & Kaffel, 2019; Jones, 2017; Kolari & Sanz, 2017; Migliore & Chinta, 2017). However, there is a lack of studies that propose robust rating mechanisms that could identify unstable banks, thereby presenting an authentic guiding framework from the retail depositor’s perspective. This is a clear research gap this study has identified.
Our study contributes primarily in three areas. First, while earlier studies focused excessively on bankruptcy prediction, our study’s key objective is to classify banks based on their financial strength from the perspective of retail depositors who currently do not have an authentic guiding framework that helps them to identify banks with higher level of risk profile. Our study classifies banks into three risk categories—high, medium and low—facilitating the retail investor to make informed decisions when it comes to choosing a commercial bank for placing their savings as deposits. Our study includes all 44 Indian public and private sector banks, excluding foreign banks in operation, during the 12-year period from 2005 to 2017. Second, our study uses a more refined methodology, employing machine learning technique for classifying the Indian scheduled commercial banks into three distinct risk categories. This study first uses clustering, an unsupervised learning technique, and then moves to supervised learning involving linear discriminant analysis (LDA), classification and regression trees (CART) and Random Forest. Third, this study compares our classification of banks into high, medium and low financial risk categories with the credit ratings awarded by the top Indian rating agencies to the deposits issued by the banks and demonstrates disparities in the credit rating awards. Rating agencies could immensely benefit from such a framework.
This article is organised as follows: the second section sets out the literature review. The third section describes the data, and the fourth section elucidates the methodology. The fifth section analyses the results, and the sixth section sets out the discussions, conclusions and managerial implications. The seventh section lists out the limitations of the research and scope for further research.
Literature Review
The gross non-performing assets of the Indian scheduled commercial banks increased 11.6 per cent and the net non-performing assets to 6.1 per cent of the advances. Stressed advances that include non-performing and restructured advances reached 12.5 per cent of the total advances. The position of the public sector banks (PSBs) is more severe with net non-performing advances at 8.6 per cent and the quantum of stressed advances at an alarming 16.7 per cent. The capital adequacy ratio of Indian scheduled commercial banks plummeted to 13.8 per cent with Tier-I capital adequacy ratio falling to 7.0 per cent. Net interest at 2.7 per cent yielding a return on assets at a negative 0.2 per cent demonstrates deteriorating asset quality (RBI Financial Stability Report, 2018). The recent enactment of the Insolvency and Bankruptcy Code (IBC), 2016, intended for resolving the problem of stress has been ineffective, given the lengthy process and frequent court interventions, defeating the very purpose of time-bound stressed asset resolution (RBI Annual Report, 2018); Out of the 977 cases admitted, only 34 cases have seen successful resolution even after 2 years after the inception (IBC Delays, 2018).
Global banks fall short of regulatory norms, and stress tests used are not robust enough to differentiate between weak and sound banks (Goldstein, 2017). While the overall and Tier-I capital adequacy ratio for the Indian banks are lower than the Basel III prescribed norms and have been deteriorating, the problem for the banks identified under the Prompt Corrective Action (PCA) is severe, which have recorded much below 9 per cent of its risk weighted assets (RBI Financial Stability Report, 2018).
RBI has classified three large banks (SBI, ICICI, HDFC), having assets more than 2 per cent of the country’s Gross Domestic Product (GDP), as ‘too big to fail’ and termed them as ‘domestic systemically important banks (DSIB)’. These institutions are large and highly interconnected, and hence important for the orderly functioning of the entire national financial system and the real economy (Financial Express, 2017). The government guarantees to support these banks in the event of financial stress. However, there are four other large banks not designated as DSIBs (Axis Bank, Bank of Baroda, Bank of India and Punjab National Bank), exposing them and their deposit holders to higher financial risks (Firstpost, 2017) and further signalling that these banks would get a lower priority in government support in times of financial stress; this is again a concern for deposit holders.
Historically, none of the Indian PSBs have failed while more than 35 private banks have been either put under moratorium in public interest due to mismanagement or have gone out of existence in the last 40 years; merger of such weak private banks with PSBs has frequently been resorted to (Newsclick, 2018). The major challenge for the government and RBI has been the effective monitoring of the financial position of the banks, and the need for corrective actions and infusing funds on a timely manner. More importantly, weak bank financials not only steeply increase the retail depositor’s risk exposure but also draw upon the precious taxpayers’ money that eventually weakens the government’s financials. In the recent past, the government and RBI have been strengthening the risk-based supervision of scheduled commercial banks.
India has been lacking a risk-based supervisory framework for the monitoring of the scheduled commercial banks. The RBI introduced a Risk Based Supervision (RBS) framework in 2012 and which was renamed to Supervisory Program for Assessment of Risk and Capital program (SPARC). Such risk-based oversight calls for an ongoing interaction from banks and supervisors, much beyond periodic inspections. Supervision of the banks, especially in ranking them in terms of risk and stability, has been an important role of the regulators.
In spite of the strengthening regulatory oversight framework, large magnitude frauds continue to be reported in the Indian banking system, which deepens the trust deficit of the deposit holders; the ₹114 billion scam at Punjab National Bank, the second largest public sector bank in the country, has shown the Indian banking system in poor light in the eyes of the deposit holders (Livemint, 2018b). Many lenders including Axis bank, Bank of India, ICICI Bank and Yes Bank have under-reported non-performing advances, and this has been continuing for many years (Times of India, 2018). Recently, it has come to light that PMB has under-reported its non-performing assets to the tune of 73 per cent (Firstpost, 2019).
Explicit, limited, deposit insurance schemes, providing protection from loss up to some limited amount per depositor, have become the norm around the globe (Davis, 2020). In the USA, the Federal Deposit Insurance Corporation (FDIC) uses CAMELS data to develop a rating system to rank banks on their stability criteria (Affes & Kaffel, 2019). FDIC insures the deposit holders to a limited extent in the event of bank failures. FDIC, through its rating system, provides an early warning to investors, in order to draw their attention to those banks that have a great default probability ratio. Depositor insurance in India is governed by the DICGC Act, 1961, which insures deposit holders in the event of a bank failure. The total amount payable by DICGC in such an event shall not exceed ₹100,000 per depositor. In comparison to the developed and developing economies, insured limit of deposits needs to increase at least by 15-fold to cover at least 90 per cent of the fixed deposits in an Indian context (Economic Times, 2018b). Several regulations like the Financial Resolution and Deposit Insurance (FRDI) Bill, 2017, have been attempted by the government to address the depositor interest, but such regulations have failed to take off. The interest of the deposit holders is not fully secured, and bank failures are a major concern for deposit holders when they choose the banks for placing their deposits. Credit rating agencies provide guidance to the deposit holders to a certain extent.
However, virtually, every government inquiry into the 2008–2009 financial crisis has assigned some blame to credit rating agencies. Kashyap and Kovrijnykh’s (2013) study shows that inaccurate AAA credit ratings introduced risk into the US financial system and further show that rating errors are larger when the firm orders it than when investors do. Recently, India’s credit rating agencies have come under severe scrutiny over their sudden downgrading of one of the country’s largest non-banking financial institution—IL&FS. The IL&FS group, comprising over 100 companies, accounted for more than 2 per cent of all Commercial Paper issued in the country, 1 per cent of all Corporate Debentures and nearly 1 per cent of all banking system loans outstanding in the country. IL&FS’ aggregate group borrowings reached US$13 billion (Livemint, 2018a). It is pertinent to note that in the financial year 2017–2018, these borrowings raised steeply, by over 44 per cent over the last year. Until August 2018, two of India’s top credit rating agencies ICRA limited and Credit Analysis & Research (CARE) continued to award highest possible credit ratings of AAA or AA to both the long-term and the short-term borrowings, even as one of the IL&FS subsidiaries did default on its debt obligations. However, in September 2018, it came to light that IL&FS defaulted on multiple credit obligations after which the agencies drastically downgraded IL&FS rating to junk grades of D rating (The Wire, 2018). Top credit rating agencies did not comprehend the disaster as it was happening around them, which precisely is their responsibility; safeguarding the interests of the investors by setting out the financial health of the deposit issuer. Within 7 months, 50 per cent of the IL&FS group assets have been put on the block; so serious has been the inability of the rating agencies to forewarn the deteriorating financial situation of the IL&FS group (The Hindu, 2019).
Researchers attribute the failure of rating agencies to an incentive problem. Since the credit rating agencies are paid by the issuers, their interests are more likely to be aligned with the issuers than with the investors (Pagano & Volpin, 2010). In India, the ‘issuer-pays’ model is popular where the rating agencies charge the banks that issue the debt instrument, creating a conflict of interest (Livemint, 2018). Given such conflict of interests, researchers recommend that rating agencies be paid by investors rather than issuers (Pagano & Volpin, 2010). Since credit rating business is reputation based, investors demand more reliable disclosures and qualitative information from debt issuers when rating agencies lose reputation, and the ratings become less credible. Sharing more information can thus reduce information asymmetry and adverse selection in lending markets (Sethuraman, 2018). However, such additional disclosure comes at higher costs that can further worsen the financials of the banks issuing the deposits. Recent studies have shown that using machine learning techniques for establishing credit ratings can provide promising results and could be well designed to predict credit rating changes (Hsu, Chen, & Chen, 2018).
Research Gaps
Research Gap 1
Predicting bankruptcy has been extensively studied in literature (Abdou, Abd Allah, Mulkeen, Ntim, & Wang, 2017; Affes & Kaffel, 2019; Altman, 1968; Ekinci & Erdal, 2017; Halteh, Kumar, & Gepp, 2018; Kolari & Sanz, 2017; Tung et al., 2004; Jones, 2017). While earlier studies focused excessively on bankruptcy prediction, a lack of studies to classify banks based on their financial strength from the perspective of retail depositors, especially to facilitate the retail investor to make informed decisions in choosing a commercial bank for placing their savings as deposits, this is the first research gap this study identified and proposes to address. This article proposes a risk classification that will facilitate the retail investor to make wise decisions when it comes to choosing a commercial bank, among the private and public banks in India, for placing his investments as deposits.
Research Gap 2
A plethora of methodologies have been used to predict bankruptcy; multivariate discriminant analysis and Logistic regression (Jones, 2017), decision trees (DTs) and Random Forest (Halteh et al., 2018), CART and multivariate adaptive regression splines (MARS) (Affes & Kaffel, 2019). Techniques like Generic Self-organising Fuzzy Neural Network have been used (Tung et al., 2004). Non-parametric techniques of trait recognition combined with neural network methodology of self-organising maps have also been used (Kolari & Sanz, 2017). Recently, machine learning (Migliore & Chinta, 2017), ensemble learning models (random subspaces) and hybrid ensemble learning models (bagging and multi-boosting) have been used (Ekinci & Erdal, 2017) along with chi-square Automatic Interaction Detector CHAID, CART, multilayer-perceptron neural networks, discriminant analysis and logistic regression (Abdou et al., 2017).
However, many of the above models use an ‘a-priori’ classification theme based on experience and judgement in a heuristic manner. Such an approach cannot guarantee homogeneity within a cluster in a robust manner and heterogeneity between clusters apart from being highly subjective in terms of grouping. Very often, this will lead to a high variance in a cluster. This is the second research gap this study has identified. The hidden pattern in the data can be unravelled only by an unsupervised learning algorithm. By profiling the pattern, target outcome can be identified. In our case, we have used K-Means Clustering to identify distinct groups based on profiling the pattern. Having unravelled the pattern through unsupervised learning, the prediction of individual banks as to which category it belongs to involves supervised learning algorithms. Thus, we address this research gap by combining unsupervised and supervised learning to ensure homogeneity within a cluster with minimum variance.
Research Gap 3
With poor credit rating awards that do not help deposit holders, this study compares our classification of banks into distinct groups with the credit ratings awarded by the top Indian rating agencies to the deposits issued by the banks and demonstrates disparities in the credit rating awards. This is the third research gap that the study captures.
Data
Indian banking sector comprises scheduled commercial banks and cooperative banks. Scheduled commercial banks form the backbone of the Indian banking system comprising around 27 PSBs, 30 private banks, 40 foreign banks and a number of regional rural banks. Data for this article include all 44 public and private sector banks, excluding foreign banks, in operation during the 12-year period from 2005 to 2017. The 44 banks comprising 26 PSBs and 18 private banks include some of the well-known PSBs like State Bank of India and its group banks, Indian Bank, Punjab national bank, Indian Overseas Bank, Bank of India and private banks like ICICI, HDFC, Yes Bank and Kotak Mahindra Bank. The data for this study include financials drawn from the balance sheet and income statements of the banks, as well as key annual financial metrics published by RBI for the period from 2005 to 2017. The study also uses RBI reports and guidelines, press releases and reports from the ministry of finance.
Previous studies have extensively resorted to financial ratios according to the CAMELS system, comprising capital base, asset quality, management soundness, earning capacity, liquidity and sensitivity ratios. Each of these characteristics influences bank failure (Huang et al., 2012). A modified CAMELS is used by RBI and other regulators in an Indian context. Very often, bank failures are due to financial distress (Tung et al., 2004). CAMELS is a five-part banking rating system, applied to large number of banks and credit unions in the USA, used as a supervisory system by various banking regulators, including the Federal Reserve and FDIC (Affes & Kaffel, 2019). Ekinci and Erdal (2016) used 35 financial ratios according to CAMELS. Huang et al. (2012), showed that equity to assets and net interest margin had negative relationship with financial distress and were the best predicted trends in developing economies, part of the South East Asian region and European Union banks. Tung et al. (2004) also model the financial ratios based on CAMELS.
Based on the recommendations of the Padmanabhan Committee, the commercial banks incorporated in India are rated on the ‘CAMELS’ model(Capital adequacy, Asset quality, Management, Earnings, Liquidity, and Systems & control), while foreign banks’ branches operating in India are rated under the ‘CALCS’ model (Capital adequacy, Asset quality, Liquidity, Compliance, and Systems and control).The banking stability indicator (BSI) defined by RBI includes five dimensions to evaluate the risk exposure of scheduled commercial banks—capital adequacy, asset quality, profitability, liquidity and efficiency (RBI Financial Stability Report, 2018).
Based on the comparative study of various models nine key accounting ratios for various public and private sector banks for the past 12 years were selected for the study. These nine ratios in a representative manner adequately capture the essence of key dimensions from a deposit holder perspective that includes credit risk, capital adequacy, profitability, efficiency and liquidity. This data set will form the basis of the statistical analysis and model-building exercise for deriving insights. The ratios along with their definition are detailed in Table 1.
Given that our study is conducted from the point of view of deposit holders in banks, other secondary data resources have been referred to, which includes analyst research reports, business dailies and other eminent subject matter experts in the banking field. The time effect on data is not considered for this study as the ratios are used for the modelling purpose.
Financial Ratios and Definitions
Financial Ratios and Definitions
The objective of this study is to classify the 44 banks into specific risk categories in terms of financial health from the perspective of protecting the interest of retail deposit holders. In order to achieve this, the study first uses unsupervised machine learning to identify patterns and then move towards supervised learning. As the data involve nine financial ratios of several diverse banks indicating high heterogeneity, clustering is done so that homogeneity within a cluster and heterogeneity between clusters could be achieved. It is pertinent to point out that clustering into groups could have been made using experience and judgement in a heuristic manner. Such an approach cannot assure homogeneity n a robust manner within a cluster and heterogeneity between clusters. Very often, this will lead to a high variance in a cluster; our methodology overcomes this shortcoming.
First, our study uses unsupervised machine learning algorithm, namely K-Means clustering, to ensure homogeneity within a cluster with minimum variance, leading to identification of three distinct clusters. Profiling of the clusters into high-, medium- and low-risk categories provide insights, which is a key objective in unsupervised learning. We then move from unsupervised learning to supervised learning. We then confirm the three clusters in terms of predictor variables by actually classifying each record into one of the three clusters. This would pave the way for classifying and predicting a new record, given the values of the predictor variables. Correlation study is then carried out with a view to minimising the effect of multicollinearity and reducing the number of variables. The relative importance of the variables in terms of ranks is inferred from the correlation ratios.
Second, the study uses supervised machine learning method, namely LDA where we first predict the clusters in terms of explanatory power. Using Jackknife validation procedure, which drops one record at a time, builds the model with the remaining and then predicts the outcome of the omitted records, a confusion matrix is drawn upon to cross-validate the predictive accuracy. This is further cross-validated using Fischer’s discriminant function and Mahalanobis classification procedure.
Third, by splitting the data set into training data (70%) and testing data (30%), we use CART and Random Forest to build our model on the training data. As part of classification of data to different clusters, alternate approaches are explored for testing the classification performance; we use RPA which is associated with the term DT for classification and prediction. RPA creates a DT that aims to correctly classify members of the population based on several dichotomous dependent variables. We then use Random Forest technique, which is an ensemble learning method for classification, and regression. It operates by constructing a multitude of DTs using training data. Using CART and Random Forest, we then predict the clusters in terms of explanatory power as well as predictive accuracy that is critical for capital protection of deposit holders. We then establish the accuracy of our predictive model.
Fourth, we compare our classification of banks that are high, medium and low in terms of financial health with the credit ratings awarded by top Indian rating agencies to the deposits of respective banks, highlighting the disparities in the credit rating awards.
Analysis and Results
Unsupervised Learning
Basic exploratory analysis is carried out after necessary data-cleaning procedures. Descriptive statistics of the sample is in Table 2. Three clusters have been identified from the K-Means clustering algorithm that is unsupervised in nature. Banks with low ratio of gross NPA to gross advances (GNPATA), high return on assets (ROA), high ratio of net interest income to total assets (Net Interest Margin) (NIITA), high current asset ratio (CAR) and high cash deposit ratio (CDR) are banks with relatively high financial health; cluster 3 can, hence, be profiled as banks with ‘High’ financial health, followed by cluster 1 classified as banks with ‘Medium’ financial health and cluster 2 as banks with ‘Low’ financial health.
Descriptive Statistics
Descriptive Statistics
Further to the clustering analysis, the characteristics of each cluster are studied with an objective to profile such clusters. Figure 1 sets out the bar plots of the input variables. These are compared to arrive at profiling the clusters.

Table 3 sets out the mean values of the input variables for the three clusters. From the comparison of the means of the three clusters for all the nine input variables, it is evident that cluster 3 has the lowest GNPATA and ratio of net NPA to net advances (NNPATA) followed by clusters 1 and 2. Cluster 3 has high return on assets (ROA) followed by clusters 1 and 2. Cluster 3 has high NITA, followed by clusters 1 and 2. Cluster 3 has high CAR and CAR, followed by clusters 1 and 2. Cluster 3 has high credit-deposit ratio (CDR) followed by clusters 1 and 2.
Before proceeding with the model-building exercise, correlation study is carried out to minimise the effect of multicollinearity. Results of exploratory data analysis are in Table 4.
From the correlation analysis, it is evident that CAR is highly correlated to CAR I (>90%), and GNPATA is highly correlated to NNPATA (>90%). As these are derived financial ratios, we consider all the ratios except CAR and NNPATA. Hence, seven financial ratios are considered for the linear discriminant analysis.
We next move to model building for predicting group memberships to clusters. LDA, which is a proven approach, especially when the group variables are continuous, is used. LDA is a statistical technique used to classify an observation into two or more a priori groups. It is also known by the expression Multiple Discriminant Analysis (MDA). For example, in the case of bankruptcy prediction, there are two predefined groups—bankrupt and non-bankrupt firms. Classification is accomplished by a discriminant function, which is a linear combination of independent variables (Fisher, 1936). A cut-off score is derived to determine group classification for each observation. Confusion matrix that shows the misclassification error, a measure of predictive accuracy.
Mean Values of the Input Variables for the Three Clusters
Mean Values of the Input Variables for the Three Clusters
Correlation Matrix
After profiling of three clusters, one of the key requirements of the LDA is differentiation and for three clusters two discriminant functions are identified. Using the Fisher discriminant function, Table 5 sets out the output of the analysis.
We infer that the first discriminant function accounts for 82.5 per cent of information content, and the second discriminant function accounts for the balance 17.5 per cent. As seen in Table 6 (input variables and discriminant function), CART II, ROA followed by CART I are highly correlated with the two discriminant functions, indicating the relative importance of these three in terms of separating the three clusters with regard to financial health.
We then move to multivariate analysis of variance (MANOVA) and test of Hypothesis. The initial hypotheses can be stated as follows: H0: The mean of all the predictor variables are zero in other words µ1 = µ2 = µ3 = µ4 = µ5 = µ6 = µ7 across all the three clusters with 3 × 7 matrix. H1: Alternate hypothesis is at least one pair µi–µj is not equal to zero across the three clusters. Table 6 also sets out the results of MANOVA and Wilks Lamda test. All the seven independent variables are statistically significant as ( p-value is less than 0.005) in sharply differentiating the clusters namely High, Medium and Low financial health. The relative importance in terms of ranks could be inferred from correlation ratios displayed. The correlation ratio is a measure of strength of the independent variable in assessing the financial health of the clusters. The ranking of the input variables is also listed as a part of Table 6.
Discriminant Functions
For each record in the data set, scores are computed for all the three groups, using the above three classification functions. The largest of the scores determine the group membership. Table 7 also sets out a classification confusion matrix. The classification based on confusion matrix is showing accuracy of 95.5 per cent, indicating good fit of the model based on the training data. As this may do well for training data to avoid the over-fitting for the test data, Jackknife cross-validation was carried out to check the veracity of the model. The Jackknife validation procedure drops one record at a time, builds the model with the remaining and then predicts the outcome of the omitted records. The Jackknife cross-validation results interestingly compare with the confusion matrix of training data, providing very high accuracy rate (95.36%).
Discriminant Function, Correlations, Manova, Wilks Lamda and Mahalanobis Classification
Confusion Matrix
Results of the test data classifying banks into ‘high’, ‘medium’ and ‘low’, using Mahalanobis classification for 2017 are presented in Tables 8–10.
As we may find, all the banks which are classified as ‘High’, financial health is primarily from private sector, and it resonates with popular perception and published data. Popular banks like State Bank of India (SBI) and Indian bank fall under banks with medium financial health, and the list has a mix of public sector undertaking (PSU) and private sector banks. All the state bank group banks, Indian Overseas Bank, Punjab National Bank, Bank of Baroda and Corporation Bank fall under ‘Low’ financial health.
The report published by the RBI in August 2017 showed that six banks were identified for their overall performance with key focus on bad loans and has initiated PCA, which bars them from expanding their banking business such as opening branches, hiring or paying higher rates on deposits. The banks are Dena Bank, Central Bank of India, IDBI Bank, Indian Overseas Bank, Bank of Maharashtra and United Commercial Bank (UCO) Bank. Apart from Canara and Union banks, the other PSBs that might come under PCA are Andhra Bank, Punjab National Bank and Punjab & Sind Bank. These banks appear in the list of banks that are classified as ‘Low’ financial health banks, in our study (Table 10).
List of Banks with ‘High’ Financial Health
List of Banks with ‘Medium’ Financial Health
List of Banks with ‘Low’ Financial Health
As part of classification of data to different clusters, alternate approach of Random Forest and CART are used for testing the classification performance. Recursive Partitioning Algorithm (RPA) is associated with the term DT for classification and prediction. Recursive partitioning creates a DT that aims to correctly classify members of the population based on several dichotomous dependent variables. The popular algorithm used in recursive portioning is CART. When the outcome variable is categorical, Classification Tree is used, while the outcome is continuous, Regression Tree is used. In our study, the outcome variable is categorical, having three distinct clusters. Random Forest is an ensemble learning method for classification and regression. It operates by constructing a multitude of DTs using training data. The output is based on the mode of the classes (classification) or mean prediction (regression) of the individual trees. Random Forest is well known for giving high prediction accuracy though it can over-fit sometimes.
The data set was split into training (70%) and testing (30%). The next step is to implement RPA, taking all available independent variables into consideration. First, train the model with 10-fold cross-validation. The value of minsplit is considered as 30 (5% of the total no. of observations) and minbucket as 10. Minbucket is the minimum number of observations in any leaf node. The DT plot is set out in Figure 2. No pruning was done here.

The tree clearly suggests that records in cluster ‘3’ mostly have CART I greater than 0.29. As seen in Table 11, the summary of the model also provides the order of importance of the independent variables.
The model is tested on the test data set, and the results are obtained. Overall Accuracy is 87.79 per cent. Pruning of the model is carried out to improve the accuracy further. Figure 3 sets out the revised output where the tree is visualised:
Summary of Model
CART I is considered as the best predictor variable and forms the root of the tree. The other variable considered is GNPATA. Clusters ‘1’ and ‘2’ have CART I less than 0.29. The model is tested on the test data set, and the results are obtained. After pruning we see that the accuracy has increased to 89.53 per cent. Next, we build our model using Random Forest on the training set. All available independent variables are considered. The Random Forest model is set out in Table 12.
As seen in Figure 4, the Random Forest model error is plotted.
The black line shows the overall error rate, which falls below 10 per cent. The red, green and blue lines show the error rates for ‘cluster 1’(high), ‘cluster 2 (medium)’ and ‘cluster 3 (low)’, respectively. We also plotted the relative variable importance, which is set out in Figure 5.

Random Forest Model


The final step was to make the prediction, using the test data set. Results are set out in Table 13. The summary of the model performance in terms of accuracy is set out in Table 13. LDA and Random Forest produces better accuracy compared to CART model. Interestingly, all seven predictor variables are important in all the three models with slight change in the order.
Summary of Model Performance
We now compare our classification of banks into ‘High’, ‘Medium’ and ‘Low’ categories with the credit ratings awarded by the top Indian rating agencies (see Table 14). As it is seen here, there are a number of disparities in the credit rating awards that need reconciliation. For example, HDFC Bank (‘high’ financial health as per our classification), State Bank of Bikaner and Jaipur (‘low’ financial health as per our classification) and Indian Bank (‘medium’ financial health as per our classification) are all awarded AAA by the rating agency CRISIL.
Similarly, IndusInd Bank, State Bank of Travancore and Axis Bank, all from different clusters as per our classification, have been awarded AAA by the rating agency ICRA.
Comparison of our Classification of Banks with Ratings from Credit Rating Agencies
The first research gap we addressed is to classify the banks into three specific categories based on financial health, being high medium and low. The negative impact of growing non-performing assets to the economy is well documented (Balgova et al., 2016). Our study has precisely captured the banks that are under severe stress from the high level of non-performing assets; it is interesting to note that all the banks classified as having ‘Low’ financial health have a double-digit GNPATA (Table 10), whereas banks classified as having ‘High’ and ‘Medium’ financial health are in single digits (Tables 8 and 9). Falling capital adequacy is a problem of high magnitude (RBI Financial Stability Report, 2018); our classification rightly shows that the average CART I for banks classified as having ‘High’ financial health is substantially higher than for banks classified in the medium or low range (Tables 8–10). Our classification of banks from the perspective of ROA, NITA and GNPATA reflect a similar position. These go to demonstrate the appropriateness of the classification and the accuracy of the prediction model. More importantly, while analysing the composition of the three clusters, it is interesting to note that some of the large, perceptually most reputed and safest, banks in the country like State Bank of India are not clustered under the ‘high’ category, but they fall under the ‘medium’ category. Our classification is also counter-intuitive to the stock market valuations, for example, smaller banks like Karur Vysya Bank is valued by the stock market at a price to book ratio (PB) of 0.72 but finds a place in high cluster, while State Bank of India with a PB of 1.15 is positioned in the ‘medium’ cluster. Comparing individual membership of banks into our three clusters with the market valuation data (PB) shows many such anomalies. It is understandable that stock market valuations go much beyond the perspective of deposit holders and considers various other dimensions in arriving at the valuation of the bank. While past studies focused on bankruptcy predictions, our study and classification provide an authentic guidance to deposit holders in terms of the financial strength of the banks. This is our first research contribution.
The accuracy of our predictive model stems from the choice of a refined methodology, employing machine learning techniques for the classification of banks; this is our second research contribution. In the process, we also compared the predictive accuracy of techniques like discriminant analysis and Random Forest, thereby incrementally contributing to the quantitative theoretical literature. Random Forest though can quantify the relative importance of the input variables; it cannot emphatically measure the change on the outcome when an incremental change in an input variable takes place. In other words, the explanatory power of the input variables impacting the outcome cannot be precisely assessed in Random Forest. Right from the time of Altman (1968) discriminant analysis has been found to be the appropriate technique for classification problems involving financial indicators and hence for separating solvent companies from bankrupt companies. LDA of Fisher can quantify precisely the relative importance of the input variables in sharply separating the clusters and is certainly statistically more parsimonious than random forest. In our study, the difference in classification accuracy between Discriminant Analysis (95.36%) and Random Forest (95.93%) is marginal. Hence, Discriminant Analysis may be preferred to Random Forest as it has substantive explanatory power.
Our third research contribution brings out the anomaly when comparing our classification with the credit ratings awarded by reputed rating agencies to the deposits issued by the respective banks. The AAA rating awarded by CRISIL for long-term instruments are considered to have the highest degree of safety regarding timely servicing of financial obligations and such instruments are intended to carry lowest credit risk. It seems appropriate that CRISIL has assigned the AAA rating to the long-term instruments issued by HDFC Bank as our study classifies HDFC Bank as ‘high’ in terms of financial health (see Table 8). However, assigning the same rating to the long-term instruments issued by State Bank of Bikaner and Jaipur, which is classified as ‘Low’ and to Indian Bank which is classified as ‘Medium’ in our study, clearly seems to be an anomaly (Table 14). More importantly, when we compare some of the input variables with high relative importance, for three banks, the rating anomaly is fairly evident. The ROA for HDFC, Indian Bank and State Bank of Bikaner and Jaipur are 1.88 per cent, 0.67 per cent and a negative 1.22 per cent, respectively (Tables 8–10). The GNPATA for the three banks at 1.04 per cent, 7.47 per cent and 15.52 per cent, respectively, underscores our argument. The CART I for the three banks amount to 12.79 per cent, 12.20 per cent and 7.13per cent, respectively. On all the key variables of relative importance, we see that there is a wide disparity among the three banks, in spite of which we see that the long-term instruments are awarded a AAA rating, which are intended to have the highest degree of safety and timely servicing of financial obligations. Other two top rating agencies in the country— ICRA and CARE—also award very similar ratings to these three banks, which have very different financial strengths and risk profiles, as modelled in this study. Our results corroborate with RBI, which has acknowledged the weakness in the current credit rating structure, commenting that the rating agencies are intended to be forward-looking, but have always been a laggard (Economic Times, 2019). With an objective to prevent rating-shopping or any conflict of interest, it is exploring the feasibility of rating assignments being determined by the RBI itself and paid for from a fund to be created out of contributions from the banks and the RBI (Indian Express, 2017). More importantly, credit rating agencies need to deploy machine learning techniques for establishing credit ratings, which has in recent times been providing promising results; the model developed in this study could go a long way in providing a sophisticated guidance framework to the deposit holders. This is one of the most important incremental contributions of this study.
The IL&FS episode has shaken up the regulators; India’s corporate affairs secretary has highlighted that 100 of the IL&FS group companies would need a risk-based classification and would need to be categorised as high-risk or medium-risk companies (The Hindu, 2019). Our study has developed a model that precisely addresses such a situation and can be applied to any financial institution, including commercial banks. Machine learning techniques to classify banks based on their financial health can help identify on a timely manner the change in the risk profile of the banks over a period of time. This can immensely benefit the deposit holders who could be alerted for a timely reallocation of their savings to more secure banks, from banks whose financials are weakening. From a regulator’s perspective, the RBI, for instance, monitors banks on a continuous basis and periodically moves certain banks into the list of banks requiring PCA, and the vice versa (Business Line, 2019). Machine learning-based predictive frameworks could come in handy for a PCA. Going beyond deposit holders, an oversight mechanism based on machine learning framework can support the regulators to intervene on a timely manner. Especially weak financial institutions with high non-performing assets can deplete bank capital unless resolved on a timely manner (Steigum, 2011); a machine learning–driven dynamic algorithm that can support an RBS framework is the need of the hour. The regulators are precisely moving in this direction (The Hindu, 2019). Our study and the model we have proposed is an incremental contribution in this direction.
Limitations and Scope for Further Research
With more training data being available, the predictive accuracy could steeply increase over a period of time. This article does not integrate bank-level corporate governance differentials, especially the lack of separation between management and control, which is a key challenge faced by the all the Indian banks, both private sector and the PSBs. Integrating the governance differentials with the financial health of banks in the context of providing guidance to deposit holders could be an interesting area for future research. Neural Network and Support Vector Machine (SVM) as classifiers will be interesting to explore for possible improvement in predictive accuracy.
Footnotes
Declaration of Conflicting Interests
The authors declared no potential conflicts of interest with respect to the research, authorship and/or publication of this article.
Funding
The authors received no financial support for the research, authorship and/or publication of this article.
