Abstract
Financial distress prediction (FDP) has received considerable attention from both practitioners and researchers. This paper proposes a novel support vector machine (SVM) classifier ensemble framework based on earnings manipulation and fuzzy integral for FDP named SEMFI. We use financial data from the previous three years to predict companies’ current financial situation and divide the companies in each year into different categories according to whether they manipulate the earnings. Then, SVM is trained on different categories. The outputs of SVM are combined by fuzzy integral which adopts a new fuzzy measure determination method. This method considers the fact that recent financial data are more valuable for FDP. Additionally, when using the model trained by historical data for FDP, the external environment may have dramatically changed. Therefore, a fuzzy measure dynamic adjustment method is proposed by considering the confidence of each single classifier’s output, the consistency between each single classifier’s output and the diversity among classifiers. To verify the performance of SEMFI, an empirical study using real financial data is conducted. The results indicate that the introduction of earnings manipulation, the new fuzzy measure determination and dynamic adjustment method to FDP can significantly enhance the prediction performance.
Keywords
Introduction
The increasing volatility of markets signifies the importance of financial distress prediction (FDP) for companies. Financial distress can cause serious adverse effects on the development of companies. Financial distress leads to significant losses for investors and negatively affects the entire society. Therefore, FDP receives considerable attention from both companies and researchers.
Many methods have been developed to study FDP. Statistical techniques, including single ratio analysis [45], multiple discriminate analysis [11, 15] and logistic regression [4], were first used for FDP. Later, with the rapid development of artificial intelligence, neural networks [30], decision trees [17], support vector machines (SVM) [29] and case-based reasoning [20] were applied to FDP. SVM is one of the most useful tools to predict financial distress. The main concept of an SVM is to map the input vectors into a high-dimensional space using a kernel function and then find an optimal separating hyperplane to classify the samples [5]. As a relatively new machine learning method, the SVM is widely used in the field of FDP in recent years because of its unique advantages in solving the problems of small samples, nonlinear and high-dimension [16]. Sun and Hui [21] applied the SVM to FDP with Chinese listed companies. Chen and Hsiao [29] proposed a SVM method based on a genetic algorithm for their FDP model. Bae [23] developed a FDP model for the South Korean manufacturing industry based on a radical basis function (RBF) SVM. Li et al. [14] constructed a SVM model for FDP by using statistical indices of ranking-order information. All of these studies show that the SVM has satisfactory prediction performance in FDP.
Early studies mainly used single classifiers for distress prediction in the field of FDP. In recent years, a number of studies have examined the classifier ensemble approaches. Because different classifiers may have different prediction performances, classifier ensembles can utilize various single classifiers and perform better than single classification techniques [39]. A classifier ensemble first designs a number of single classifiers and then combines the outputs of different classifiers to obtain the final result. Sun and Li [22] used a weighted majority voting combination of multiple classifiers for FDP. Kim and Kang [35] proposed two neural network ensemble models by using bagging and boosting for bankruptcy prediction. Both methods are based on the assumption that there is no interaction between single classifiers. However, this assumption is invalid in many real problems [36]. The prediction accuracy of classifier ensembles may decrease if the interaction between classifiers is neglected [8]. To overcome this weakness, classifier ensembles based on fuzzy integrals have received considerable attention. The interaction between classifiers can be measured accurately by a fuzzy measure. Kim and Cho [26] presented a novel data fusion approach for web content mining using the fuzzy integration of structure adaptive SOMs. Verikas and Lipnickas [2] proposed an ensemble method based on a fuzzy integral and neural network for FDP. Abdallah et al. [3] applied a method to the problem of landmine detection by applying a fuzzy integral to different regions of the feature space. All of these studies show that a fuzzy integral outperforms other ensemble methods that do not account for the interactions between single classifiers.
A key issue in the application of a fuzzy integral is the determination of the fuzzy measure. The overall recognition accuracy of each single classifier on the training dataset is typically regarded as the fuzzy measure. Li et al. [44] proposed an ensemble model by using heterogeneous features and a fuzzy integral in which the prediction accuracy of the classifier over the training dataset was used to define the fuzzy measure. Xiong et al. [13] proposed an effective method for fault diagnosis based on the RBF neural network and a fuzzy integral in which prediction accuracy was used to define the fuzzy measure. Another commonly used method is confusion matrices. In this case, the prediction accuracy of each single classifier with respect to each class is used to determine the fuzzy measure. Pizzi and Pedrycz [37] used confusion matrices to compute the fuzzy measure and combined the classifiers with a fuzzy integral. Cao [48] applied confusion matrices to compute a prior fuzzy measure and proposed a Choquet integral-based classifier ensemble method for FDP. However, these two methods share a common problem, namely, that the computation of the fuzzy measure is time consuming. For example, if the model has n classifiers,
In fuzzy integral ensemble models, the fuzzy measure of each single classifier is typically based on its performance in the training stage. In the prediction stage, these initial fuzzy measures remain unchanged. However, the classifier that performs well in the training stage may perform poorly in the prediction stage. If the initial fuzzy measures are not adjusted according to the prediction condition, the accuracy of the ensemble model may deteriorate [7]. In many previous studies, the confidence of each single classifier’s output and the consistency between each of the two classifiers’ outputs were considered to adjust the initial fuzzy measures in the classifier ensembles. Cao [48] proposed a dynamically adjusted fuzzy integral to combine different classifiers for FDP and improved the performance of the recognition model. The performance of the classifier ensemble system depends on not only the performances of single classifiers but also the diversity among classifiers [8]. In this study, diversity is considered to adjust the proposed fuzzy measure.
Previous studies ignore an important factor for FDP: earnings manipulation. Earnings manipulation is the act of managers that alter their financial statements intentionally to maximize their own interests [38]. There are many motivations for managers to manipulate the earnings, such as the capital market motivation and meeting a regulatory threshold [24]. Previous studies have shown that earnings manipulation is a common behavior for companies [43]. Because earnings manipulation is the act of altering the financial statement, certain financial data in the financial statement may be contaminated [12]. Therefore, the financial statement of a company that manipulates its earnings may differ from that of a company that does not. When using classifier ensembles models, it is inappropriate to directly combine the outputs of single classifiers without considering the different characteristics of the companies’ financial data. However, to the best of our knowledge, earnings manipulation has not been considered in classifier ensemble models for FDP thus far.
The objective of this research is to propose a novel framework of SVM classifier ensembles based on earnings manipulation and a fuzzy integral to improve the prediction performance of FDP, called SEMFI. The contributions of this work are three-fold. First, earnings manipulation is introduced to divide companies into different categories for training by single SVM classifiers. Second, the ensemble strategies of single classifiers are studied based on different situations in which the company manipulates earnings in the previous three years. Third, a new fuzzy measure determination and dynamic adjustment method is proposed to obtain the final ensemble results.
The remainder of this paper is organized as follows. Section 2 is devoted to the methodologies, including earnings manipulation, the support vector machine and the fuzzy integral. Section 3 describes the SEMFI framework. Section 4 discusses the determination and dynamic adjustment of a fuzzy measure. Section 5 illustrates the experimental results and analysis. Section 6 summarizes the major findings and proposes future research.
Literature review
Earnings manipulation
Earnings manipulation has received considerable attention since the 1980s and has evolved into an important subfield of accounting research. According to [38], earnings manipulation is the act of managers that alter their financial statements intentionally to mislead their stakeholders regarding the real financial situation of the companies or to increase their contractual outcomes that depend on reported financial performance.
The motivations for managers to manipulate the earnings are varied [24]. Managers may want to obtain desirable stock valuations in the capital market through such manipulation. Managers can also use this manipulation to avoid a reduction of their earnings-based bonuses. In addition, in China, the managers of listed companies manipulate the earnings to meet the regulatory thresholds to avoid special treatment or being delisted.
Because of the existence of earnings manipulation, data in the financial statements will be influenced, and the financial statements may not reveal the real financial situation of the companies. This issue will cause losses to the investors and make the market unstable [12]. Hence, detecting the earnings manipulation of companies is important and necessary.
Many models have been developed to determine whether a company manipulates the earnings. This study adopts the widely used modified-Jones model, which has suitable performance [41]. This model divides the total accruals into discretionary accruals, which are used to measure the earnings manipulation of the companies, and nondiscretionary accruals. In addition, Kothari performed specific adaptations to the modified-Jones model. According to Kothari’s work, this study included an intercept term,
where
The SVM is a classification method based on the statistical learning theory. The SVM was first proffered by Vapnik in the 1990s and was subsequently rapidly developed by researchers. The principle of the SVM is based on the structural risk minimization principle [5]. The SVM is widely used in many fields, such as pattern recognition, regression estimation and FDP, because of its unique advantages in solving the problems of small samples, nonlinear, and high-dimensional pattern recognition [21, 42].
The training date should be
However, when the training set is linearly inseparable, the optimal separating hyperplane cannot classify all of the samples correctly and subsequently leads to misclassifications; therefore, the constraint optimization model is rewritten as follows:
where
For a nonlinear binary classification problem, the input vectors are mapped into a high-dimension feature space through a mapping function
where
Any kernel function that meets the Mercer’s condition corresponds to an inner product and can be used in the SVM. The most widely used kernel functions are linear kernel, Polynomial kernel, and RBF kernel.
Then, the classification hyperplane for the nonlinear binary classification problem is
Then the dual formulation of the above optimization problem is as follows:
By solving the dual formulation, we can get the optimal solution
The final decision function is
where
Classifier ensembles entail training a number of single classifiers and combining the outputs of these classifiers to obtain the final result for an unknown sample. Traditional ensemble methods, such as OWA, define the weight of each single classifier to represent its importance and to obtain the final results by combining the outputs of the classifiers. However, these methods are based on the assumption that there is no interaction between the single classifiers. However, this assumption is not realistic. In 1974, Sugeno introduced the concept of the fuzzy measure, which measured the importance of each single classifier and also clearly expressed the interaction between classifiers [34]. Because of these advantages, a fuzzy integral, which is based on a fuzzy measure, has been used as the ensemble method in many studies [2, 3]. In this section, certain basic definitions of fuzzy measures and fuzzy integrals are briefly reviewed.
Let
If
For subset
In previous studies, diverse fuzzy measures were proposed, such as the Sugeno
For a subset with a single element
As previously noted, a fuzzy integral, which is based on a fuzzy measure, plays an important role in the study of classifier ensembles. Many different fuzzy integral methods have been proposed, such as the Sugeno integral, the Choquet integral and the upper integral [49]. Among these fuzzy integral methods, the Choquet integral has been proven to be a useful tool to combine the results of single classifiers [31], and it is applied in this study.
where
Considerable research has shown that the SVM provides promising performance in classification problems [14, 23]. In the field of FDP, the SVM is widely used as a single classifier due to its excellent learning ability and generalization performance. A fuzzy integral is a strong reasoning method under conditions of uncertainty. The ensemble based on a fuzzy integral can address the interaction between classifiers, which is beyond the capabilities of many typical ensemble methods. The Choquet integral and Sugeno integral are two typical fuzzy integral methods. An important difference between the Choquet integral and Sugeno integral is that the Choquet integral uses all values of the single classifiers to compute the integral, whereas the Sugeno integral discards a considerable amount of the information. Many empirical studies have proven that the Choquet integral ensemble can utilize different single classifiers and provides satisfactory performance [2, 3, 26]. In addition, many studies indicate that earnings manipulation can lead to the distortion of financial data and thereby affect the characteristics of the financial statement, which would directly affect the accuracy of FDP. Based on the above analysis, we propose a SVM classifier ensemble framework based on earnings manipulation and the Choquet integral for FDP, called SEMFI, as shown in Fig. 1. For a specific FDP process, the new framework involves three steps.
SEMFI framework.
In step 1, an initial dataset, which consists of distressed companies and normal companies, is prepared as samples, and appropriate financial indicators are selected for FDP. When predicting the financial situation of a company in year
In step 2, an earnings manipulation determination is conducted to divide each dataset into two subsets. Thus, each dataset
In step 3, the Choquet integral is applied to combine the outputs of single classifiers obtained in step 2 to obtain the final result. For a specific company, there are two strategies to combine the results of single classifiers in step 2 to obtain the final classification results based on whether the company manipulates earnings in years
If a company manipulates the earnings in all three years (i.e.,
Classifier ensemble procedure of strategy 1. Classifier ensemble procedure of strategy 2. If a company manipulates the earnings in one of the three years, for example, in year


With the extension of the look back period, for example, 4 or 5 years, the number of possible combination will increase. For each combination, corresponding ensemble strategy can be designed. Because the older data are less useful for evaluating the current financial situation of the companies, few studies use the data of more than four years.
Before applying the Choquet integral to combine the outputs of single classifiers, the fuzzy measure of each classifiers set must be determined. In the classifier ensemble process, a fuzzy measure can be viewed as the degree of importance of the classifier set. To improve the accuracy of classifier ensembles based on a fuzzy integral, it is necessary to determine the fuzzy measure appropriately. In this study, nonlinear programming that attempts to minimize the classification error of an ensemble is applied to determine the fuzzy measure. Furthermore, it considers the value of financial data in different years for evaluating company’s financial situation.
Grabisch and Nicolas [32] proposed a minimum classification error approach that is widely used for the determination of the fuzzy measure. They applied a mean square error (MSE) cost function to compute the cost of ensemble misclassification. Let
where
This function has a disadvantage, namely, that the prediction performance is sensitive to the desired outputs, and it is difficult to determine the ideal value of the desired outputs [1]. In this study, we apply the cost function based on the difference between the probabilities of each sample belonging to different classes, which removes the need to set desired outputs. These differences are called the dissimilarity measures.
Suppose that the actual class of
where
In Eq. 14, the parameter
In this function, the losses of correctly classified samples are zero, and the losses of incorrectly classified samples are positive. Thus, samples that are solely misclassified are calculated. For a sample, the dissimilarity measure will increase when the possibility of being misclassified is increased; thus, the loss will increase. With this loss function and dissimilar measure, we use the following cost function to compute the cost of the classification error of ensemble for
Financial data have a common feature, namely, that recent data are more useful for evaluating the current financial situation of the companies compared with older data. Thus, we should assign a higher importance to single classifiers trained on the recent data in the fuzzy integral ensemble. In other words, it is reasonable to provide higher fuzzy measures to single classifiers trained on recent data. Suppose that classifier
Then, we follow nonlinear programming:
The fuzzy measure of each classifier set can be obtained by solving the above nonlinear programming.
For FDP, the external environment may significantly change when using the model trained by historical data to predict the current financial situation of companies. If the initial fuzzy measure obtained in the training stage is fixed in the prediction stage, the prediction performance of the model may decline. Therefore, we should adjust the initial fuzzy measure obtained in the training stage when using the SEMFI shown in Fig. 1 to predict the financial situation of companies.
Two aspects must be considered when adjusting the fuzzy measure: the confidence of each single classifier’s output, which is represented by a confidence correction coefficient,
Suppose that
For each sample
where
The final fuzzy measure
However, the performance of the classifier ensemble system depends on not only the performances of single classifiers but also the diversity among classifiers [8]. When classifier
Given
where
According to the method applied by Si et al. [28], the consistency correction coefficient
Thus, the final fuzzy measure
where
The dataset and indicators
Listed manufacturing companies on the Chinese stock exchange are selected to verify the prediction performance of SEMFI in this paper. To protect the investors and the market, the China Securities Regulatory Commission implements special treatment mechanisms for listed companies in poor financial situations. Consistent with most previous research methods for the FDP of Chinese listed companies, in this paper, we regard the special treatment companies as distressed companies and regard the non-special treatment companies as normal companies [21, 22]. According to [46], special treatment companies include the companies that are labeled ST, *ST or S*ST: companies that have negative net profits for two consecutive years are labeled ST; companies that suffer losses for three consecutive years are labeled *ST; and companies that suffer losses for four consecutive years are labeled S*ST. S*ST companies have the worst financial situations, and *ST companies have a worse financial situation than ST companies. The 116 manufacturing companies, which were determined to be ST, *ST or S*ST for the first time during the 2007–2013 period, are selected as the distressed companies. In addition, 116 manufacturing companies, which are not labeled ST, *ST or S*ST during the same period, are selected as the normal companies to match the distressed companies by the ratio of 1:1.
Among the 232 manufacturing companies, 79 distressed companies and 79 normal companies are randomly selected for training data, and the remaining 37 distressed companies and 37 normal companies are regarded as testing data. Financial data in years
The financial indicator system is an important aspect of FDP and has a considerable impact on the performance of the FDP model. Financial indicators are typically divided into many categories, such as profitability, liability and market value, and relevant indicators are selected from these categories for FDP by the researchers [19, 47]. To comprehensively summarize the financial situation of companies and consider the particularity of the Chinese stock market and accounting system, 32 financial indicators from 5 categories are selected to develop the indicator system. The indicator system is shown in Table 1.
Financial indicator system
Financial indicator system
First, we design three SVM classifiers that are trained on financial data in years
To test the necessity of considering earnings manipulation in this paper, the prediction results are split into two situations. Situation 1 considers earnings manipulation, and Situation 2 does not consider earnings manipulation. The classification procedure of the SVM classifiers in situation 1 and situation 2 are shown in Figs 4 and 5, respectively. The prediction results are presented in Table 2. The overall accuracy, Type I error and Type II error are used to measure the prediction performances of the SVM classifiers. The Type I error denotes the percentage of normal companies misclassified as distressed companies, whereas the Type II error denotes the percentage of distressed companies misclassified as normal companies.
Classification procedure of the SVM classifier in situation 1. Classification procedure of the SVM classifier in situation 2. The classification procedure of classifiers ensemble in situation 2.


Table 2 illustrates that the overall accuracy of SVM
Prediction results of SVM classifiers with a sample ratio of 1:1
Then, we compare the proposed framework with other seven classification methods. PAFI and CMFI denote fuzzy integral ensemble methods in which the fuzzy measure determination methods are based on the overall prediction accuracy and confusion matrices, respectively. OWA and DS denote classifier ensembles methods based on ordered weighted averaging and the Dempster-Shafer evidence theory, respectively. The Dempster-Shafer evidence theory, which was first proposed by Dempster and developed by Shafer, is an effective information combination method [50]. The logit, BP neural network (BPNN) and SVM classifiers are single classifiers that are widely used in FDP. Similar to previous studies, we use financial data in year
To test the necessity of considering earnings manipulation in this paper, we also provide the prediction results of SEMFI and the above seven classification methods in two situations. In situation 1, the classification procedure of PAFI, CMFI, OWA and DS are similar to SEMFI, as shown in Fig. 1. And the classification procedures of three kinds of single classifiers are similar to Fig. 4. In situation 2, the classification procedures of PAFI, CMFI, OWA, DS and SEMFI are shown in Fig. 6. And the classification procedures of three kinds of single classifiers are similar to Fig. 5. The results are shown in Table 3.
Prediction results of SEMFI and seven classification methods on the sample ratio of 1:1
Table 3 shows that SEMFI has satisfactory performance. Using the results in situation 1 for example, SEMFI has a higher overall accuracy (90.5%) and a lower Type I error (2.7%) and Type II error (16.2%) than the two typical fuzzy measure determination methods, i.e., PAFI (85.1%, 8.1% and 21.6%, respectively) and CMFI (83.8%, 8.1% and 24.3%, respectively). The overall accuracy of SEMFI is also higher than those of OWA (85.1%) and DS (87.8%); in addition, the Type I error, Type II error of SEMFI are lower than those of OWA (8.1% and 21.6%, respectively) and DS (5.9% and 20.6%, respectively). SEMFI has significant advantages compared with the logit, BPNN and SVM classifiers in terms of prediction performance. For example, although the SVM performs best among the three single classifiers, the overall accuracy of the SVM is 8.1% lower than that of SEMFI, and the Type I error, Type II error of the SVM are 10.8% and 5.4% higher, respectively, than those of SEMFI. The comparison results in situation 2 are similar to those in situation 1. These results indicate that SEMFI has better performance, regardless of whether earnings manipulation is considered.
Table 3 also reveals that SEMFI and other seven classification methods perform better when considering earnings manipulation, as shown in Figs 7 and 8. Compared with situation 2, in situation 1, the average overall accuracy is 3.7% higher, and the average Type I error and average Type II error are 3.7% and 3.2% lower, respectively.
Overall accuracy comparison with a sample ratio of 1:1. Type II error comparison with a sample ratio of 1:1.

The ratio of distressed to normal companies selected in the above experiment is 1:1. However, normal companies far surpass distressed companies in the real world. Therefore, an experiment with unbalanced samples is conducted in this study to further validate the effectiveness of the proposed framework. Furthermore, to evaluate the real prediction performance of SEMFI, the sample balance techniques, such as sample weighting and data resampling [25, 27], are not used in this experiment. A total of 116 distressed companies and 464 normal companies are randomly selected as the samples with a ratio of 1:4 during the 2007–2013 period. Among the 580 sampled companies, 79 distressed companies and 316 normal companies are randomly selected as training data, whereas the remaining 37 distressed companies and 148 normal companies are regarded as testing data.
The prediction results of the SVM classifiers, which use financial data in years
Prediction results of SVM classifiers with a sample ratio of 1:4
Prediction results of SEMFI and seven classification methods with a sample ratio of 1:4
The statistical significances of the difference between two situations regarding the overall accuracy, Type I error and Type II error of SEMFI and the other seven methods are further assessed by using the Wilcoxon signed ranks test. The Wilcoxon test is a nonparametric test method that is used when overall distribution is unknown. The Wilcoxon test results of overall accuracy, Type I error, and Type II error in the two situations, i.e., considering earnings manipulation and not considering earnings manipulation, are shown in Table 6.
Statistical comparison using the Wilcoxon test
When the ratio of distressed companies to normal companies is 1:1, the value of the
When the ratio of distressed companies to normal companies is 1:4, the value of the
Overall accuracy comparison with a sample ratio of 1:4. Type II error comparison with a sample ratio of 1:4.

This research proposes a novel SVM classifier ensemble framework based on earnings manipulation and a fuzzy integral for FDP called SEMFI. The financial data in the previous three years are used to predict the current financial situation of companies. The companies in each year are divided into two different categories according to whether the companies manipulate the earnings. Then, single SVM classifiers are trained on different categories, and a fuzzy integral is applied as the ensemble method. A new fuzzy measure determination method is proposed to improve the performance of the fuzzy integral. In this method, an important feature (i.e., the recent data are more valuable for evaluating the current financial situation of the companies) of companies’ financial data is considered. Furthermore, considering that the external environment may have significantly changed when using the model trained by historical data to predict the current financial situation of companies, we propose a fuzzy measure dynamic adjustment method that considers the confidence of each single classifier’s output, the consistency between each of the two single classifiers’ outputs and the diversity among classifiers. An experimental study that compares the performance of SEMFI with three single classifiers and the other four ensemble methods is conducted on the real data of Chinese listed companies. Two cases are considered, with the ratios of distressed companies to normal companies are balanced (1:1) and unbalanced (1:4). From the experimental results, SEMFI obtains the highest overall accuracy and lowest Type I error and Type II error compared with the single classifiers and the other ensemble methods. The results also show that for SEMFI and the other seven methods, considering earnings manipulation can significantly improve the overall accuracy and reduce Type I and Type II errors in statistical terms, regardless of whether the dataset is balanced. Future research will focus on examining the performance of the heterogeneous classifier ensembles, such as combining the logit, BPNN and SVM classifiers using a fuzzy integral based on the new fuzzy measure determination method for FDP.
Footnotes
Acknowledgments
This research was supported by NSFC Grant (No. 71201024) and NSFC Grant (No. 71671038).
