Abstract
Because of Covid-19 effect economic stability has been shaken and unemployment rates have increased in recent years, predicting financial conditions of consumers and providing a credit rating has become more important. So, to overcome these problems, Coot-Sine Cosine Algorithm (Coot-SCA_NN) is aimed to predict financial distress in federated learning (FL). Proposed method involves different entities, i.e., nodes and server. Process included in the proposed framework includes local training based on local data at each node, updates to server, model aggregation at server, and download of global model at the nodes, update training based on the downloaded global model and local model at every epoch of iteration. Here, in training model, input data is taken from dataset and then data augmentation model is carried out based on the mutual information. Finally, financial distress is predicted utilizing a neural network (NN) that is trained by the proposed optimization algorithm named Coot-SCA. At each node the information is processed and the details are recorded on the server for further processing. Proposed Coot-SCA_NN is derived by combining the Coot algorithm with Sine Cosine Algorithm (SCA). A major contribution is made through local renewal and consolidation of the service performed by replacing CAViaR. The performance of the proposed method is analyzed using Financial Distress Prediction dataset and the proposed method has the accuracy of 0.924, MSE of 0.083, RMSE of 0.289, Loss of 0.079, and MAP of 0.910.
Introduction
Studies show, because of bad financial structures, financial distress occurs. A firm's financial leverage increases its agency costs and makes top managers more likely to make high-risk decisions, 1 and poor financial structures are generally inversely related to corporate performance.2,3 Financial distress occurs when a company suffers from a threat from the outward economic environment or a failure of inward financial decisions, causing it to struggle with insufficient cash flow, or worse, it may lose profits or go bankrupt. Many companies suffered heavy losses and went bankrupt during the global financial crisis that exploded in 2008. A company without a financial risk early warning mechanism cannot properly handle financial difficulties and shocks when the external economic environment deteriorates. To some extent, a well-developed internal control mechanism of a company should include a scientific financial distress prediction (FDP) system, which can indicate financial collapse before a financial crisis occurs. 4 Bankruptcy prediction, is commonly referred to as business failure prediction, financial distress prediction plays a vital role for decision makers like investors, lenders and company managers. Contrarily, managers must determine company's risk while developing a development plan for company. Alternately, lenders or investors need to evaluate the company's financial position, liquidity, solvency and profitability before making a decision on credit rating or investment. 5
As the core of corporate operations, efficient financial management can predict potential financial crisis. 6 Because potential financial vulnerabilities or risks revealed by financial distress forecasting, investors’ decision-making can be improved, help financial institutions in their lending decisions, and assist regulatory authorities formulate regulations. If competent authorities, corporate governance bodies, CPAs, auditors and senior management can identify risk warnings or problems as soon as possible, relevant actions can be taken to prevent suffering or deterioration; Therefore, effective financial crisis prediction is very important.3,7–10 FL enables target model building without assigning local information by device model training on edge resources and also referred by a newly created privacy-preserving distributed ML scheme. By building a global learning model based on local models of distributed agents, FL-based application to predict customers’ financial problems is solved. By network agents utilizing their device data and local resources, Local models are created. FL concept is used because the learning strategy does not need to share any data with the server or any other agent that ensures the protection of clients’ sensitive data. 11
To address data privacy 12 issues in machine learning (ML),13,14 Google introduced a privacy-preserving machine learning framework called FL,15,16 which allows multiple parties to jointly train a model without sharing data. The model is trained using their local dataset and a copy of model is maintained by each party and only gradients are sent to other parties at every training epoch. So, from these parameters parties cannot derive raw data and for parties with similar data this method is useful i.e., data with the similar feature space. In different parties, data stored with feature space is not always similar. The emergence of this technology solves the conflict between data privacy and data sharing for distributed devices. FL is suitable for use when data is privacy sensitive due to the nature of data not being exposed to a third central server. Developing an efficient FL method for heterogeneous data remains challenging. 17 Investigating powerful financial distress forecasting (FDP) models has always been a necessary research topic attracting academics and practitioners. In the twentieth century, some classical statistical FDP models such as the Z-score model, 18 the zeta-score model, the logistic regression analysis (logit) model, and the probit model were developed, which laid the foundation for further research on FDP modeling. With the development of artificial intelligence, a large number of researchers have devoted themselves to developing more effective FDP models based on artificial intelligence technologies, such as decision trees (DTs), neural networks (NNs), 19 and support vector machines (SVMs), case-based reasoning (CBR), and genetic algorithms (GAs). 20
Fundamental aim of this paper is to propose Coot-SCA_NN for predicting financial distress. The proposed method includes various entities such as node and server. The process included in the proposed framework is local training based on each node's local data, server updates, model aggregation at the server, global model download at the nodes, update training based on the downloaded global model and local model, and at every epoch of iteration. In this training model, input data is taken from dataset and a data augmentation model is implemented based on mutual information. Finally, financial distress is predicted using a NN trained by the proposed Coot-SCA optimization algorithm. Data is processed at each node and the data is stored on the server for further processing. Proposed Coot-SCA_NN is obtained by unifying Coot algorithm with SCA. A major contribution is made through local renewal and consolidation of the service performed by replacing CAViaR.
A important role in this paper is given as follows,
Remaining segment of this paper is structured as: reviewed research papers are given in Section 2, the proposed financial crisis prediction techniques are mentioned in Section 3, and obtained outcome for Coot-SCA_NN is given in Section 4 and conclusion is explained in section 5.
Motivation
In a company, bad decisions regarding pricing or marketing can lead to financial distress. Expensive advertising campaign or ineffectual modifications to a product or pricing structure are other possible causes of a financial crisis that can lead to a loss in sales. Therefore, the prediction of financial crisis is important and it is a challenging process. Hence a useful model is proposed in this paper.
Literature survey
To predict financial problems of customer, Imteaj, A. and Amini, M.H. et al. 11 introduced FL to predict customers financial distress. This method achieved rapid convergence. However, it failed to identify that an agent is malicious or not. Liu, J. et al. 5 designed a GA-based gradient boosting method to parameter selection in gradient boosting decision tree and integrated network-based variables for financial distress prediction. The developed model was robustness. However, network-based variables can improve the classifier prediction performance relating to accuracy was not explored. Sun, J. et al. 20 introduced Support Vector Machine (SVM). SVM-based multiclass financial distress prediction (FDP) merged with decomposition and fusion methods. This method greatly improved multiclass performance. However, it was extremely hard to predict by human expertise. Sun, J. et al. 4 designed Adaboost-SVM ensemble learning for dynamic FTP. This method greatly improved the recognition ability of minority financial crisis models. But it failed to solve class asymmetric transient classification problems. Jan, C.L. 3 introduced deep learning (DL) to develop high accuracy and effective financial crisis forecasting models. This method produced more accurate and useful results. Outcome of this study failed to apply in all situations because of different economic and financial environments and financial regulations of diverse countries, regions or economies. Du, X. et al. 21 designed clustering-based under-sampling (CUS) with Gradient Boosting Decision Tree (GBDT) to improve the scope and depth of clustering to achieve better predictive performance. With regard to both overall performance and recognition efficiency, this method outperformed benchmark models. Due to class-imbalanced financial datasets, there is further need to explore a better ensemble method. El-Bannany, M. et al. 22 introduced MLP + LSTM + CNN for financial Distress prediction. This method was very strong. However, it encountered problems when increasing the number of input attributes. El Bannany, M. et al. 23 introduced a multi-layer perceptron with parameter optimization for financial distress forecasting. Due to standard ML methods, like SVM and DT, this method outperformed. However, it failed to include macroeconomic and other industrial factors.
Challenges
The challenges faced during financial distress are labeled as follows,
In,
11
a mechanism for detecting malicious agents was integrated and fairness is implemented considering the agent's contribution to the global model convergence. This method outperformed the centralized model and the state-of-the-art FL model, mostly in non-IID settings. It failed to incorporate a mechanism to identify malicious agents and execute fairness by examining the agent's contributions regarding global model convergence. The method developed in
11
cannot detect whether an agent is malicious or not, and did not guarantee fairness when assigning computational tasks during training. Different feature selection procedures may be intended for diverse decomposition and fusion approaches. In addition, the decomposition and fusion methods can also be integrated with other classification algorithms to propose more effective multiclass FDP models.
20
The method developed in
4
failed to address other issues such as default detection, customer classification, and spam filtering. Also, it failed to explore a dynamic model weighting mechanism for integration with ensemble learning algorithms, e.g., Random space and Bagging. Tough, extended financial distress can eventually cause bankruptcy. When a financial crisis occurs, it should be considered to right away so that the situation does not worsen. Financial problems can cause more financial problems if they are not fixed immediately
Proposed coot-SCA for financial distress prediction
Fundamental reason of this research is to devise a method for financial crisis prediction using proposed Coot-Sine Cosine Algorithm based neural network (Coot-SCA_NN). The proposed method involves different entities, i.e., nodes and server. Process included in the proposed framework includes local training based on local data at each node, updates to server, model aggregation at server, downloads of global model at the nodes, update training based on downloaded global model and local model, and executes each time. Here, in the training model, the input data is taken from the dataset and then the data augmentation model is carried out based on the mutual information. 24 Finally, financial distress is predicted using NN, 25 which is trained by the proposed optimization algorithm named Coot-SCA_NN. At each node the information is processed and the details are recorded on the server for further processing. The proposed Coot-SCA_NN is obtained by integrating the Coot algorithm 26 with the SCA. 27 A major contribution is made through local renewal and consolidation of the service performed by replacing CAViaR. 28 Block diagram of Coot-SCA_NN method is demonstrated in Figure 1.

Block diagram of coot-SCA_NN method.
Local training data based on local data at every node, training models, training architecture are discussed in the below section.
Training at every node
Data is trained in every device node for every
Training models
In training model, tuning Ride NN for financial distress prediction and tuned Ride NN is trained utilizing the proposed Coot-SCA algorithm are discussed in below section,
Data acquisition
Consider dataset as S, which encloses p quantity of input data and exhibited as
Herein, p denotes total number of data and Financial distress prediction
Financial distress prediction is always significant for financial institutions to evaluate the financial health of companies and individuals. Bankruptcy prediction and credit scoring are two significant problems in financial distress prediction, where various statistical and ML methods have been utilized to develop financial prediction models. For decision makers like investors, lenders and company managers, financial distress prediction plays a crucial role. Contrastingly, even if developing a development plan for the company, company's risk must be accessed by the managers. Alternatively, company's financial position like liquidity, solvency and profitability are needed to be evaluated by the lenders or investors before taking credit rating or investment decisions. The Ride NN architecture and the prediction of financial distress applied by the proposed hybrid Coot-SCA algorithm are discussed below. Here, the input fed to of NN for financial distress prediction is Ride NN architecture
Input layer, hidden layer and outcome layers are the three layers in the RideNN classifier and each layer contains neurons. In different layers, by linking outcome of one neuron to input of another neuron, a network is constructed. Given input to RideNN classifier is listed below,
To have a bias and generate a potential is assumed by hidden layer neurons.
Here, using mutual information loss function is computed and it is fundamental measure of relationship among random variables. Mutually informative two random variables Tuning Ride NN using Coot-SCA

Architecture of rideNN.
In this section the tuned neural network is trained by the intended optimization algorithm known as Coot-SCA. This is derived by incorporating Coot algorithm with SCA. The principle of the
The COOT optimization algorithm is on the basis of diverse movement behaviors of coot flocks on water surface. Coots are tiny waterfowl that exhibit several distinct group behaviors at water surface, with the final aim of the behavior being to move near food or a precise location. On water surface, coot group mostly has four diverse movement behaviors: random movement, chain movement, position adjustment according to the leader, and leader movement. The Sine Cosine Algorithm (SCA) is a population-based optimization algorithm for solving multiple optimization problems. SCA produce diverse initial random solutions and moves near the best solution utilizing a mathematical model on the basis of sine and cosine functions. Diverse stochastic and adaptive variables are incorporated into this algorithm to preserve exploration and exploitation of the search space at diverse milestones of the optimization.
The main two approaches of SCA (Population Search Strategy and Local Search Strategy) are provided to expand an intelligent algorithm suitable for efficient search through two main search strategies (Global Search and Local Exploration). The optimization process include two phases: exploration and exploitation phase.
Solution encoding
Solution encoding is employed to find simplest solution in a
Wherein,
Fitness function
It is used for computation of best solution. It is the difference between the target value and the output of ride NN. From the previous iteration the optimal solution is determined as each solution to achieve a best location.
Step 1: initialization
Consider input data Y in
Step 2: exploration phase
There are gradual modifications in the stochastic solutions, and the stochastic variations are significantly lower than in the exploration phase.
In exploration phase, random variations are significantly lower.
Assume,
herein,
From Coot,
Substituting the Equation (18) in Equation (12),
Updated solution is expressed as follows,
Step 3: exploitation phase
In random solution there are some gradual changes in exploitation phase.
Equation (9) and equation (26) are combined as follows,
In the interval
Search the solutions outside the space among their correlated destination and also for exploring the search space. In interval
When ranges of sine and cosine functions are in
Step 4: termination
Termination process is repeated until the best solution arises. By default, when the iteration counter is higher than greater number of iterations, the optimization process terminated. Pseudo code for Coot-SCA is mentioned in algorithm 1.
Trained data are sent to the global model and nodes are aggregated at the server. Aggregate weight from different local training data and using CAViar model weights are averaged in aggregation at the server. Figure 3 represents aggregation at the server.

Aggregation at the server.
Apply caviar,
Applying global trained model at every local node and at last, averaged weights are upgraded to server and every local node (devices).
Results and discussion
Experimental outcome of Coot-SCA_NN method for financial distress prediction is examined in this part.
Experimental setup
Using python tool, Coot-SCA_NN method for financial distress prediction is successfully executed.
Dataset description
Data visualization
SHAP (SHapley Additive Explanations) is an explainability framework for machine learning models that helps interpret how individual features contribute to a model's predictions. It is based on Shapley values, a concept from cooperative game theory, which assigns a fair contribution score to each feature in a model's decision-making process. Figure 4 presents the SHAP visualization to show the impact of each input feature on the model's predictions. Figure 4(a) shows the SHAP heatmap that visualizes the impact of different features on a machine learning model's prediction across multiple instances. Figure 4(b) is a SHAP feature importance plot, which visualizes the impact of individual features on the model's predictions. Figure 4(c) is a SHAP summary plot that illustrates the mean absolute SHAP values for different features, showing their average impact on model predictions. Figure 4(d) is a SHAP summary dependence plot, which shows how different features contribute to the model's output, highlighting their impact and interactions. Figure 4(e) explains how individual features contribute to a specific prediction. It shows how the base value (expected model output) is adjusted by each feature's SHAP value to arrive at the final prediction.

SHAP visualization.
Coot-SCA_NN method for financial distress prediction has five performance measures like accuracy, Loss Function, MSE, RMSE, and MAP are listed below.
Accuracy
Accuracy is utilized to estimate the ability of a categorical decision and referred to as performance measure.
MSE computes error by squaring differences among predicted and actual value and on the dataset averaging it. Expression for MSE is explained in equation (7).
RMSE
RMSE is utilized in ML and to calculate accuracy of a predictive model. It is a popular metric and computes the differences among the predicted and actual values, squaring errors, takes mean, and then finds square root.
Loss function is a function, which computes loss for one data point.
It calculates by finding (AP) average precision for each class and average over a number of classes.
Performance analysis
Performance analysis of devised methodology using time step is discussed in Figure 5. Analysis of Coot-SCA_NN regarding accuracy is displayed in Figure 5(a). When time step = 20, accuracy achieved by the proposed method is 0.848, 0.852, 0.856, 0.860 and 0.865 for epochs 20, 40, 60, 80, and 100, respectively. For time step = 80, the proposed method has the accuracy of 0.894, 0.905, 0.910, 0.913, and 0.921 for epochs 20, 40, 60, 80, and 100, respectively. The estimation of new methodology in relation to MSE is discussed in Figure 5(b). When time step = 20, 0.392, 0.358, 0.312, 0.288 and 0.249 are the MSE values obtained by the proposed method for epochs 20, 40, 60, 80, and 100, respectively. For time step = 80, intended method obtained MSE of 0.235, 0.186, 0.170, 0.130, and 0.093 for epochs 20, 40, 60, 80, and 100, respectively. Assessment of intended method with respect to RMSE is demonstrated in Figure 5(c). The RMSE values attained by proposed method for epochs 20, 40, 60, 80, and 100 is 0.626, 0.598, 0.559, 0.537, and 0.499, respectively. Performance analysis with respect to loss function is displayed in Figure 5(d). For time step = 20, loss values of the proposed method with epoch 20 is 0.152, epoch 40 is 0.148, epoch 60 is 0.144, epoch 80 is 0.140, and epoch 1000 is 0.135. The loss value decreases when the time step increases. Figure 5(e) shows the analysis based on MAP. When time step = 20, the MAP values attained by the proposed method is 0.822, 0.770, 0.786, 0.792, and 0.813 for epochs 20, 40, 60, 80, and 100, respectively.

Performance analysis with time step, a) accuracy b) MSE c) RMSE d) Loss e) MAP.
This section presents the comparative analysis based on time step and cross validation. The comparative methods used for analysis are FL,
11
GA-based gradient boosting,
5
OVO-SVM,
20
DNN + CNN,
3
Decision Tree (DT),
31
Logistic Regression (LR),
31
and Random Forest (RF).
32
Comparative Analysis based on time step
Figure 6 shows comparative analysis of Coot-SCA_NN. In Figure 6(a), the evaluation of proposed method referring to accuracy is presented. Accuracy values achieved by FL is 0.785, GA-based gradient boosting is 0.804, OVO-SVM is 0.821, DT is 0.837, DNN + CNN is 0.843, LR is 0.850, RF is 0.854, and the proposed methodology is 0.865, when time step =20. While time step = 80, intended method achieved accuracy of 0.921 and traditional methods achieved an accuracy of 0.849, 0.864, 0.879, 0.898, 0.901, 0.905, and 0.913. The analysis of new methodology regarding MSE is displayed in Figure 6(b). Existing methods and proposed method reached MSE values of 0.648, 0.522, 0.463, 0.410, 0.365, 0.348, 0.299, and 0.249, when time step = 20. The MSE values attained by devised methodology about 0.093 and classic methods about 0.440, 0.361, 0.270, 0.218, 0.160, 0.138, and 0.110 for time step = 80. Evaluation of Coot-SCA_NN in relation to RMSE is mentioned in Figure 6(c). Traditional method and proposed method achieved RMSE of 0.805, 0.722, 0.681, 0.640, 0.604, 0.590, 0.547, and 0.499 for time step = 20. RMSE values attained by FL is 0.663, GA-based gradient boosting is 0.601, OVO-SVM is 0.519, DT is 0.467, DNN + CNN is 0.400, LR is 0.371, RF is 0.331, and Coot-SCA_NN is 0.306, while time step = 80. Figure 6(d) displays the estimation of intended method regarding loss function. When time step = 20, traditional methods, such as FL, GA-based gradient boosting, OVO-SVM, DT, DNN + CNN, LR, RF, and the proposed Coot-SCA_NN have the loss values of 0.215, 0.196, 0.179, 0.163, 0.157, 0.150, 0.146, and 0.135, respectively. For time step = 80, devised methodology reached loss value of 0.079 and traditional methods, such as FL, GA-based gradient boosting, OVO-SVM, DT, DNN + CNN, LR, and RF reached loss value of 0.151, 0.136, 0.121, 0.102, 0.099, 0.095, and 0.087, respectively. Assessment of Coot-SCA_NN relative to MAP is shown in Figure 6(e). For time step = 20, the MAP obtained by intended method is 0.822 and traditional methods is 0.749, 0.763, 0.789, 0.798, 0.800, 0.815, and 0.819. Similarly, for time step = 80, the MAP values attained by proposed method is 0.897 and FL, GA-based gradient boosting, OVO-SVM, DT, DNN + CNN, LR, and RF is 0.759, 0.794, 0.821, 0.854, 0.861, 0.869, and 0.877, respectively.
Comparative Analysis based on cross validation

Comparative analysis with time step, a) accuracy b) MSE c) RMSE d) Loss e) MAP.
Figure 7 shows the comparative analysis based on cross validation. Figure 7(a) shows the analysis based on accuracy. For K-value = 5, the accuracy obtained by the proposed method is 0.890, on the other hand, the accuracy obtained by the existing methods, such as FL, GA-based gradient boosting, OVO-SVM, DT, DNN + CNN, LR, and RF is 0.802, 0.810, 0.850, 0.860, 0.863, 0.877, and 0.884, respectively. The accuracy increases with the increase in the K-value. The analysis based on MSE is given in Figure 7(b). The MSE of the Coot-SCA_NN, FL, GA-based gradient boosting, OVO-SVM, DT, DNN + CNN, LR, and RF is 0.126, 0.352, 0.308, 0.199, 0.189, 0.157, 0.147, and 0.133, respectively, when K-value is 6. Similarly, the performance of the comparative methods considering RMSE is shown in Figure 7(c). When K-value is 8, the RMSE of the comparative methods, such as FL, GA-based gradient boosting, OVO-SVM, DT, DNN + CNN, Coot-SCA_NN, LR, and RF is 0.554, 0.502, 0.387, 0.371, 0.341, 0.330, 0.324, and 0.289, respectively. The comparative analysis based on loss is given in Figure 7(d). For K-value = 7, the loss of FL is 0.187, GA-based gradient boosting is 0.154, OVO-SVM is 0.127, DT is 0.120, DNN + CNN is 0.114, LR is 0.106, RF is 0.101, and Coot-SCA_NN is 0.092. Figure 7(e) shows the MAP values of the proposed method and existing methods for different K-values. When K-value is 6, the MAP of the proposed Coot-SCA_NN is 0.868, while the MAP of FL is 0.732, GA-based gradient boosting is 0.766, OVO-SVM is 0.826, DT is 0.837, and DNN + CNN is 0.846, LR is 0.854, RF is 0.859.

Comparative analysis based on cross validation, a) accuracy b) MSE c) RMSE d) Loss e) MAP.
Discussion based on time step = 80 and K-value = 8
Comparative discussion is displayed in Table 1. Here, the discussion is provided based on the best performance (time step = 80 and K-value = 8) attained by the comparative methods. On seeing Table 1, it is noted that the proposed method offers the high performance than the existing methods with the accuracy of 0.924, MSE of 0.083, RMSE of 0.289, Loss of 0.086, and MAP of 0.910 for K-value = 8.
Comparative discussion.
Statistical Analysis
Statistical analysis is a tool for analyzing raw, unstructured data to extract meaningful insights and draw reliable conclusions. The Analysis of Variance (ANOVA) test is a statistical method used to assess whether significant differences exist between the means of two or more groups. It assumes two hypotheses, (1) The means of all groups are equal. (2) At least one group's mean is different. The ANOVA test is performed using the following steps: Define Hypothesis, Calculate the Sum of Squares, Determine degrees of freedom, Determine F-value, and Accept or Reject the Null Hypothesis. Table 2 shows the results of ANOVA test. The ANOVA test result showed statistically significant results of the proposed Coot-SCA_NN with accuracy: F = 40.48827, and P = 2.29E-14. So, the Null Hypothesis is rejected.
ANOVA test.
Training Time
Training time refers to the duration required for a machine learning model to learn from a dataset and build its structure. During this process, the model examines the data, identifies patterns, and constructs the final model. Table 3 shows the training time of the proposed method and the exiting methods. Here, the training time of the proposed method is minimum than the training time of the existing methods.
Training time.
Comparative Analysis based on time step
Figure 8 depicts the comparative analysis of Coot-SCA_NN using Taiwanese Bankruptcy Prediction Dataset by varying the time step. The analysis of the proposed method based on accuracy is presented in Figure 8(a). When time step =20, accuracy values obtained by FL is 0.788, GA-based gradient boosting is 0.799, OVO-SVM is 0.817, DT is 0.827, DNN + CNN is 0.837, LR is 0.858, RF is 0.865, and the proposed methodology is 0.877. While time step is increased to 80, the proposed method obtained an accuracy of 0.918 and traditional methods achieved an accuracy of 0.827, 0.847, 0.866, 0.877, 0.889, 0.898, and 0.909. The analysis using MSE is provided in Figure 8(b). The existing methods and proposed method have MSE values of 0.688, 0.648, 0.588, 0.558, 0.510, 0.477, 0.328, and 0.225, when time step = 20. Similarly, for time step = 80, the MSE value obtained by the proposed method is 0.158 and the existing methods is 0.638, 0.566, 0.499, 0.477, 0.457, 0.409, and 0.266. The analysis using RMSE is shown in Figure 8(c). The existing methods and the proposed method have RMSE of 0.829, 0.805, 0.767, 0.747, 0.714, 0.690, 0.573, and 0.474 for time step = 20. RMSE values of FL is 0.799, GA-based gradient boosting is 0.752, OVO-SVM is 0.706, DT is 0.690, DNN + CNN is 0.676, LR is 0.639, RF is 0.515, and Coot-SCA_NN is 0.397, while time step = 80. Figure 8(d) shows the analysis based on loss function. When time step = 20, the existing methods, such as FL, GA-based gradient boosting, OVO-SVM, DT, DNN + CNN, LR, RF, and the proposed Coot-SCA_NN have the loss values of 0.212, 0.201, 0.183, 0.173, 0.163, 0.142, 0.135, and 0.123, respectively. For time step = 80, the proposed method has the loss value of 0.082 and traditional methods, such as FL, GA-based gradient boosting, OVO-SVM, DT, DNN + CNN, LR, and RF have the loss value of 0.173, 0.153, 0.134, 0.123, 0.111, 0.102, and 0.091, respectively. The analysis of Coot-SCA_NN based on MAP is shown in Figure 8(e). For time step = 20, the MAP obtained by intended method is 0.847 and traditional methods is 0.727, 0.747, 0.759, 0.777, 0.798, 0.810, and 0.827. Similarly, for time step = 80, the MAP values attained by proposed method is 0.891 and FL, GA-based gradient boosting, OVO-SVM, DT, DNN + CNN, LR, and RF is 0.767, 0.788, 0.799, 0.826, 0.838, 0.847, and 0.877, respectively.

Comparative analysis with time step, a) accuracy b) MSE c) RMSE d) Loss e) MAP.
This section the analysis based on the perturbation radius, which plays a key role in enhancing privacy protection. When handling sensitive financial data, selecting an appropriate perturbation radius is crucial for maintaining a balance between privacy and model accuracy, ensuring data protection against reconstruction attacks while preserving its analytical value. Figure 9 provides the performance of the proposed method by varying the perturbation radius.

Analysis by varying the perturbation radius, a) accuracy b) MSE c) RMSE d) Loss e) MAP.
Figure 9(a) shows the performance based on accuracy. When the perturbation radius is set to 0, the accuracy of the proposed method is 0.858, while the accuracy of the existing methods, such as FL, GA-based gradient boosting, OVO-SVM, DT, DNN + CNN, LR, and RF is 0.735, 0.759, 0.778, 0.799, 0.809, 0.826, and 0.848, respectively. When the perturbation radius is set to 0.4, the accuracy of the methods increases. The analysis based on MSE is shown in Figure 9(b). The MSE of the methods, such as FL, GA-based gradient boosting, OVO-SVM, DT, DNN + CNN, LR, RF, and the proposed method is 0.665, 0.637, 0.558, 0.488, 0.448, 0.377, 0.317, and 0.258, respectively, when the perturbation radius is set to 0.4. Similarly, the analysis based on RMSE, loss, and MAP is shown in Figure 9(c), Figure 9(d), and Figure 9(e), respectively. When the perturbation radius is 0.4, the RMSE of the comparative methods, such as FL, GA-based gradient boosting, OVO-SVM, DT, DNN + CNN, LR, RF, and Coot-SCA_NN is 0.816, 0.798, 0.747, 0.698, 0.669, 0.614, 0.563, and 0.508, respectively. When the perturbation radius is set to 0.4, the loss and MAP of the proposed method is 0.123 and 0.847, respectively. On seeing Figure 8, it is noted that the proposed method offers the good performance than the existing methods. Moreover, the results demonstrate how different perturbation levels impact the model performance while maintaining privacy. By introducing controlled perturbation, the privacy can be enhanced without significantly compromising the accuracy of financial distress prediction.
Shareholders don't know the true financial position of a firm until a financial crisis occurs because of financial information imbalance. Operational sustainability of a company not only affected by financial crisis and also shareholders interest, damages the rights, it harms national economy and society. Therefore, Coot-SCA_NN is proposed for financial distress prediction. The proposed method involves different entities: nodes and servers. The processes in the proposed framework include local training such as local data of each node, updates of server, integration of models in server, and downloading of global model to the nodes. This includes training updates on the basis of downloaded global model and local model and at every epoch of iteration. Here, training model takes input data from the dataset and runs a data augmentation model based on mutual information. Finally, financial distress is predicted using a NN trained with a proposed optimization algorithm called Coot-SCA. The information is processed at each node and the details are logged to the server for further processing. By incorporating Coot algorithm and SCA, proposed Coot-SCA is derived. A major contribution is made through local renewal and consolidation of the service performed by replacing CAViaR. It attained accuracy of 0.924, MSE of 0.083, RMSE of 0.289, Loss of 0.079, and MAP of 0.910. In future work, the predictive outcomes can be further improved by using clustering and feature selection approaches. Also, noise injection techniques (such as Laplace or Gaussian mechanisms), cryptographic methods, and privacy-preserving aggregation techniques, such as Secure Aggregation will be considered to secure and resilient against potential threats.
Footnotes
Funding
The authors received no financial support for the research, authorship, and/or publication of this article.
Conflicting interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
