Abstract
Chronic Kidney Disease (CKD) is a crucial life-threatening condition due to impaired kidney functionality and renal disease. In recent studies, Kidney disorder is considered one of the essential and deadliest issues that threaten patients’ survival with the lack of earlier prediction and classification. The earlier prediction process and the proper diagnosis help delay or stop the chronic disease progression into its final stage, where renal transplantation or dialysis is a known way of saving the patient’s life. Global studies reveal that nearly 10% of the population is affected by Chronic Kidney Disease (CKD), and millions die because of non-affordable treatment. Early detection of CKD from the biological parameters would save people from this crisis. Machine Learning algorithms are playing a predominant role in disease diagnosis and prognosis. This work generates compound features from CKD indicators by two novel algorithms: Correlation-based Weighted Compound Feature (CWCF) and Feature Significance based Weighted Compound Feature (FSWCF). Any learning algorithm is as good as its features. Hence, the features generated by these algorithms are validated on different machine learning algorithms as a test for generality. The simulation is done in MATLAB 2020a environment where various metrics like prediction accuracy gives superior results compared to multiple other approaches. The accuracy of CWCF over different methods like LR is 97.23%, Gaussian NB is 99%, SVM is 99.18%, and RF is 99.89%, which is substantially higher than the approaches without proper methods feature analysis. The results suggest that generated compound features improve the predictive power of the algorithms.
Keywords
Introduction
In endeavoring to work on healthcare by employing computational techniques, extracting relevant and appropriate features is essential. The reduction of high-dimensional data plays a crucial preprocessing method while handling healthcare data analytics [1]. Processing features without a reduction of dimensionality may increase the processing and computational time. Hence optimized feature extraction and selection from high dimensional data has a more significant impact on classification techniques [2]. It is hardly known ahead of time which features are appropriate for classification. Also, it is not practically possible to measure the features at the initial stage that are to be deployed during classification. Extracting meaningful and significant features will reduce the computational complexity, increase the model’s reliability to classify effectively, and ensure precise model representation [3]. Predominantly there are two types of high dimensionality reduction techniques that will optimize the features to a greater extent, namely feature extraction and feature selection [4, 5]. Feature extraction works on feature scaling and transformation that bring all the features on one scale by highlighting the high dimensional feature data to lower subspace which will help to maintain the quality of the feature to be the original and minimize the loss of information during reduction.
Feature selection points directly to the subset of features primarily important from the extracted features for classification. Hence feature extraction and feature selection for dimensionality reduction has become one of the bases for the classifiers. Considering feature extraction’s importance, there are different approaches, namely principal component analysis [6], Isometric mapping, and locally linear embedding. In contrast, feature selection is commonly constructed on information theory, statistics, and rough sets classified depending on the protocols [7]. In the early prediction of medical diseases, feature selection plays a significant role in identifying the essential features needed for diagnosis [8]. Hence, selecting appropriate features helps narrow down the medical expert’s decision during diagnosis. Also, the healthcare dataset contains large voluminous quantities of medical records that are directly taken from the hospitals and are updated daily. These datasets constitute information from expert prescriptions diagnostics reports and posts on social networking like tweets, tags and blogs [9, 10]. The origin of emerging techniques like data mining, machine learning, and intelligent advent technologies like the Internet of Things and Edge computing has given a new outset in diagnosing Chronic Kidney Disease with great force to a new epoch.
Medical analysis has become a remunerative research area that focuses on diagnosing chronic diseases like kidney, heart and diabetes [11]. Developing a proper Chronic Disease Diagnosis system called CDD can be used as a common framework for continuous assessment of chronic disease in affected persons [12]. Hence, developing an automated system with appropriate feature selection aid helps medical physicians give accurate early predictions of disease. Feature selection is also investigated as variable selection. It is a regularly performed preprocessing method that eliminates redundant attributes and irrelevant and missing values and processes on insignificant and surplus data [13]. These processes would enhance the computational speed, accelerate clear visualization of results, propagate exploratory analysis of data, and ensure high accuracy and performance of prognosis of the disease [14, 15]. Various popular selection methods, such as ensemble methods, are more popularly known for combining multiple hypotheses to get a better hypothesis. Wrapper methods are used explicitly on machine learning algorithms and embedded methods more like wrapper methods that give optimized output and filter methods that work on independent features [16]. These methods are more frequently used in prevalent chronic diseases such as kidney, heart, diabetes, stroke and thalassemia [17]. It is proved by developing a highly meticulous diagnostic system to predict joint malfunctions by deploying VAG signals that describe the importance of feature selection in the medical field [18]. Thus, there is a need for an efficient approach to deal with the feature learning and classification process. The significant research contributions are provided below: Initially, acquire the samples from available online data sources like publicly accessible datasets. Here, the UCI ML repository is considered for CKD analysis. To provide a detailed description of the linear feature projection with correlation-based weighted compound features, the mutual information among the features is analyzed. The experimental outcomes are discussed where the simulation is done with MATLAB 2020a environment, and various metrics like precision, recall, F1-score and accuracy are measured and compared. Based on this analysis, it is proven that the model outperforms multiple existing approaches.
This research paper is organized as follows: Section 2 discusses the various existing approaches with the significant research constraints associated with the prediction process. It paves the baseline for the proposed methodology. Section 3 gives a short description of linear feature projection for finding mutual information between the target class and feature for chronic kidney disease. Secondly, Correlation Based Weighted Compound Feature Selection Using Linear Projection to perform mathematical estimation is presented. Further, an algorithm by combination with linear projection is given. Thirdly, Feature Significance Based Weighted Compound Feature Selection Using Linear Projection with the deployment of a partial feature set for chronic disease prediction is shown in Section 4. Section 5 gives a discussion about the results for the prediction of chronic kidney diseases is presented. Lastly, a comparison of different ML algorithms is discussed with the evaluated results as shown in the conclusion.
Related works
Various existing approaches are discussed to predict CKD in its earlier stage. Tangri et al. [23] depict NCA for kidney cancer classification of sub-types with miRNA genome data. The provided miRNA sample classification over cancer sub-types is offered to acquire discriminative properties of LSTM and miRNAs in the RNN form. This anticipated NCA process chooses the biased miRNAs dataset. The miRNA subset facilitates the anticipated LSTM to cluster the cancer-based miRNA to five diverse sub-types with 95% average prediction accuracy and 0.93 MCC for ten random clustered processes (five times) that are nearer to the average output of miRNA for rating the functionality. Sinha et al. [24] initiated a novel DNN approach to classify and detect various renal histologic phenotypes. It is revealed that the ML concept with DNN has provided extensive performance in various processing tasks of histologic images. The NN concepts intend to extract and use quantitative feature learning to categorize diversions among different genotypes (mice). The non-glomerular segmentation and animal-based genotype rely on the quantitative image features and pretend to provide superior performance.
Jena et al. [25] recommend HNN for predicting kidney function. A prediction problem is explicitly formulated as a binary classification function to acquire the complete information from the available Electronic Health Record (EHR). The author recommends HNN for integrating bi-directional LSTM and auto-encoder network model. Based on the data acquired from the raw HER, the author initiates the construction of a dataset and collects roughly 35k reports from various patients’ including patients with hypertension. The testing outcome shows that 90% of the model accuracy is due to the inclusion of NN. The anticipated approach has examined the need for synthetic NN with the predictions for various analyses. The efficacy of the anticipated NN model relies on the construction of an available online dataset. Author et al. [26] model a novel RNN for predicting acute kidney injury development. The model determines the input sequences responding to various prior sequential sections essential for further processing. The prediction probabilities of acute kidney disease with specific clinical parameters/sequences are predicted with RNN. For all acute kidney disorders defined with crucial severity levels, the provided model intends to show reasonable prediction with 55% for all acute kidney diseases. The advancements with the clinical diagnosis in an average time interval of about 48 hours at the selected operation give 35% accuracy. Acute kidney disease is identified for one patient among three different predicted cases, while some other predictions are estimated to be wrong. Moreover, testing reveals that the patients with CKD show 58% FP evaluation.
Kolachalama et al. [27] anticipated an ensemble-based multi-stage DL approach for segmenting kidney tumors. An integration process is applied to analyze the prediction outcomes from the previous phases and to measure the variance among diverse individual models. The dice score (average) for kidney disease and tumor cases are 96% and 74%, respectively, for some unidentified test cases. Some predictions are positive and enhanced by providing advanced knowledge of benign cysts, which decreases tumor segmentation frequently over the IoMT platform. The numerical result drastically diminishes the error, and a relatively large batch size is used. The batch normalization properties are exploited superiorly, contrary to various attempts with smaller batch sizes. The outcomes acquired from 32 samples are enhanced among the evaluations. However, all these models fail to give superior prediction accuracy based on earlier feature analysis. Thus, the drawback needs to be addressed, and this research has initiated an attempt to perform better feature learning for the earlier prediction process. Table 1 depicts the comparison of various existing approaches and their corresponding drawbacks.
Comparison of various existing approaches
Comparison of various existing approaches
This section provides a detailed analysis of the anticipated feature learning approach where the evaluation is done with MATLAB 2020a simulation environment. Various performance metrics like accuracy, precision, F-measure, and recall are evaluated and compared with existing approaches. Figure 1 depicts the block diagram of the anticipated model.

Block of the proposed predictor model.
This research considers the online available UCI ML dataset for the CKD dataset, and the UCI dataset contains 24 attributes collected from 400 patients. Table 2 summarizes the attributes in the dataset.
Attributes in the CKD dataset
Attributes in the CKD dataset
The classifiers are spawning over a multitude of features to derive reliable hypotheses. Many dimensionality reduction techniques like Principal Component Analysis (PCA), Wrappers, and filters are commonly used to: Eliminate redundancy Explore high dimensional more information from features Generalizing the classifier
The maximum discriminative information is preserved between features to obtain a more generic classification. Independent Component Analysis (ICA) is essential to find more appropriate features. Let y = Ax be the maximum informative linear feature projection from m to n dimensions. The projection matric A ∈ Rm*n is optimally approximated by accumulating the mutual information M info (f|t) between features f and target class t [19]. It is mathematically expressed as in Equation (1):
From Equation (1), the aggregation of mutual information between f and t is provided. The projection of features associated with the target class can be ranked by the individual features’ M info (f|t). The entropy measure is maintained to preserve the orthogonality. Conditional entropy is estimated when the features are independent of one another. It is the uncertainty introduced by a feature when correlated with another feature. This uncertainty can be interpreted as the inter-information provided by a feature on another feature [20]. Internally, the estimation of M info (f|t) is grounded on the entropy of the independent features with its target class. The individual entropy of the feature H (f i ) based on the m space estimator is given by Equation (2):
Finally, the estimation of M info (f|t) happens according to Equation (4), where P t is a constant.
Correlation is the mathematical estimation of the intensity of association between the feature and the target class. High correlation indicates that the features play a predominant role in prediction. Weighing the features based on their correlation enhances the efficacy of prediction. Generating compound features from the original features must preserve the orthogonality among new combinations of features and the original features. The proposed correlation-based weighted compound feature selection uses linear projection methodology and then imposes correlation as weights. The Partial Features (PF) consist of features that have M info (f|t) value below the threshold limit (T1). The Subset of Semi Features (SSF) set is constructed using these weighted features and is given in Equation (5):
The correlation values are represented as C ={ c1, c2, …, c n }. The ultimate Correlation-based Weighted Compound Features (CWCF) set is constructed with features that exceed the qualifying thresholds, computed empirically. Algorithm 1 describes the entire process of Correlation-based feature selection grounded on a linear projection of features.
Feature importance is a score assigned to features based on their usefulness in predicting the target class. The information gain-based feature selection is widely used as it dramatically reduces dimensions without loss of information [21]. The proposed algorithm used model agnostic feature significance to construct compound features from the partial feature set. The construction of a partial feature set is governed by Equation (6):
From Equation (6), s1, s2, . . , s n represents the feature significance score concerning the target class t. Algorithm 2 shows the weighted compound feature selection generation using entropy-based linear projection with the feature significance scores as weights.
The selected features need to be optimized using any meta-heuristic optimizer to verify the global and local outcomes. The model needs to fulfill both exploration and exploitation. Here, a novel Rule-based firefly optimizer is proposed to analyze the significance of selected features.
In accordance with the no-free lunch theorem, the averaged objective function of any ML algorithm is as good as the others. So, the proposed work delves into selecting the features which hold higher prominence, than exploring all the features. The proposed work significantly reduces the overhead of the learning algorithm by presenting a subset of significant features with appropriate weights deduced based on the correlation with the target. This scheme offers a two-fold advantage: firstly, it reduces the processing overhead, and secondly, it confines the algorithm to learn only from the significant features. This is in contrast to the existing methods in the literature which are limited to selecting the features from Exploratory Data Analysis. Also, forming subsets of features based on mutual information, significance and correlation in the healthcare domain gives detailed knowledge about the combination of factors that worsens the health condition.
5) Rule-based Firefly optimizer
The target of this optimizer is to attain an optimal solution and enhances the prediction rate [31]. It is a heuristic optimizer energized with the flashing nature of fireflies and works based on the signal system. The model follows certain rules: Rule 1- enables an individual to attract other fireflies; Rule 2- attractiveness is directly proportional to brightness, i.e. less brightness is attracted towards brighter one; Rule 3- firefly moves randomly and determined with objective functions. The light intensity variation is expressed as in Equation (7):
Here, γ specifies light absorption rate; I o specifies initial light intensity. The firefly attractiveness (rule 2) is expressed as in Equation (8):
Here, d specifies distance among fireflies; β0 specifies distance attractiveness and γ specifies variation attractiveness. Finally, rule 3 is measured with Cartesian distance measure as in Equation (9):
Here, i and j specify the distance between two fireflies, x ik specifies the spatial coordinator of k th component; N specifies the number of dimensions. The movement (random) of fireflies is expressed as in Equation (10):
Here, α specifies randomization parameter and d specifies the random number distributed uniformly. Algorithm 1 explains the Rule-based firefly functionality for optimization:
This section provides the numerical analysis of the anticipated model over various existing approaches. Here, MATLAB 2020a is used as a simulation environment where metrics like accuracy, precision, recall, and F1-measure are evaluated and compared with existing approaches. The metrics are mathematically expressed as in Equations (10):
Here, TP is True positive; TN- True Negative; FP- False Positive; FN –False Negative [32]. Based on the analysis, it is evident that medical data is highly unstructured and selecting representative features for developing an accurate prediction model is a tiresome task. Since medical data is susceptible, additional care has been taken to preprocess the data and select pertinent features. The primary objective of this work is to assess the factors that contribute to CKD among the features collected in the benchmarked Chronic Kidney Disease dataset from the UCI machine learning repository [22]. The proposed algorithms, namely Correlation Based Weighted Compound Feature Selection Using Linear Projection and Feature Significance based weighted compound feature selection using Linear Projection, aim to generate the best compound features from the original master feature set. The so-developed compound features are applied over the prediction algorithms, namely SVM, random forest, logistic regression, and Gaussian Naïve Bayes, to test its effectiveness in predicting CKD occurrence among patients. The correlation-based algorithm produces compound features to classify whether a patient is prone to CKD. The correlation values of each feature with the target class are listed in Table 2. The dataset attributes (features) considered for analysis include age, BP, gravity, albumin, sugar, RBC, pus cells, clumps, bacteria, glucose (random), appetite, creatinine, sodium, potassium, hemoglobin, hypertension, diabetes mellitus, coronary heart disease, urea, enema, anemia, RBC, WBC and packed cell volume are included. However, all these features are not highly required for complete validation. Some influencing features are considered for analysis where some analysis metrics like correlation, feature significance and MI are analyzed, and the corresponding values are evaluated. The feature significance value relies on 0 to 2, the MI relies on 0 to 0.45, and the correlation value includes positive and negative values.
The applications in the healthcare domain hold accuracy as its primary metric rather than time consumption. The proposed method investigates all the features based on their mutual information, feature significance and correlation to ascertain its rationale in predicting CKD. Also, the numeric nature of clinical parameters further speeds up the computation as present-day systems are well equipped with enormous processing power.
The compound features consist of original features and SSF to predict CKD occurrence effectively. Table 3 summarizes the selection of features by both the proposed algorithms. Based on Table 3, the master features include f1 → f6, partial features with lesser mutual information have f2 → f4, the semi-subset features are f1, f5 and f6, and the semi subset features FSWCF includes f1, f5and f6 .
Assessment of various features in the context of Correlation, Feature Significance and Mutual Information
The proposed algorithm 1 generates CWCF, used by various machine learning algorithms such as SVM, Gaussian Naïve Bayes, Logistic Regression and Random Forest to test its efficacy. The mean of M info (f|t) is considered a threshold for simplicity. Various techniques like Ant Colony Optimization (ACO) and Particle Swarm Optimization (PSO) can be explored to determine the threshold value. Detailed correlation values and mutual information is listed in Table 2. The train-test ratio of 75-25 is preserved throughout the experiment. The performance of various algorithms on the generated CWCF is assessed on the metrics like classification accuracy, F1-score, Precision and Recall. Table 4 gives the performance results of the CWCF features selection algorithm on standard machine learning algorithms. The impact of compound feature generation is well portrayed in the results.
Summary of feature selection by proposed algorithms
Summary of feature selection by proposed algorithms
Figure 2 visualizes the results of Table 4. The machine learning algorithms exhibit better performance when subjected to features generated by CWCF. Logistic regression shows improved accuracy of 97.23% when deployed with features selected through the CWCF method against raw features. A similar trend can be observed with other algorithms also. In addition to this, the CWCF reduces misclassifications, which are very crucial in any medical disease diagnosis. It is witnessed by the improvement in the F1- Score, precision and recall. Thus by selecting influential features through CWCF, the ML algorithms can output substantially improved efficacy, which is the need of the hour for highly unstructured heterogeneous medical data. In the case of original feature analysis, the prediction accuracy of RF is 98.65% which is 2.82%, 0.54% and 0.005% higher than LR, NB and SVM. The F1-score of RF, NB, and SVM is 98% which is 2% higher than LR. The precision of NB, SVM, and RF is 98% which is 3% higher than NB, SVM, and RF approaches. The recall of RF, NB and SVM is 98% which is 1% higher than LR. From this, it is known that the models like RF, NB, SVM and LR give nominal outcomes for original features. While in the case of CWCF, the prediction accuracy of RF is 99.89% which is 2.66%, 0.76%, and 0.71% higher than LR, NB, and SVM. The F1-score of RF is 99.13% which is 0.13% higher than NB and SVM and 1.13% higher than LR. The precision of RF is 99.13% which is 0.13% higher than NB and SVM and 1.13% higher than LR. The value is the same for recall also. The CWCF model gives comparatively higher values than the original features.

Competitive analysis of CWCF selection on various algorithms.
Results of the proposed FSWCF feature selection algorithm applied to various machine learning techniques
With the mutual comparison, the prediction of RF is 99.89%, SVM is 99.18%, NB is 99.13%, and LR is 97.23% which is 1.24%, 0.58%, 1.07% and 3.95% higher than RF, SVM, NB and LR-based original feature analysis. The F1-score of RF is 99.13%, SVM is 99%, NB is 99%, and LR is 98% which is 1.13%, 1%, 1% and 3% higher than RF, SVM, NB and LR-based original feature analysis. The precision of RF is 99.13%, SVM is 99%, NB is 99%, and LR is 98% which is 1.13%, 1%, 1% and 3% higher than RF, SVM, NB and LR-based original feature analysis. The recall of RF is 99.13%, SVM is 99%, NB is 99%, and LR is 98% which is 1.13%, 1%, 1% and 1% higher than RF, SVM, NB and LR-based original feature analysis. Based on this analysis, it is proven that CWCF works better than the actual feature analysis.
Algorithm 2 deploys feature significance as a primary criterion in selecting features from the master feature set. FSWCF algorithm considers significance scores as weighing metrics for weaker features with low M info (f|t) values. These scores are generated by training the original features with the Random Forest algorithm, one of the best bagging algorithms aiming to reduce overfitting. The performance analysis of the CKD data in predicting the disease on deploying FSWCF compound feature generation on various machine algorithms is given in Table 5. Figure 3 shows the visual performance comparison of the ML algorithms.
Results of the proposed CSCF feature selection algorithm applied to various machine learning techniques
Results of the proposed CSCF feature selection algorithm applied to various machine learning techniques

Competitive analysis of FSWCF selection on various algorithms.
A significant improvement is seen when the ML algorithms are subjected to features selected through the FSWCF method than processing with original features. A steep improvement is sensed in Gaussian Naïve Bayes, where the accuracy improved by 2%, and the misclassification is reduced further. This technique proves its strength in reducing misclassifications rather than just tuning only the classification accuracy. From Table 5, the original feature analysis is done with metrics like accuracy, F1-score, precision and recall. The accuracy of RF is 98.32% which is 6.32%, 1% and 3.32% higher than LR, NB and SVM. The F1-score of SVM is 95% which is 3%, 0.77% and 1% higher than LR, NB and RF. The precision of SVM is 99% which is 2%, 4% and 6% higher than LR, NB and RF approaches. The recall of RF is 95%, which is 6%, 4%, and 2% higher than LR, NB, and SVM techniques. This analysis identifies that SVM works well compared to LR, NB and RF approaches. While in the case of FSWCF, the prediction accuracy of RF is 99.89% which is 5.95%, 0.65%, and 3.68% higher than LR, NB and RF. The F1-score of RF and NB is 99% which is 5% and 3% higher than LR and SVM. The precision of NB is 100% which is 5%, 3% and 1% higher than LR, SVM and RF approach. The recall of RF is 99% which is 1% higher than NB and SVM and 5% higher than LR.
With the mutual comparison, the prediction of RF is 99.89%, SVM is 96.21%, NB is 99.24%, and LR is 93.94% which is 1.57%, 1.21%, 1.92% and 1.94% higher than RF, SVM, NB and LR-based original feature analysis. The F1-score of RF is 99%, SVM is 96%, NB is 99%, and LR is 94% which is 5%, 1%, 1.68% and 2% higher than RF, SVM, NB and LR-based original feature analysis. The precision of RF is 99%, SVM is 97%, NB is 100%, and LR is 95% which is 6%, 2%, 5.77% and 3% higher than RF, SVM and NB LR-based original feature analysis. The recall of RF is 99%, SVM is 98%, NB is 98%, and LR is 94% which is 0.68%, 3%, 0.68% and 2% higher than RF, SVM, NB and LR-based original feature analysis. Thus, the functionality for feature selection with FSWCF is superior to the standard feature analysis.
Though both methods show a considerable increase in performance, the algorithm1 (CWCF) selection strategy proves to be more effective than the algorithm2, based on feature significance scores. The comparative assessment between their mean classification metrics is shown in Fig. 4. The improved performance of CWCF is because the correlation values are estimated based on the number of features in positive and negative classes. Any change in the numbers will alter the values. Hence this score is more reliable and specific to the dataset. On the other hand, the FSWCF considers the ranking of features based on their influence in labeling the classes. Also, the FSWCF needs the support of a good predictor to output enhanced performance. This work mainly focuses on building good compound features by deploying two techniques, namely CWCF and FSWCF. A rational improvement is sensed in using these methods. This work provides lots of scopes to train a dataset with a lesser number of features and to introduce evolutionary algorithms in optimizing the threshold values to select the features.

Comparison between proposed methods.
While comparing the performance of FSWCF and CWCF, it is evident that FSWCF works well compared to CWCF approaches. The prediction accuracy of RF (FSWCF and CWCF) is 99.89%; however, the prediction accuracy of SVM is 96.21%, NB is 99.24%, and LR is 93.94% which is 2.97% lesser than SVM (CWCF) and 0.09% higher than NB (FSWCF) and 3.29% lesser than LR (CWCF). The F1-score (FSWCF) of RF is 99% which is 0.13% higher than RF (CWCF), SVM is 96% which is 3% lesser than SVM (CWCF), 99% for NB and 4% lesser than LR (CWCF). The precision (CWCF) is 99.13% which is 0.13% lesser than RF (FSCWF), 2% higher than SVM (FSCWF), 1% lesser than NB (CWCF) and 3% higher than LR (FSCWF), and recall of RF is 0.13% higher than RF (FSWCF), 1% higher than SVM (FSWCF), 1% higher than NB (FSWCF), and 4% higher than LR (FSWCF). It is proven that CWCF shows superior outcomes compared to other feature selection approaches with the optimal observed points using a rule-based firefly optimizer as in Fig. 5.

Objective function of rule-based firefly.
Chronic Kidney Disease has become a prevalent global health issue. A variety of factors are found to create deterioration in the performance of kidney function, causing them to fail over a while. This research focuses on restricting the influence of various factors that contribute to CKD by assessing the attributes through two techniques, namely CWCF and FSWCF. The results of these techniques are evaluated by testing them with different ML algorithms, which shows that CWCF yields improved results over the FSWCF algorithm. The proposed model delivers better results compared to standard analysis. This research will help medical practitioners to decide whether the patient is prone to CKD. The primary research constraint is the analysis with the feature learning process alone without much concentration towards the classification process. In the future, the classification process will be done with ML approaches to enhance prediction accuracy.
