Abstract
BACKGROUND:
Artificial Intelligence (AI) plays a pivotal role in the diagnosis of health conditions ranging from general well-being to critical health issues. In the realm of health diagnostics, an often overlooked but critical aspect is the consideration of cost-sensitive learning, a facet that this study prioritizes over the non-invasive nature of the diagnostic process whereas the other standard metrics such as accuracy and sensitivity reflect weakness in error profile.
OBJECTIVE:
This research aims to investigate the total cost of misclassification (Total Cost) by decision rule Machine Learning (ML) algorithms implemented in Java platforms such as DecisionTable, JRip, OneR, and PART. An augmented dataset with conjunctiva images along candidates’ demographic and anthropometric features under supervised learning is considered with a specific emphasis on cost-sensitive classification.
METHODS:
The opted decision rule classifiers use the text features, additionally the image feature ‘a* value of CIELAB color space’ extracted from the conjunctiva digital images as input attributes. The pre-processing consists of amalgamating text and image features on a uniform scale, normalizing. Then the 10-fold cross-validation enables the classification of samples into two categories: the presence or absence of the anemia. This study utilizes the Cost Ratio (
RESULTS:
It has been established that the PART classifier stands out as the top performer in this binary classification task, yielding the lowest mean total cost of 629.9 compared to other selected classifiers. Moreover, it demonstrates a comparatively lower standard deviation 335.9, and lower total cost across all four different cost ratio methodologies. The ranking of algorithm performance goes as follows: PART, JRIP, DecisionTable, and OneR.
CONCLUSION:
The significance of adopting a cost-sensitive learning approach is emphasized showing the PART classifier’s consistent performance within the proposed framework for learning the anemia dataset. This emphasis on cost-sensitive learning not only enhances the recommendations in diagnosis but also holds the potential for substantial cost savings and makes it a noteworthy focal point in the advancement of AI-driven health care.
Keywords
Introduction
The advancements in technology go beyond providing medical data for diagnosis; they play a multifaceted role in enhancing various aspects of healthcare [1, 2]. In recent years, Machine Learning (ML) and Deep Learning (DL) models are more supportive to humans in most of the decision making process [3, 4, 5]. The conventional way of assessing the performance of ML or DL will be done mainly based on its classification accuracy assuming that the misclassification costs (false negative and false positive cost) are the same. However in most of the critical applications like fraud detection, cancer diagnosis, and bank loan problems the misclassification of the sample in each class will create a different impact because of a variety of reasons. For example, in loan problems false positives will cost more than false negatives [6]. In certain scenarios, such as the diagnosis of cancer, misclassifications can have significantly different consequences. For instance, a false negative (missed diagnosis) can be extremely serious as it may lead to delayed treatment and potentially cost the patient their life [7, 8]. On the other hand, a false positive (false alarm) may result in unnecessary concern and further tests, but it is less severe in comparison. In our processes, we have recognized the importance of addressing this discrepancy and have therefore reevaluated our equations and parameters for cost calculation, particularly when it comes to materials and resources involved in the classification process.
Let us consider the AI based non-invasive anemia classification problem. Anemia is a prevalent condition characterized by a decrease in the number of red blood cells or a decrease in their ability to carry oxygen effectively. It affects a substantial portion of the global population, with estimates suggesting that around a quarter of the world’s population is affected. The World Health Organization (WHO) identified anemia as a global health concern with significant prevalence and consequences [9, 10]. Diagnosis of anemia typically involves measuring hemoglobin levels through blood tests and evaluating additional parameters such as red blood cell count and morphology. Early diagnosis of anemia helps healthcare professionals to implement targeted interventions and management, mitigating the adverse effects on individuals’ health and well-being [11]. Non-invasive methods, such as conjunctival examination and digital image analysis, have emerged as promising alternatives. These techniques offer the potential for quick and cost-effective screening, making anemia diagnosis more accessible, particularly in resource-limited settings, and facilitating timely intervention and treatment. The research for developing a low cost and non-invasive diagnostic tool/device started a few decades ago. But, it is very important to consider the total cost for misclassification rather than the model development/device design cost. In non-invasive anemia screening if the test participant is misclassified as false positive, he/she will consult the health professional followed by a golden standard hemoglobin test to confirm, whereas the false negative will make either the test person to believe that he/she is normal or to check for other disease conditions with an additional test. This will make the anemic condition worse and result in additional test cost, treatment cost such as blood transfusion, long recovery period etc. The misclassification cost of False Negative is always higher than False Positives for sensitive applications like anemia diagnosis.
The proposed work evaluates the total cost involved in the misclassification of a sensitive application, non-invasive anemia diagnosis classified by rule based classifiers which was motivated by the work done by Kumaravel et al. [12]. Here, they explored the performance metric, total cost of misclassification through the cost ratio (
This article is structured as follows: Section 2 provides an overview of previous research conducted in the field of cost-sensitive classification and non-invasive anemia diagnosis. In Section 3, the materials and methods required for the proposed work are described. This is followed by the presentation of experimental results and corresponding discussions. Finally, the paper concludes with a summary of findings and key insights.
Related works
Cost-sensitive classification is a critical area of research in machine learning and data mining, aiming to improve the performance of classifiers by incorporating the varying costs associated with misclassification errors. Mienye et al. investigated the strength of cost-sensitive learning approaches specifically in the context of imbalanced datasets [13]. They utilized conventional machine learning methods to explore the effectiveness of cost-sensitive techniques. Their study shed light on the potential benefits of considering the varying costs associated with misclassification errors when dealing with imbalanced data. Telikani et al. delved into the domain of cost-sensitive classification using deep learning techniques [14]. They proposed a framework that involved partitioning the dataset and defining a separate cost matrix for each component. By incorporating cost-sensitive classification into the deep learning framework, they aimed to improve the overall performance of the models. While these studies primarily focused on incorporating cost sensitivity into the learning process, it is important to note that misclassification costs are not always known in many imbalanced dataset contexts. Therefore, cost-sensitive learning techniques have emerged as a valuable approach, as they take into account the misclassification costs during model construction without directly modifying the imbalanced data distribution. Assigning distinct costs to training examples has proven to be an effective approach for addressing class imbalance issues. Previous studies [15, 16] demonstrated the impact of varying the cost ratio (
Even though there are numerous works carried out in biomedical applications the significance of misclassification cost is very less bothered. Application of ML in the domain of ophthalmology for studying diabetic retinopathy detection was explored by Vijayan et al. (2020, 2023). The research to find the possibility of anemia screening through conjunctival examination started non-invasive anemia diagnosis a few decades ago. Previous studies [19, 20, 21, 22] assessed the effectiveness of physical examination of conjunctival pallor in anemia diagnosis. Suner et al. [23] used conjunctiva images photographed along with the 18% gray photographic standard card for non-invasive anemia screening and attained 55% sensitivity and 70% specificity. Collings et al. [24] made use of digital image acquisition devices for capturing the eye images along with a color calibration card closer to it. The impact of ambient lighting conditions on palpebral conjunctival Erythema Index (EI) was examined by a two-way analysis of variances (ANOVA) on values of pre and post standardization. This study found the Erythema Index of the palpebral conjunctiva correlated more heavily with hemoglobin concentration when compared to forniceal conjunctiva EI. Bevilacqua et al. [25] developed a head-mounted non-invasive device for capturing the conjunctiva region and to minimize the influence of ambient light. This research aims at decreasing false negatives instead of total performance. This research also revealed a* value of CIELAB color space of the manually segmented conjunctiva region, shows good correlation with both hemoglobin and Hematocrit values [26]. Muthalagu et al. [27] applied Feed Forward Neural Networks (FFNN) and Elman Neural Networks (ENN) for categorizing anemic and non-anemic cases from conjunctiva images. This study showed that ENN performs better classification than the FFNN and EI model with sensitivity of 77% and specificity of 96.11%. Dimauro et al. [28] used SMOTE and ROSE algorithms to address the class imbalance issue.
In all the aforementioned works related to non-invasive anemia diagnosis, the focus has primarily been on metrics such as accuracy and sensitivity, with insufficient consideration given to the costs associated with misclassifications [29]. In this proposed framework, the inclusion of novel metrics, namely Cost Ratio (
Materials and methods
Data description
The data utilized in this study was gathered in real-time, as stated in the study by Kasiviswanathan et al. [30]. It comprises 974 samples containing various attributes such as age, gender, height, weight, and the a* value of the L*a*b* colorspace defined International Commission on Illumination (CIELAB) derived from images of conjunctiva region [26]. The statistical information of the data, its distribution are given in Table 1 and Fig. 1 respectively.
Statistical information of the anemia dataset
Statistical information of the anemia dataset
Box plot of input features (left), distribution of male and female samples (top right) and distribution of output classes (top left).
A set of selected attributes, comprising demographic factors such as age and sex, and anthropometric features like height and weight, along with the ‘a*’ value of the CIE Lab color space, are chosen as input features due to their notable correlation with the anemic status. Previous research in the related works section highlights the significance of the a* value derived from conjunctiva images in non-invasive anemia diagnosis [25, 30, 31]. To enhance the classification performance, normalization is applied since each input attribute exists within a different range. The values are normalized using Min-max normalization, as described by Eq. (1), to rescale them to a range of (0, 1).
where
A cost-sensitive classifier differs from regular machine learning algorithms in its approach to handling misclassification errors. While regular ML algorithms prioritize overall accuracy without considering the varying costs associated with different types of misclassifications, cost-sensitive classifiers explicitly incorporate these costs into the learning process. They assign different costs or weights to different types of misclassifications based on the specific domain or problem at hand. By incorporating cost information, cost-sensitive classifiers make more informed decisions that minimize the expected total cost of misclassification. This is achieved by prioritizing certain classes or minimizing misclassifications that have higher costs or consequences. Cost-sensitive classifiers often require additional inputs, such as cost matrices or misclassification cost estimates, to account for the varying costs. In summary, cost-sensitive classifiers optimize classification outcomes by explicitly considering the costs associated with misclassifications, while regular ML algorithms focus primarily on overall accuracy [12, 32]. For the proposed cost sensitive classification framework the decision rule classifiers such as Decision Table, JRip, OneR, and PART are considered [33].
Decision rule classifiers
Decision rule classifiers are a type of machine learning algorithm that use explicit rules to make predictions or classify data instances. These classifiers operate by defining a set of rules based on features or attributes of the data, and each rule corresponds to a specific class label or outcome. Even though the decision trees and decision rules are functionally equivalent, the rule based algorithms are considered due to their nature of linearity contributing to the readability and storage efficiency as in the inference mechanism based on knowledge base or fuzzy systems. Metrics based on confidence and support are the byproduct or automatic outputs from the decision rule algorithms. The following decision rule algorithms are trained and tested for the above discussed dataset.
a) DecisionTable Classifier
DecisionTable [34] provides a compact and expressive representation using a tabular format. They capture combinations of input conditions and corresponding actions or outcomes, enabling efficient rule-based decision-making. Decision tables are known for their ease of interpretation and implementation, making complex decision-making logic more manageable.
b) JRip Classifier
Jrip [35] is a classification algorithm implemented in the Weka machine learning toolkit. It stands for “Repeated Incremental Pruning to Produce Error Reduction (RIPPER)”, a rule-based classifier that builds a set of rules to make predictions on new instances.It is known for its efficiency and interpretability, as the resulting rule set can be easily understood and analyzed. It has been widely used in various domains, including medical diagnosis, bioinformatics, and customer churn prediction.
c) PART Classifier
The PART [36] algorithm also known as Partial C4.5 Decision Tree algorithm constructs a set of rules that cover different instances in the dataset. These rules are generated by considering various attributes and their thresholds to partition the data. The resulting rule set can be used for classification, providing human-readable rules that offer insights into the decision-making process. By utilizing the PART algorithm in Weka, you can leverage its rule-based approach to create interpretable models that provide insights and transparency in classification tasks.
d) OneR Classifier
The OneR [37] algorithm is implemented as a rule-based classifier. It aims to create a simple and interpretable classification model based on a single attribute, referred to as the “One Rule.” The OneR algorithm evaluates each attribute individually and selects the attribute that yields the best classification accuracy for prediction. This approach allows for easy understanding and explanation of the decision-making process. However, it’s important to note that the One Rule model may not capture complex relationships between attributes, and its performance might be limited compared to more sophisticated classifiers.
Cost sensitive learning
In the context of cost-sensitive classification, the terms “CFP” and “CFN” typically refer to the costs associated with false positives and false negatives, respectively. These costs represent the specific misclassification costs for each type of error. To incorporate these costs into a formula, we can use the following notation:
CFP: Cost of a False Positive CFN: Cost of a False Negative
Table 2 displays the confusion matrix, providing insights into the False Positive (FP) and False Negative rates associated with the anemia classification task. In this classification, the anemic condition is regarded as the positive class, while the non-anemic condition is assigned to the negative class.
Confusion matrix
Confusion matrix
A common approach is to introduce a cost matrix, where each entry represents the cost associated with misclassifying a true instance of one class as a predicted instance of another class. In this case, the cost matrix for a binary classification problem with classes “positive” and “negative” can be defined in tabular form as shown in Table 3.
Cost matrix
The notations C11, C12, C21, and C22 are the elements of the Cost Matrix. The formula to calculate the total cost using the cost matrix and the number of false positives (FP) and false negatives (FN) would be:
The formula presented in Eq. (2) allows you to quantify the total cost by multiplying the respective misclassification costs with the corresponding number of false positives and false negatives. It’s important to note that the specific costs (CFP and CFN) and the structure of the cost matrix may vary depending on the problem and domain. These values are typically determined based on the specific context and the relative importance or impact of each type of misclassification in the given application.
The proposed work aims to implement a cost-sensitive classification approach that incorporates four different types of Cost Ratio (U, UI, NU and NUI). The workflow of the proposed work involves a set of key steps as shown in Fig. 2. Firstly, the input data undergoes min-max normalization preprocessing technique to ensure that the data is in a suitable format and contains relevant features for classification.
Workflow of the proposed cost sensitive classification approach.
The main process contains two components: Classifying and Cost Checking in each iteration. The first component is based on cost matrix-controlled cost-sensitive learning. This classifier takes into account the specific cost ratios derived from the cost matrix. This allows the classifier to assign different costs to misclassifications based on the importance and impact of each class output of an instance. Finally, the total cost of misclassification is found out.
The main focus of the algorithm revolves around the cost ratio (
Case 1. x takes the form 1/y where y is non negative integer. Case 2. x takes the form p/q where p, q are non-negative integers and gcd (p, q)
These four cases encompass all possible values of x in Q, representing the FP: FN ratio. The selection of these cases is guided by number theory findings discussed in previous studies [15, 17, 18].
This study employs the algorithm depicted in Fig. 3 to construct cost-sensitive classifiers, which incorporate the algorithm components described earlier. The construction of these classifiers is carried out using the Weka tool [33] by adjusting the ratio
a) Component for CSC-U
The cost value for CSC-U can be determined with the Cost Ratio (
b) Component for CSC-UI
This is similar to that of CSC-UI by just inversing Components for CSC-U where CFP
c) Component for CSC-NU
The above algorithm is applied with the ratio
d) Component for CSC-NUI
From the ratio
Figure 3 outlines the steps of the algorithm components, indicated by annotations in ‘[…]’, while the remaining steps common to all four cases are described in the rest of the figure. Dataset X comprises the underlying data instances mentioned in Section 3.1 followed by the definition of confusion matrix and cost matrix. The variable bi represents a list of decision rule classifiers, specifically chosen from the set {DecisionTable, JRip, OneR, PART}. The algorithm consists of two nested loops: the outer loop iterates over the selected decision rule classifiers, while the inner loop iterates over the index i variation, which corresponds to the cost ratio extracted from the cost matrix. Once the loop variants are fixed for the current iteration, the “TC” procedure is called to handle the training, testing, classification, and calculation of the total cost involved. The final output is determined by selecting the optimal value from the set of total costs generated by the aforementioned iterations.
Pseudo code for the proposed cost ratio controlled cost sensitive classifier (CSC) framework for anemia diagnosis.
Total cost for anemia diagnosing using CSC-U
Total cost for anemia diagnosing using CSC-UI
Performance of decision rule classifiers for CSC-U.
Performance of decision rule classifiers for CSC-UI.
The algorithm proposed in this study was implemented on the anemia dataset, and the obtained results were organized and presented in the form of tables and plots to facilitate a comparison of the performance of the decision rule classifiers during the 10-fold Cross Validation test based on total cost. Table 4 and Fig. 4 display the total cost generated by the decision rule classifiers for CSC-U (Cost-Sensitive Classification with Uniform cost ratio). Similarly, Table 5 and Fig. 5 depict the behavior of the total cost for the Uniform Inverse (UI) cost ratio.
During the analysis of decision rule classifiers’ performance for CSC-NU (refer to Table 6 and Fig. 6) and CSC NUI (refer to Table 7 and Fig. 7), an intriguing pattern emerged. To gain meaningful insights into these patterns, trend lines were plotted, as depicted in Figs 6 and 7. The parameters of the trend lines, such as slope (m) and intercept (c), along with the coefficient of determination, were recorded and presented in Table 8.
Total cost for anemia diagnosing using CSC-NU
Performance of decision rule classifiers for CSC-NU.
Performance of decision rule classifiers for CSC-NUI.
Total cost for anemia diagnosing using CSC-NUI
The comparative analysis of the mean total cost and standard deviation (SD) across different models for specific categories such as CSC-U, CSC-UI, CSC-NU, and CSC-NUI is represented in Table 8 and Fig. 8. This table aids in comparing the performance of different decision models based on their mean total cost, allowing for a comprehensive understanding of cost implications and variability across various scenarios. The standard deviation provides insights into the variability of the total cost within each model.
Performance of decision rule classifiers based on mean of total cost across different cost ratios
Performance of decision rule classifiers based on mean of total cost across different cost ratios
Trend line information
m-slope, c-Intercept, r2-the coefficient of determination.
Performance of decision tree classifiers for CSC-NUI.
Further statistical analysis for the total cost of the anemia diagnosing metric is as follows: The PART classifier has shown the lowest mean total cost of 400.5 for the CSC-UI framework and the lowest overall mean total cost of 629.9 among all the selected classifiers. It also has a relatively lower standard deviation of 335.9, indicating less variability in the total cost. Using the PART classifier can potentially lead to cost savings and more consistent results in terms of the total cost for anemia diagnosis. The OneR classifier has the highest mean total cost in all cost ratio methodologies, whereas JRIP and DecisionTable exhibit moderate performance in terms of total cost for anemia diagnosis.
The trend line information is shown in Table 9. The parameters slope (m) signifies the relationship between the input and output variables; higher values indicate a stronger relationship. The y-intercept (c) represents the value of the output variable when the input variable is zero; The R-squared (r2) values provide information about the goodness of fit of the model. Higher values indicate a better fit to the data. These facts support the some suggestions:
The JRip and OneR classifiers generally have higher slopes compared to the DT and PART classifiers, implying stronger relationships between input and output variables. The PART classifier consistently shows the lowest slopes and y-intercepts across the four classifiers. The DT classifier has the highest R-squared value for the third scenario (0.86), implying a relatively better fit for that specific case. The PART classifier has the highest R-squared value for the fourth scenario (0.88), implying a relatively better fit for that specific case.
In this binary classification task, PART Classifier produces better results with minimal total cost of misclassification when compared to other decision rule classifiers for all types of cost ratio using the approaches Uniform (U), Uniform Inverted (UI), Non-Uniform (NU) and Non-Uniform Inverted (NUI) as in the proposed framework. The algorithms have been ranked based on their classification performance in terms of total cost, with the order being PART, JRIP, DecisionTable, and One R. The PART classifier has demonstrated the best performance for the specific dataset under consideration; however, it’s important to note that these rankings may vary when applied to different datasets. This highlights the need for careful consideration and evaluation of algorithm performance across diverse datasets to ensure robust and reliable results in various scenarios. Since the algorithms within the proposed framework is a generalized one, the range of cost ratio can be scaled up to cover for denser ratios. The proposed theme for analyzing the cost sensitive classifiers through the control matrix, a novel method in the direction of controlling the cost. However, it is crucial to assess the trade-offs between cost optimization and other performance metrics, such as accuracy or precision, to extend this analysis for a holistic solution.
Funding
The authors report no funding.
Data availability
This study utilized the data described in the research work by Kasiviswanathan et al. [31].
Footnotes
Acknowledgments
The authors would like to thank Dr. A. Kumaravel, Professor, Department of Information Technology and Dr. T. Vijayan, Department of Electronics and Communication Engineering, BIHER, Chennai, India for their support and guidance.
Conflict of interest
The authors declare that they have no conflict of interest.
