Abstract
Malaria remains a major global health issue, with over 229 million cases and 409,000 deaths reported annually, particularly in sub-Saharan Africa. Current diagnostic methods, such as microscopic examination of blood smears, are time-consuming and often lack accuracy due to human error and variability in slide quality. This study introduces Malaria-Net, a novel framework integrating advanced data preprocessing techniques with a Parasite Specific Attention Convolutional Neural Network (PSA-CNN) for enhanced feature extraction and Probabilistic Extremely Randomized Trees (PERT) for classification. The proposed approach begins with preprocessing steps, including image normalization, augmentation, and noise reduction to improve image quality and consistency. The PSA-CNN focuses on relevant features specific to malaria parasites, enhancing the network's ability to distinguish between different stages of infection. The PERT is then utilized for classification, leveraging its ability to handle high-dimensional data and provide probabilistic outputs. This method aims to improve diagnostic accuracy and reduce the reliance on manual interpretation, offering a more reliable and efficient solution for malaria detection. The proposed Malaria-Net achieves an accuracy of 99.937%, demonstrating its strong overall classification performance. It shows high precision (99.669%) and recall (99.337%), indicating that the model correctly identifies positive cases and minimizes false negatives. The F1-score of 99.539% reflects a balanced performance, combining precision and recall into a single metric, confirming its robustness in malaria detection.
Keywords
Introduction
Malaria continues to be a significant global health burden, impacting millions of people every year. The world health organization reported approximately 229 million cases of malaria around the world in 2020, 1 and 409,000 lives were lost, most of which were in sub-Saharan Africa. Less than five-year-olds are at particular risk of dying from malaria, accounting for as much as 67% of all malaria-related deaths. 2 Despite all the ongoing efforts to decrease its prevalence, the parasite is still resistant to drugs, and it has not been easy to make an accurate diagnosis. The ability to detect and treat early reduces the mortality rate, although, in many regions, reliable diagnostic tools are still restricted. In healthcare applications, especially in remote and resource-poor rural areas where specialist microscopists are unavailable, accurate and rapid malaria diagnosis is necessary. Although widely used, traditional microscopy 3 is time-consuming and dependent on trained personnel to interpret blood smears, thus making it less feasible for large-scale screening. In such environments, automated diagnostic systems relying on sophisticated image analysis and Machine learning approaches can significantly deliver consistent, accurate results to improve patient outcomes through early intervention. 4 In addition, it integrates automated systems in mobile health platforms, thus helping in real-time disease monitoring and management in underserved regions.
Manual microscopic examination of blood smear images using trained visual malaria parasite 5 is the mainstay of current malaria diagnosis. Although it is considered the gold standard, this process is prone to human error, especially if the slide quality or parasite counts are low. Additional complicating factors include technical variability in the skill levels of technicians and staining technique variability that further make accurate diagnosis difficult. 6 However, this is a labor-intensive and time-consuming method also, which can cause diagnosis and treatment delays, especially in regions with very high patient loads during malaria outbreaks. However, the rise of artificial intelligence 7 and machine learning 8 in healthcare is an opportunity to overcome these challenges. Blood smear images were processed and analyzed by artificial intelligence-powered systems at speed and accuracy, thereby reducing the need for human expertise and increasing diagnostic accuracy. Deep learning and artificial intelligence techniques were used tremendously in medical image analysis, extracting complex features and patterns from large datasets. These capabilities can provide the basis for the development of artificial intelligence driven systems that could revolutionize malaria diagnosis through the ability to automate the identification of parasites in blood smears to return a faster, more reliable result. However, despite the promise of artificial intelligence and machine learning, 9 existing systems still face several limitations.
Another major problem is the requirement of large, high-quality labeled datasets to train these models properly, which are not always used for malaria detection. Furthermore, real-world performance is decreased by the variability in image quality caused by slide preparation, staining, and lighting conditions, which can pose challenges for artificial intelligence models. The other issue is overfitting, where the models have high accuracy on training data but not new and unseen data. Furthermore, many artificial intelligence systems are black boxes, 10 and healthcare professionals have trouble trusting or interpreting the decisions made by these models, making their widespread adoption by clinical practice a significant barrier. It is critical to leverage AI fully for malaria diagnostics.
This work's contributions include developing a novel Malaria-Net integrating enhanced PSA-CNN and PERT to improve blood smear image quality. Then, the PSA-CNN was introduced for efficient and optimal feature selection from blood smear images, and the challenge of variability in blood smear images was addressed. Finally, the PERT classifier was leveraged to achieve high accuracy and robustness in malaria detection.
The rest of the paper is organized as follows: Section 2 offers a detailed review of related work and current methods in malaria classification. Then, Section 3 describes the proposed method through image preprocessing, feature selection, and classification. In section 4, the results and the performance of the proposed method are presented. Section 5 concludes the paper by discussing findings and future research directions.
Literature review
This section provides a detailed analysis of each related work. Table 1 summarizes the related work, highlighting its strengths and limitations. In, 11 authors proposed a microscopic parasite malaria classification system, which, following feature selection using Generalized Normal Distribution Optimization (GNDO), was then used to propose a relationship between the various features. While the GNDO feature selection algorithm can perform well with high dimensionality when the feature number does not increase with the number of samples, it has limitations when applied in real-world applications where a large dataset of many images is involved. In, 12 authors applied deep learning-based malaria detection over analysis of blood samples using CNNs to identify malaria parasites in blood smear images. Although the proposed method works for image classification, it is prone to overfitting because the training data does not have sufficient diversity, especially in non-standard images from different sources.
Summary of related work.
Summary of related work.
In, 13 authors used CNN algorithms, whose dependence on a single deep learning model restricts its generalization ability for variabilities of microscopy image quality and lighting conditions. In, 14 deep learning and data augmentation were used to diagnose malaria, and they were supported by synthetic image-generated data augmentations to the training set. The drawback of this approach is the limited training data. However, the augmented images do not entirely describe real-world images, which results in misclassification when applied against actual patient samples. In, 15 the authors implemented a hybrid deep learning model consisting of the tail made of traditional CNNs and reinforced by some preprocessing techniques for detecting malaria. This method achieves high classification performance, but the hybrid nature of the framework in conjunction with the computational demands makes it less useful for resource-constrained environments.
In, 16 a deep learning framework was used in MozzieNet to detect malaria parasites from blood smear images efficiently. While the method performs well, it cannot handle noisy or unclear images as the source of false negatives and with the inconsistency of image quality between datasets. In, 17 a hybrid capsule network model of malaria parasite detection is introduced. It also learns spatial hierarchies in the data, which helps improve detection. However, capsule networks are computationally more expensive than CNNs and impractical in environments where diagnosis must be done quickly. In, 18 authors used an optimized YOLOv4 deep learning model to detect malaria in blood smear images. However, the model reduces detection speed slightly while decreasing its effectiveness for early-stage malaria detection as it is less effective for low counts. In, 19 authors designed a bioinspired CNN for automated malaria detection. The detection accuracy is improved through biological processes using this model. Although bioinspired approaches are sensitive to noise and artifacts in the data and produce inaccuracy given low-quality images, they are not sufficiently reliable for clinical applications.
In, 20 authors proposed the Deep Learning-based Random Forest Network (DLRFNet) framework for the malaria parasite classification. Then, a mix of the two methods, such as pixel intensities, are mapped into one dimension. Here, a random forest classifier is combined with a deep learning approach that increases the model's complexity and can entail longer training times and more complex hyperparameter tuning. In, 21 the authors employed machine learning and deep learning-based image analysis to detect malaria parasites. In their approach, a system is developed that integrates image processing with CNNs, which allows for the processing of a wide variety of images but is limited in its deployment in many clinical settings with varied imaging technologies due to its inability to support several different image formats. In, 22 transfer learning-based malaria parasite identification in conventional microscopic blood smear images. Despite this, there are challenges in the high variability of image quality and preparation, which causes inconsistencies in results and poor performance. In, 23 the proposal was to use an ensemble of object detection models for robust and reliable malaria parasite detection in thin blood smear images. Model ensembling is advantageous but not easily deployed in resource-constrained settings with limited computational resources due to the increased computational overhead and complexity. In, 24 the authors finally proposed an image-cropping method for malaria parasite detection due to heterogeneous data. Image cropping helps in feature extraction by focusing on the region of interest. However, the approach fails to capture the contextual information at the cropped area, resulting in reduced classification accuracy when parasites are sparse or irregularly distributed in the smear. In, 25 authors introduced the deep learning approach for malaria parasite detection to improve malaria diagnosis from blood smear images. Nevertheless, this method relies on extensive training data and the possibility of overfitting with high dimensional features, making the method unsuitable for application to smaller or less diverse datasets encountered within real-world clinical scenarios.
This work proposes Malaria-Net, a new algorithm that is a novel combination of advanced techniques not presented by existing surveys. This method integrates a unique data preprocessing pipeline with a PSA-CNN and PERT, improving existing drawbacks in malaria classification. Specialized preprocessing within Malaria-Net addresses image quality, consistency variability, and noise issues common to existing methods. By concentrating on parasite-specific features, the PSA-CNN increases the accuracy of feature extraction. Then, PERT also offers robust classification over high dimensional data and probabilistic results. This innovative combination provides a comprehensive and efficient solution superior to manual examination and improves diagnostic performance.
Figure 1(a) shows the training architecture of Malaria-Net. The blood smear image dataset is initially normalized to standardize lighting conditions and color intensity. Then, advanced noise reduction algorithms are used to clear up images and make the malaria parasites clear to remove artifacts and background noise inherent in manually taken images. The histogram equalization enhances the contrast between malaria-infected cells and background, improving cell structure visibility to identify and segment each cell. Further, The PSA-CNN is designed after preprocessing to highlight features typical of malaria parasites. This attention mechanism motivates network attention to be relevant to the image, which leads to more accurate feature extraction. Deep convolutional layers were used for the network to learn the complex patterns and hierarchical features of malaria parasites. It makes it easier to distinguish and classify the different stages of infection with higher precision. Then, train test splitting is applied, which splits the PSA-CNN features into 80% for training and 20% for testing. Here,

Proposed system model. (a) model training. (b) model testing.
Figure 1(b) shows the prediction architecture of Malaria-Net. A single image predicts normal and malaria classes with PSA-CNN feature extraction and PERT classification. The pre-trained models of Malaria-Net are used for faster prediction. Finally, the user can predict the classes from various images instantly.
The PSA-CNN is proposed to enhance malaria parasite classification with convolutional feature extraction and an attentive mechanism to emphasize relevant features in blood smear images. The proposed PSA-CNN structure is depicted in Figure 2. The operation of the PSA-CNN feature extraction algorithm is presented in Table 2, including several important steps, such as feature extraction through convolutional layers, the application of an attentive mechanism, and then the integration of these features for classification.

Proposed parasite specific attention convolutional neural network architecture.
Proposed parasite specific attention convolutional neural network feature extraction algorithm.
In the first stage of PSA-CNN, the blood smear image is processed to enhance relevant features and normalize the input. Images are operated on to resize, normalize, and augment all training and testing samples consistent with one another. The input image is represented as a matrix
CNN for spatial-temporal feature extraction
Once the input image is prepared, it is processed through a series of convolutional layers designed to extract spatial-temporal features. Each convolutional layer applies a set of filters
Here,
After convolution and activation, pooling layers are applied to reduce the spatial dimensions while retaining crucial information as presented in equation (3). A max pooling (
Here,
To focus specifically on features pertinent to malaria parasites, the PSA-CNN integrates an attention mechanism. This mechanism assigns different weights to different regions of the feature maps, emphasizing areas likely to contain relevant features. Equation (4) presents the attention mechanism. It is expressed as follows:
Here,
Here,
Here,
The loss value in PSA-CNN is presented in equation (7), which consists of a combination of classification loss and attention loss.
Here,
Here,
The Adam optimizer is used to adjust the network weights and to optimize the PSA-CNN model. Adam combines the advantages of momentum-based and adaptive learning rate optimization. The weight update rule presented in equation (9).
Here,
After applying the attention mechanism, the fully connected layers of PSA-CNN aggregate the attended features through pooling and fully connected layers to produce a final feature classification. Equation (10) presents the feature classification using a SoftMax classifier for multi-class classification.
Here,
The PERT classifier is a robust and efficient ensemble learning method that constructs multiple decision trees and aggregates their predictions to enhance classification accuracy. It introduces randomness at various stages, including data sampling and feature selection, to improve model generalization and mitigate overfitting. Table 3 shows the proposed PERT classification algorithm. Figure 3 shows the PERT architecture diagram. A random subset of the training data is selected for each tree in the ensemble. It introduces diversity among the trees, making the ensemble more robust. At each node in a tree, a random subset of features is chosen from the feature set.

Proposed probabilistic extremely randomized trees classifier.
Proposed probabilistic extremely randomized trees classification algorithm.
The classification begins by taking the feature vectors extracted by the PSA-CNN. These features, denoted as
Here, N represents the total number of training samples. This dataset serves as the input for the PERT classification process.
To introduce diversity among decision trees, PERT utilizes a bootstrap sampling approach, where each tree in the ensemble is trained on a randomly selected subset C of the entire dataset S. The subset C is generally smaller than S and is obtained via random sampling with replacement. This process ensures that different trees are exposed to varying portions of the dataset, improving the ensemble's robustness. The selected training subset is presented in equation (12).
Here, c represents the number of samples in each subset.
Each decision tree node selects a random subset of features to introduce additional randomness. Instead of evaluating all features, PERT considers only a small subset, ensuring that different trees explore different aspects of the data. It prevents overfitting and allows trees to specialize in different decision boundaries.
Determine best split
For each feature in the selected subset, the classifier identifies an optimal split point that maximizes information gain or minimizes impurity. The commonly used criteria for determining the best split include Gini impurity (
Here,
Here,
Once the best split is determined, the node is divided into two child nodes containing samples that satisfy the splitting condition. This recursive partitioning process continues until a predefined stopping criterion is met, such as the maximum depth of the tree or the minimum number of samples per leaf.
Repeat for remaining nodes
The process of feature selection, determining the best split and node splitting, is repeated recursively for each newly formed node. The decision tree continues to grow until the stopping criteria are satisfied, resulting in a complete tree.
Ensemble aggregation
Once multiple decision trees are constructed, their predictions are aggregated to make a final classification decision. Each tree independently classifies a given test sample, and the final prediction is determined by majority voting as presented in equation (15).
Here,
This section evaluated the performance of different malaria classification methods using the same dataset regarding primary metrics such as accuracy, precision, Recall, and the F1 Score. The robustness and accuracy of the models, such as Malaria-Net and others, are assessed to find the effective implementation of the models using consistent evaluative criteria. Each method is used to classify malaria in blood smear images, and the comparative analysis identifies which method performs best for malaria detection.
Simulation environment
The research uses Python programming language with a graphical processing unit environment. The hardware contains an Intel Core i7 10th Gen 10750H laptop with 16GB of RAM, 1TB SSD for storage, and an 8GB NVIDIA GeForce RTX 2070 graphics card to crunch the computing requirements. The working software environment is based on Python 3.7 and utilizes TensorFlow 2.x to build and train machine learning and deep learning models in the research. These resources handle the execution and processing of the simulation tasks, carefully selected to achieve maximum efficiency.
Dataset
The malaria classification dataset contains 27,558 blood smear images divided into two folders, i.e., Infected and Uninfected. The folder “Infected” includes images of red blood cells exhibiting malaria parasites, while “Uninfected” consists of images of healthy blood cells without infection. It is a microscopic image collection with a diverse and comprehensive helpful set for training and testing artificial intelligence models. Variations in quality, lighting, and staining, combined with the fact that the images are real-world mammalian bacterial detecting conditions and robust systems for these conditions are desired by both the biologist and industry, make the images well suited for training robust malaria detection systems. The available dataset from the National Institutes of Health gives a good opportunity to gain a step further in deep learning and image processing techniques to automate malaria diagnosis. This dataset is collected from a trusted source, guarantees its credibility, and is a key contribution to developing an AI-based model for malaria detection. Figure 4 shows the sample images. The breakdown of the malaria dataset into training and testing sets is given in Table 4. The dataset is split into 80% of images for training and 20% for testing. The ‘Infected’ and the ‘Uninfected’ classes have the same number of images in each class. The training set consists of 22,046 images, while the testing set contains 5512 images, leading to 27,558 images in the dataset. Here, the classes are balanced, which helps build a strong classification model.

Sample images from the dataset.
Dataset distribution.
Table 5 compares the hyperparameters of various deep learning networks, including the GNDO, 11 MOZZIENet, 16 DLRFNet, 20 and the Proposed Malaria-Net. It highlights the key hyperparameters and settings used in each network, demonstrating the distinct configurations and advantages of the proposed Malaria-Net in classification.
Hyperparameters of various deep learning networks.
Hyperparameters of various deep learning networks.
The learning rate determines how much the model's weights are adjusted concerning the loss gradient during each step of the optimization process. The proposed Malaria-Net uses a learning rate of 0.0001, slightly lower than GNDO (0.0005) and MOZZIENet (0.009). With a lower learning rate, the chance of overshooting the optimal point is lower, especially in medical image classification tasks such as malaria detection. The learning rate of the proposed Malaria-Net is optimal when considering the balance between slow convergence speed and high prediction accuracy.
Batch size
Number of epochs
The term is several epochs or how the training during the training as a whole number of times this dataset will pass through the model. As for the number of epochs (maximum 30 for the proposed Malaria-Net), it is less than for the GNDO (200 epochs) and MOZZIENet (150 epochs). It will also help prevent overfitting and reduce the training time, especially when the architecture is sophisticated with attention mechanisms and similar techniques like dropout, as seen in the proposed Malaria-Net.
Optimizer
Parameters are adjusted according to the gradient of the loss function. The Adam optimizer helps adapt the learning rate for each parameter independent of others to perform better in complex models and is used by the proposed Malaria-Net. In contrast, the Adagrad learning rate is adapted based on the previous gradient, which GNDO uses. In contrast, MOZZIENet uses Adadelta, a variant of Adagrad with bounds on past gradient accumulation.
Weight initialization
Weight Initialization is important to start faster and prevent vanishing exploding gradients. The proposed Malaria-Net mitigates gradient flow issues, which arise with ReLU activation functions, using He initialization, which is tailored explicitly for layers using the ReLU activation function. In contrast to GNDO's Normal (Gaussian) initialization or MOZZIENet's LeCun initialization, which is less suited for deep networks, He initialization is preferred.
Activation function
An activation function will produce an output for all the layers. The proposed Malaria-Net uses ReLU, a simple and deep learning network. It works around the vanishing gradient problem that can occur with other activation functions like Tanh used in GNDO and Sigmoid used in DLRFNet, making the training faster and more efficient.
Regularization technique
One of the regularization techniques is dropout, which helps prevent overfitting by randomly dropping units during training. The proposed Malaria-Net uses a 0.5 dropout rate, which is suitable for keeping the model shape regular yet not too complex. Here, MOZZIENet and GNDO underfit due to a higher value and tested with a higher dropout rate of 0.9, while the proposed Malaria-Net has a dropout rate of 0.5, like DLRFNet.
Learning rate
A learning rate scheduler is a method for adjusting the learning rate in training based on certain predefined conditions. The Proposed Malaria-Net uses a step decay scheduler (decreases the learning rate every 10 epochs). The model will fine-tune weights towards the end of the training process. Unlike GNDO, MOZZIENet uses cosine annealing, and DLRFNet uses cyclical scheduling, but step decay is well-tuned for fine-tuning large neural networks with complicated architectures.
Depth of network
The depth of the network stands by the complexity and learning of intricate features. Using a relatively simple architecture consisting of 2 convolutional, two max-pooling layers followed by one fully connected layer. This simplicity is highly efficient for learning, combined with more sophisticated techniques such as attention mechanisms. GNDO and MOZZIENet are architectures with more layers than the current state of the art, representing an increased computational complexity and a slight improvement in malaria detection performance.
Attention mechanism
The attention mechanism makes it easy for the network to focus on relevant regions in an image, which is helpful for tasks such as malaria parasite detection. The proposed Malaria-Net uses the PSA-CNN mechanism to utilize features specific to malaria parasites to enhance diagnostic accuracy. Instead of SoftMax, MOZZIENet uses Tanh, and DLRFNet uses Sigmoid, which is not as specific to where they focus on the image.
Loss function
It controls the process of optimizing the model. Categorical Cross entropy is suitable for multi-class classification problems like Malaria detection, and the proposed Malaria-Net uses it. It is more suitable for multi-class tasks than hinge loss and is used in GNDO and Kernal Divergence in MOZZIENet.
Early stopping
Early stopping helps prevent overfitting by stopping training when performance stops improving. The Proposed Malaria-Net uses monitored class-based early stopping, which stops training based on the performance of each class separately. It ensures the network does not continue training when performance becomes suboptimal for any class. It is a more nuanced approach than GNDO's minimum validation stopping or MOZZIENet's scheduled learning rate stopping.
Prediction results
In Figure 5, Output Predicted Probabilities (OPP) values represent the predicted outcomes on sample blood smear images. In Figure 5(a), the OPP values are {0.01, 0.99}, where 0.99, the higher probability, is associated with the “Uninfected” class. Similarly, Figure 5(b) has OPP = {0.02, 0.98}, and Figure 5(c) has OPP = {0.12, 0.88}, both showing higher probabilities for the “Uninfected” category, confirming the exact prediction. In Figure 5(d), the OPP values are {0.82, 0.18}, and in Figure 5(e), {0.76, 0.24}, both show higher probabilities for the “Infected” class. Finally, Figure 5(f) shows OPP = {0.95, 0.05}, with 0.95 indicating a strong prediction for the “Infected” category. Based on the OPP values assigned to each sample image, the values suggest high confidence in the respective predictions, whether infected or uninfected.

Predicted outcomes on sample images. (a), (b), (c) predicted as uninfected. (d), (e), (f) predicted as infected.
Table 6 comprehensively compares the performance of various malaria classification methods. The accuracy of a classification model indicates the proportion of correct predictions out of all predictions made. The proposed Malaria-Net achieves an impressive accuracy of 99.937%. Compared to GNDO, 11 which has an accuracy of 96.796%, this represents an enhancement of approximately 3.243%. Similarly, compared to MOZZIENet, 16 with an accuracy of 97.740%, the proposed Malaria-Net shows an improvement of 2.243%. Finally, compared to DLRFNet, 20 which achieves an accuracy of 98.805%, the proposed Malaria-Net's accuracy is 1.143% higher. These improvements underscore the robustness of the Proposed Malaria-Net in accurately classifying instances.
Performance comparison of various classification methods.
Performance comparison of various classification methods.


Accuracy and Loss Graphs. (a) Validation accuracy. (b) Training accuracy. (c) Training loss. (d) Validation loss.
The precision is the ratio of the correctly predicted positive to the total predicted positives. The Malaria-Net proposed attains a precision of 99.669%, which is higher than 96.232% reported in GNDO 11 and is an improvement of 3.574%. The proposed Malaria-Net has a more significant improvement of 1.862% than MOZZIENet, 16 which has a precision of 97.852%. Furthermore, compared to a precision of 98.240% in DLRFNet, 20 the proposed Malaria-Net has been enhanced by 1.453%. These striking increases suggest that the proposed Malaria-Net can better remove false positives and thus improve the credibility of optimistic predictions.
The recall or sensitivity ratio is the number of correctly predicted positive observations to the number of observations in the actual class. The proposed Malaria-Net recall is 99.337%, which is an improvement of 3.013% compared to the Recall of GNDO 11 (96.324%). Compared to MOZZIENet, 16 Malaria-Net improves recall by 1.413% (97.951%). Last, the proposed Malaria-Net achieves 0.515% more than the Recall of 98.822% of DLRFNet. 20 The Proposed Malaria-Net has improved since these three methods, and these improvements indicate that the Proposed Malaria-Net is more effective in identifying positive instances than the other three.
The F1-Score is the harmonic mean of precision and recall, fitting well between the two. In addition, the proposed Malaria-Net achieves an F1-Score of 99.539%. It represents an improvement of 2.612% from the F1-Score of 96.924% for GNDO. 11 The F1 Score obtained for the proposed Malaria-Net compared to MOZZIENet, 16 which has an F1 Score of 97.265%, is 2.338% enhancement. In the last, we achieve the F1-Score of 98.848% using DLRFNet 20 's, which is 0.691% higher than Malaria-Net's F1-Score of 98.188%. The improvements in the Proposed Malaria-Net's balanced performance are precision and recall.
Figure 6 shows the confusion matrices of four different malaria detection models, such as GNDO, 11 MOZZIENet, 16 DLRFNet, 20 and the proposed Malaria-Net. Each model's confusion matrix will assess the classification's performance depending on the true positives, true negatives, false positives, and false negatives present, giving details of the accuracy and the possible misclassification errors the models have made. Although the GNDO 11 model provides strong performance with an accuracy of 96.796%, several misclassifications exist—particularly to correctly identify some malaria parasite types. Detecting targets using the MOZZIENet model 16 with a slightly increased accuracy of 97.740% bests the case of general image quality inconsistency, which leads to a few false negatives. Although the DLRFNet obtains an even higher accuracy of 98.805%, 20 this still produces some errors due to long training time and increased complexity. The most accurate model is the proposed Malaria-Net, which achieves an accuracy of 99.937%, ensuring the highest ability to accurately classify malaria parasites with very low misclassification, confirming its success in dealing with the related issues of its previous models like overfitting, variability in image quality, and computational requirements.
Figure 7 shows the networks presented, such as Malaria-Net, DLRFNet, 20 GNDO, 11 and MOZZIENet, 16 and comparative accuracy and loss graphs for these networks. Figure 7(a) illustrates the validation accuracy of each model, where the proposed Malaria-Net reaches the highest accuracy compared with DLRFNet, MOZZIENet, and GNDO. The proposed Malaria-Net generalizes well to unseen data, with less overfitting than the others. Figure 7(b) illustrates the training accuracy, where the proposed model again leads, indicating that it learns the training data most effectively. Figure 7(c) displays the training loss curves, where the proposed Malaria-Net shows the lowest training loss, indicating that it converges more quickly and efficiently than the other models, which exhibit higher and more fluctuating training losses. Figure 7(d) depicts the validation loss, where the proposed Malaria-Net maintains the lowest loss throughout the training process, reflecting its superior ability to balance accuracy with generalization. In comparison, the other models show higher validation losses, with GNDO 11 exhibiting the highest, suggesting overfitting issues or less effective optimization.
Table 7 presents an ablation study of the proposed Malaria-Net, illustrating the impact of excluding key components such as preprocessing and PSA-CNN feature extraction on the performance metrics. When preprocessing is excluded from the Proposed Malaria-Net, the accuracy drops to 97.324%, reducing 2.613% from the overall accuracy of 99.937%. Similarly, precision falls to 97.926%, a decrease of 1.743% compared to the overall precision of 99.669%. Recall also experiences a decline, reaching 97.618%, which is 1.719% lower than the 99.337% achieved by the complete methodology. The F1-Score, balancing precision and recall, decreases to 97.988%, marking a reduction of 1.551% from the overall F1-Score of 99.539%. These figures highlight the significant role of preprocessing in enhancing the model's overall performance.
Ablation study of proposed Malaria-Net.
Ablation study of proposed Malaria-Net.
Excluding PSA-CNN feature extraction impacts the methodology's efficacy, albeit to a lesser extent than omitting preprocessing. Without PSA-CNN feature extraction, the accuracy is 98.099%, 1.838% lower than the overall accuracy. Precision drops to 98.708%, a decrease of 0.961% from the complete methodology. The recall is 98.245%, reflecting a reduction of 1.092% compared to the overall recall. The F1-Score without PSA-CNN feature extraction is 98.724%, 0.815% lower than the overall score. These reductions underscore the importance of PSA-CNN feature extraction in optimizing the model's performance, though its absence has a slightly lesser impact than the absence of preprocessing. Together, these components significantly contribute to the high performance of the proposed Malaria-Net, as evidenced by the overall metrics.
The proposed Malaria-Net framework presents a novel and effective solution for automated infected and uninfected cell classification from blood smear images, addressing the limitations of existing methods, such as manual examination errors, variability in slide quality, and inefficient feature extraction. The method significantly improves diagnostic accuracy and reliability by combining advanced data preprocessing techniques, the PSA-CNN for targeted feature extraction, and PERT for probabilistic classification. This hybrid approach not only overcomes the drawbacks of traditional methods but also enhances the automation and scalability of malaria diagnostics. This methodology was expanded in the future by integrating additional parasite-specific attention mechanisms to detect various malaria species or other blood-borne pathogens. Furthermore, more extensive and diverse datasets and real-time processing capabilities could extend their applicability to remote and resource-constrained regions. Incorporating advanced explainability techniques for medical professionals to interpret results and combining the framework with telemedicine platforms could revolutionize global malaria surveillance and management systems.
Footnotes
Funding
The authors received no financial support for the research, authorship, and/or publication of this article.
Declaration of conflicting interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
