Abstract
In recent days people are affected with lung cancer in, and the severe stage of this disease leads to death for human beings. Lung cancer is the second most typical cancer type to be found worldwide. Pulmonary nodules present in the lung can be used to identify cancer metastases because these nodules are visible in the lungs. Cancer diagnosis and region segmentation are the most important procedures because the prosperous prediction-affected area can accurately identify the variation in cancer and normal cell. By analyzing the lung nodules present in the image, the radiologists missed several useful low-density and small nodules, and this may tend to the diagnose process very difficult, and the radiologists needs more time to decide the prediction of affected lung nodules. Due to the radiologist’s physical inspection time and the possibility of missing nodules, automatic identification is needed to address these issues. In order to achieve this, a new hybrid deep learning model is developed for lung cancer detection with the help of CT images. At first, input images like CT images are gathered from the standard data sources. Once the images are collected, it undergoes for the pre-processing stage, where it is accomplished by Weighted mean histogram equalization and mean filtering. Consequently, a novel hybrid segmentation model is developed, in which Adaptive fuzzy clustering is incorporated with the Optimized region growing; here, the parameters are optimized by Improved Harris Hawks Optimization (IHHO). At last, the classification is accomplished by Ensemble-based Deep Learning Model (EDLM) that is constructed by VGG-16, Residual Network (ResNet) and Gated Recurrent Unit (GRU), in which the hyperparameters are tuned optimally by an improved HHO algorithm. The experimental outcomes and its performance analysis elucidate the effectiveness of the suggested detection model aids to early recognition of lung cancer.
Keywords
Introduction
In recent days, the rapid enlarging cancer and other primary cause of death is lung cancer. Lung cancer is determined as a crucial health issue, and it is a highly diagnosed cancer in worldwide based on the reports of the World Health Organization’s (WHO’s) [1]. The method for identifying lung cancer is more important. Since, it gives more relevant data to practitioners and radiologists for making better decision about the detection of lung cancers. The study of the causes of the disease is useful for great management to save the lives of humans, animals and swarms [2]. Cancer is a terrible medical condition that encompasses more than 200 different dangerous medical problems and is not just one illness [3]. Cancer is generally defined as, any disease that is characterized by contaminated cells, unchecked growth and aberrant spread. If the spread of abnormal cells is not properly regulated, the anomaly identified in cells results in death and impairment [4]. Infectious agents, organisms, chemicals and tobacco are the external risk factors for the spread of cancer. Internal risk factors include immunological disorders, mutations and hereditary hormones. Various techniques are utilized to locate the categories and detect lung cancer at the early stages. The “Computed tomography (CT)” and “Magnetic resonance imaging (MRI)” are the two best approaches for detecting and diagnosing lung cancer [5]. The CT is an important to modal the image for diagnosing and monitoring the lung cancer. The benefit of CT is detecting lung cancer in its early stages and allowing doctors to recommend more efficient treatments [6].
Because of the compound anatomical area’s nodular position, tiny size and low contrast in the lung, the affected lung nodule is difficult to identify in its early stages [7]. The tumorous pulmonary nodules in CT thorax images are 3cm or less in size and round shaped. Therefore, detecting and diagnosing the tumorous pulmonary nodules are complicated. Automatic lung cancer prediction uses multiple techniques to identify the disease, including classification of cancer, selection of features, and extraction of cancer features, region segmentation and removal of image noise [8]. Some segmentation methods are self-organizing maps, fuzzy k-means clustering, fuzzy c-means, sobel detection, canny edge detection, distributed clustering and k-means clustering [9]. The likelihood of rectifying lung cancer is significantly higher with earlier observation and treatment. Monitoring of lung nodules is the first stage of classification, and other effective methods of identification lead to better outcomes. When compared to single view radiography, advanced recognition of lung cancer lessen the mortality rate by 20% [10]. However, diagnosing cancer from the image of a lung is based on the true negative and false positive results, burdening the patient’s using extra testing, expense, and stress as well as the doctor’s added workload [11]. The problem in the effectiveness of lung cancer treatment is inherited concerning forecast accuracy. In the field of biomedical, a sensitive process requires high qualification and time for the diagnosis and evaluation of the lung CT image, and it is done by field experts [12].
The screening techniques inspect the lung cells and cell variations that are used to forecast lung cancer, but maintaining the predictability is complex. CT is one of the screening technologies that efficiently identify the variations and alterations that are presented in the human body. These are discovered by exposing the body to X-rays [13]. Despite successfully predicting lung cancer, typical automatic systems still struggle with recognition accuracy rate [14] and take longer to analyze huge amounts of data. Furthermore, the system does not handle CT images of minimal quality, which may result in higher misclassification rates and incorrect lung features [15]. The enormous success of deep learning approaches in image identification and recognition is leads to various medical imaging issues. “Convolutional Neural Network (CNN)” excels the other modern techniques in a variety of medical image problems [16]. It is also better than human experts and other approaches in regards to accuracy and sensitivity. Therefore, to create reliable cancer identification, the clinical centre must implement Computer Assisted Automated Detection (CAD) technology [17]. Some of the drawbacks of the existing methods are could not be solved, and it is listed below. In existing approaches, the early detection of lung cancer is not possible for the ResNet model. Moreover, it fails to handle the huge amount of data while the training process takes place in the VGG-16 technique. Thus, it leads to cause overfitting issues. In GRU, the accurate position of the lung nodules is not detected exactly while screening takes place in the lung nodules. However, the existing model shows a high error rate thus, the misclassification problems occur while detecting lung cancer. To resolve the above-mentioned problems, a new IHHO-EDLM method is introduced to detect lung cancer better. Moreover, the offered model provides enhanced performance in terms of accuracy. Here, the classification performance can be increased due to the error rate reduction. Here, the pre-processing takes place; hence; it can minimize the background noise to improve the quality of the image to detect lung cancer. Moreover, the segmentation is performed using the VGG-16, ResNet and GRU to locate the exact position of the lung cancer effectively.
The contributions of the designed ensemble-based deep learning lung cancer detection model are mentioned as follows.
To adopt an ensemble-based deep learning model to detect and identify lung cancer at the early stage. In addition, it is helpful for patients who have breathing issues. The enhancement of the recommended technique is broadly applicable to healthcare applications. To develop an IHHO algorithm for lung cancer detection. With the help of recommended IHHO algorithm, the parameters of maximum iterations and seed range are optimized to get better performance. Moreover, the offered IHHO algorithm is used to improve the feature propagation. However, the complexity issues while optimizing the complex parameters are resolved. To segment the lung nodules, the adaptive fuzzy clustering and region-growing techniques are used for attaining effective segmentation, where the parameter is optimized to enhance the segmentation outcome. The segmented images make it easier to detect the affected lung nodules in an accurate manner. The segmentation takes place without losing valuable information. To obtain the final outcome of lung cancer detection, an EDLM is used. In this EDLM, the VGG-16, GRU, and ResNet are used to develop EDLM, where the parameters are optimized from EDLM to get higher detection results with respect to accuracy and precision. Moreover, the recommended EDLM technique is performed to resolve the overfitting issues and underfitting issues. The experimental outcome of the offered IHHO-EDLM-based lung cancer detection model is analyzed among several optimization techniques and other existing works to verify the performance of the recommended lung cancer detection model.
The upcoming sections of the ensemble learning-based lung cancer detection model using CT Images are explained in further sections. Here, Section 2 consists of a variety of existing lung cancer detection models with its features and drawbacks. Section 3 consists of a structural representation of the designed EDLM-based lung cancer detection model, and it describes about dataset and system model. Section 4 describes about the proposed algorithm and segmentation process in detail. Section 5 depicts about EDLM-based deep learning model and explains about the classifiers that are used to detect lung cancer. Section 6 includes the results and discussion of the suggested lung cancer detection model. In Section 7, the summarization of the developed EDLM-based lung cancer detection model using CT images is given.
In this section, several existing research work is focused and elaborated in Sub-section 2.1. Here, the diverse existing techniques are focused for detecting lung cancer using CT images. However, the challenges and benefits of the existing model are listed in the Sub-section 2.2. Owing to the limitations of the existing model, we developed a novel technique for the lung cancer detection model.
Related works
In 2019, Shakeel et al. [18] have introduced a lung cancer detection framework from CT images by utilizing deep learning and clustering approaches. Standard datasets were used for collecting the lung CT images. Histogram equalization has been used to remove the noise. Similarly, the profuse clustering technique was used for splitting the affected area in the image. For predicting lung cancer, those images were examined by utilizing Deep Neural Networks (DNN), and the simulation results were obtained using MATLAB. This approach gave better outcomes among lung cancer detection with minimum errors and better accuracy.
In 2021, Alzubaidi et al. [19] have investigated a feature extraction framework, that is used for identifying lung cancer. This implemented framework contained three main parts: i) collection of data’s ii) global and local training, and iii) testing. Image cropping and warping methods were used to preprocess the accumulated images during the testing and global training stage. Then the global features were retrieved from the accumulated images for representing every image with the feature vectors. The learning algorithms with the best performance in the global phase were used to build the detection model by the feature vectors. Every image was splitted into a group of local blocks in the testing and local training stage. From the results, the gabor filter features have obtained better specificity, sensitivity and accuracy rates.
In 2021, Sori et al. [20] have investigated a deep learning technique for detecting lung cancer using denoised CT scan images. The recommended “Denoising First Two-Path CNN (DFD-Net)” contained two parts for detecting and denoising. In the preprocessing phase, the noise was removed by using the denoising model. For detecting lung cancer, a two-path CNN was used, where the input was the noise removed images. The combination of global and local features was focused by two paths. Then the image contrast also balanced by utilizing the implemented retraining technique. Therefore the suggested model was useful in noise reducing, adapting different sizes and structures and balanced the size of the receptive field. This implemented method achieved competitive results.
In 2020, Karthiga and Rekha [21] have described a dynamic algorithm named “Accelerated Wrapper-based Binary Artificial Bee Colony (AWB-ABC)” for detecting lung cancer in its early stages. “Improved Naive Bayes classification (I-NBC)” of images was done to classify the different cancer stages. In the pre-processing phase, image enhancement and removing noise were done using Mini–mental state examination (MMSE). The pre-processed image was used for the retrieval of morphological features. By using the AWB-ABC algorithm, the features which have a great impact on causing lung cancer were chosen. For the final classification, an I-NBC algorithm was used. MATLAB was used for the simulation. Therefore, by using this proposed method, the classification accuracy has been improved.
In 2020, Shakeel et al. [22] have implemented an improved DNN and ensemble networks for the automatic identification of lung cancer. A dataset of CT scan images was collected to identify lung cancer. Deduction of the noise from the image and quality enhancement of the image was done by utilizing the multilevel brightness-preserving technique in the accumulated images. The affected area in the image was segmented, and also different features were retrieved by improved DNN. “Hybrid spiral optimization intelligent-generalized rough set technique” was used to choose the efficient features from the retrieved features. Then an ensemble network was used to classify the efficient features. This implemented method has given high lung cancer prediction results.
In 2019, Manickavasagam et al. [23] have proposed a “Neuro-Fuzzy Classifier” for detecting lung cancer automatically. This method contained two phases. During the first phase, the classification of abnormal or normal images was done using the Naive Bayes model. Similarly, in the second phase, various lung cancer stages have been classified. For the classification and detection process, 10 features were selected from the lung image. Therefore, the implemented algorithm classified lung cancer very effectively with the optimal selection of features. The results of the experiments were shown that the investigated model gave an improved classification accuracy rate.
In 2020, Halder et al. [24] have implemented an adaptive morphology-based segmentation method for identifying lung cancer. An adaptive morphological filter was used to identify the lung nodules using an “Adaptive Structuring Element (ASE)”. It was used to give higher accuracy outcome of the lung nodule detection process. For feature extraction, the detected candidate nodule regions have been processed. This method has reduced false positives. For detecting lung nodules, the “Support Vector Machine (SVM)” was used with intensity and texture-based features. This implemented method was accomplished with better outcome while detecting lung nodules automatically.
In 2020, Togacar et al. [25] have introduced a maximum relevance feature selection with a CNN-based lung cancer detection mechanism to give reduced redundancy results. In the classification and feature extraction process, CNN was used. The image augmentation approaches were applied to improve the classification rate among the collected CT images. By using the AlexNet model, the features were retrieved, and it was applied as an input to “softmax classifiers”, “SVM, Decision Tree (DT)”, “K-Nearest Neighbor (KNN)”, “Linear Discriminant Analysis (LDA)” and “Linear Regression (LR)” for checking the efficiency of the developed model. A better classification rate has been obtained by using both the NN classifier and the AlexNet model. This proposed model has given stable results while detecting lung cancer.
Merits and demerits of existing lung cancer detection model
Merits and demerits of existing lung cancer detection model
Lung cancer is the most disastrous disease rather than people get affected by breast or colon cancer. The detection and segmentation of cancer affected lung nodules becomes the major issue to treat patients. Also, various approaches are developed to the early detection of cancer but exist with some noteworthy challenges. The advantages and disadvantages of existing lung cancer detection techniques are given in Table 1. IPCT [18] acquire high accuracy and less classification error. But, for training the features, it consumes more time. SVM with HOG [19] maximizes sensitivity, accuracy and specificity. Yet, it creates overlapping issues as it comprises multiple images. CNN [20] achieves a better detection rate. However, sometimes the data get lost during the implementation of the denoising process. I-NBC [21] obtains higher accuracy results and reduces the error value. But, it does not entail multiple classifiers to enhance the performance. Ensemble [22] increases the detection accuracy and mitigates the dimension. On the other hand, the performance gets degraded since it contains a computational burden. Neuro Fuzzy Classifier [23] obtains more precision and accuracy. However, it does not utilize other factors like location, growth rate and lung nodule size. AMST [24] yields desired results with distinct performance measures. It does not provide effective segmentation results since it has low-intensity variation among the nodules and structural complexity. CNN [25] provides a high success rate in accordance with specificity, sensitivity and accuracy for cancer detection. But, it does not consider other medical imaging techniques for performance enhancement.
Motivation
Some limitations of the existing approaches are shown here. In the existing model, the training time is high. Thus, a timely diagnosis is impossible for radiologist to detect lung cancer. Here, the parameter optimization is not performed to get a better solution. On account of all aforementioned challenges and limitations, it is provoked to implement a novel method for the detection of lung cancer using CT images. Here, the developed model resolves the complex issues while the training the network. Moreover, it tends to manage a huge amount of data. Consequently, the offered model provides better intensity variations, which helps to identify the exact location of the lung nodules. The validation takes place with various evaluation measures for an effective outcome. Additionally, the accuracy rate is enhanced in the developed model.
Lung cancer detection using CT images: System model and dataset model
The section explores the developed model for lung cancer detection. Initially, the research gaps has been briefly expanded and then, the architecture of the developed ensemble-based deep learning model is visualized and evaluated in Subsection 3.1. In Subsection 3.2, the data that has been collected in this research work is explained. After the data collection, the preprocessing techniques are performed, and it is described in Subsection 3.3.
System model: Proposed lung cancer detection
Among the variety of cancer types, lung cancer is the major death causing cancer. Smoking is the main cause of affecting lung nodules. Therefore, the starting stages are necessary, and the initial stage detection may helpful to cure or control lung cancer [38]. Different approaches are used to identify lung cancer. But, the predicted results are not accurate all the time, and false identification of cancers is also possible during the detection of lung cancer. The prediction models require more cost to maintain and analyze the results effectively. For performing lung cancer detection, high contrast CT images are required. High radiation exposure is one of the main disadvantages of the existing lung cancer detection models. Automatic identification of lung cancer is necessary for avoiding the small and low density missing nodules. Moreover, the precise and accurate detection of affected lung nodules is essential to diagnose lung cancer at the initial stage. The deep learning techniques have the ability to learn the features very effectively from the CT images that may be useful for identifying the affected lung nodules. Therefore, to overcome those difficulties, an ensemble learning-based deep learning model is implemented. The architecture of the implemented ensemble-based deep learning model for detecting lung cancer is given in Fig. 1.
The architecture of the implemented ensemble-based deep learning model for detecting lung cancer.
An ensemble-based deep learning model for lung cancer detection model is performed using CT Images to reduce the chance of people dying from lung cancer. In order to implement this, the CT images are collected from the benchmark data links. Then the collected raw images are preprocessed using the Weighted mean histogram equalization and mean filtering. These techniques magnify the image quality. Then the lung nodule segmentation is performed using the Adaptive fuzzy clustering and region growing approaches. In adaptive fuzzy clustering, maximum iterations are optimized using IHHO. Similarly, from the region growing approach, seed ranges are optimized using IHHO. This optimization of maximum iterations and seed range during the segmentation stage may improve the detection accuracy of the investigated framework. The segmented images are given to the ensemble learning network, where the images are detected using VGG-16, ResNet and GRU. Moreover, epochs from VGG-16, hidden neuron count and epochs from GRU, hidden neuron count and epochs from ResNet are tuned by the IHHO. The predicted scores from the used ensemble-based deep learning model such as vgg-16, ResNet and GRU are averaged and the final prediction score is obtained and this score is used for detecting lung cancer. The prediction results obtained from the implemented ensemble learning-based model for identifying lung cancer with the help of CT images are compared with other lung cancer detection models to ensure the effectiveness of the IHHO-EDLM-based lung cancer detection.
Dataset 1 (“Chest CT-Scan images Dataset”): The required CT images are gathered from Chest CT-Scan images that are taken from the dataset link of
Sample images gathered from the dataset for lung cancer detection.
The collected sample CT images are indicated by
The CT image pre-processing is performed in the implemented lung cancer detection model for enhancing the quality of images without losing any information in the original image. The sample image
Weighted mean histogram equalization [26]
The most beneficial method for improving the brightness of the images is weighted mean histogram equalization. This approach is used for adjusting contrast in image. In this process, the information in images is denoted by close contrast values for improving the global contrast. The input image
The statistical procedure of weighted mean is computed by multiplying each product’s weight by the quantitative result. A weighted mean is produced by dividing the resultant by the total weights attached to the observations. The arithmetic mean is identical to the weighted mean if all observation weights are the same. The formula for the weighted mean is given below in Eqs (1) and (2), respectively.
Here, WM defines the weighted mean,
After histogram equalization, the resultant image
Soft computing-based lung nodule segmentation using adaptive fuzzy clustering and optimized region growing
In this section, the brief explanation of the developed IHHO algorithm is further explored in sub-Section 4.1. Moreover, the algorithm for the IHHO is provided, and also the flowchart for the suggested IHHO algorithm is shown in Sub Section 4.1. Then, the lung module segmentation takes place using the pre-processed image. Moreover, the adaptive fuzzy clustering, as well as the region growing approach, is utilized for segmenting the lung nodules in sub Section 4.2. The output images that are obtained by the preprocessing and segmentation are visualized in Subsection 4.3.
Improved harris hawks optimization
An IHHO is proposed in the implemented ensemble-based deep learning model to detect lung cancer. It is used for optimizing the parameters such as seed size from adaptive region growing and maximum iteration from adaptive fuzzy clustering are optimized during segmentation to improve the segmentation performance. Moreover, the parameters like epochs from VGG16, ResNet, and GRU are tuned, and also the hidden neuron count from ResNet and GRU are optimized by the IHHO for maximizing the accuracy and precision of the lung cancer detection model. The HHO approach can manage high dimensional data. When compared with other optimization techniques, HHO provides high performance and provide better optimization results. Still, HHO has some drawbacks like population diversity, local optima and feature selection problems. To rectify these issues, the IHHO algorithm is proposed. By using this IHHO algorithm, the precision and accuracy of the suggested lung cancer identification using ensemble learning framework is maximized. The IHHO is developed in accordance with the current fitness function, worst fitness function and also the current fitness function. The value of
The variable
HHO [28] is a population-based optimization approach, and it is formulated based on the behavior of nature. It is a gradient-free optimization technique. The surprise pounce chasing technique and cooperative attitude of Harris’ hawks act as an important source of inspiration for HHO. Based on the static natural situations and the prey’s flying movements, Harris hawks can pursuit a numerous strategies.
Exploration phase
By the powerful eyes of Harris’ hawk, it can find and identify the prey. But, the prey cannot be seen periodically. In order to find prey, the hawks will observe, detect and wait for the prey, in desert areas for hours. The Harris’ hawks rest at various locations, and it will find prey based on the two strategies. For all resting strategies, an equal chance for
In the above equation,
Based on the other hawks and different locations, the solutions are generated by the first rule. In the second rule, a random scaling coefficient is considered to find different locations. By using Eq. (5), the average position of the hawk is achieved, and it is given in Eq. (5).
Here,
The exploration phase has the ability to change into the exploitation phase in the HHO algorithm. Based upon the prey’s energy to run away from the hawk will lead to a change the functioning of the explorative phase. When the prey tries to run away from the hawk, the prey’s energy level will decrease. Therefore, the energy level of prey is considered in Eq. (6).
In the above equation,
The prey that is found from the exploration phase faces the surprise pounce from the hawk’s attack. When the prey tries to escape, various chasing modes will occur. There are four chasing modes are used in the proposed HHO algorithm.
Assume the prey’s chance of runaway is
Soft encircling
When the rabbit attempt to escape from the hawk but, it cannot able to escape and it still has the energy to escape, then consider
Here, the difference between the rabbit and position vector is represented in the term
The variable
When the prey does not have the energy to escape and the same time, the hawk will handle the hard procedure to surround the prey, then
At the moment
In the HHO algorithm, the Levy Flight (LF) concept, various run away methods of prey and leapfrog movements are used. The hawks do more dives in order to catch the prey. They choose the best dive to surround the prey in terrific situations. Hence, the hawks calculate the next move to encircle the prey in a soft manner. This calculation is shown in Eq. (11).
The hawks evaluate their previous dive whether it is good enough to catch prey, if not it start doing twisted dives to catch the prey. Therefore, the hawk’s dive using the LF-based model. It is given in the following Eq. (12).
Here, the term
In the above equation
The rabbit cannot escape at
Here, the value of
In the above equation
Flowchart of the proposed IHHO.
Lung Nodule Segmentations are done in the developed lung cancer detection models for splitting the lung CT images into several image segments. Segmentation is used to rationalize the portrayal of an image to analyze it easier. It is also used to identify the curves and lines in images. In this ensemble learning-based deep learning lung cancer detection model, adaptive fuzzy clustering and region growing techniques are used for segmentation.
Adaptive fuzzy clustering [29]
Splitting the data into equivalent areas according to the likelihood of the items is known as clustering. Data that is logically and physically comparable is combined in order to decrease the number of disc accesses and maximize the efficiency of the database system. The clustering approach involves grouping the
Domains like fuzzy identification and pattern recognition have adopted in the fuzzy clustering method. Many different fuzzy clustering techniques are initiated, and the majority of them are based on distance standards. Computation of fuzzy weights using reciprocal distance is known as FCM, and it is the most popular approach. It requires a set of clusters as input, denoted by
Each feature vector can belong to many clusters using the FCM, each with a different fuzzy membership value. Final categorization will be done by choosing the highest feature vector’s weight across all clusters. The comprehensive algorithm is given below:
Input is the image pixels that contain three RGB dimensions and
Every pixel is assigned with a random weight. It makes use of fuzzy positive weight The initial weights are equalized using Eq. (19).
For obtaining
Calculate the new centroid value using Eq. (21).
Upgrade the weights using Eq. (22).
Repeat the process from step 3, if any change in input or else stop. Every cluster is assigned with pixels based upon the highest weight.
Lung segmentation model using adaptive fuzzy clustering and region growing technique.
Resultant images after the pre-processing and segmentation process.
After the adaptive fuzzy clustering, the segmented image
The resultant images
Create pattern cells from the entire image. Using a similarity metric, every pattern cell is compared to the neighboring cells to check if they are similar. If the cells are similar, combine the cells to create a fragment and upgrade the property used for comparison. Examine every neighbor of the fragment until there are no more joinable regions. Therefore, the completed segment of the fragment is tagged. Continue in this manner until all cells have tags, and then go to the next blank cell.
The resultant image from the region’s growing process is defined as
The resultant images obtained after the pre-processing and segmentation process is shown in below Fig. 5.
Intelligent lung cancer detection by optimized ensemble-based deep learning model
In Section 5, the detailed description that is related with deep learning models like VGG-16, ResNet, and GRU are discussed in Subsection 5.1. However, the developed ensemble based deep learning model is explained in Subsection 5.2.
Description of all ensemble models
Ensemble models uses various modeling algorithms to provide a final outcome. It collects the results of each algorithmic models and gives the final predicted results. Ensemble-based deep learning models are used to predict lung cancers in the developed lung cancer identification model, which incorporates VGG-16, ResNet and GRU models to formulate the ensemble learning model. The segmented image
VGG-16 [31]
The VGG-16 network contains 16 various convolution layers, and it visualizes a tiny 3
Residual Network (ResNet) [32]
The production of every residual layer is combined with the input of the next layer. In ResNet, the gradients can flow back, enables fast training and add many more layers. The convolutional and identity blocks make up the two fundamental building components of the ResNet model. The identity block has equal proportions. Identity blocks have three elements. The initial element is a 2D convolutional layer, and the filter size is
Developed EDLM-based lung cancer detection model.
Performance evaluation of the developed ensemble learning-based deep learning lung cancer detection model with diverse algorithms regarding “(a) Accuracy, (b) F1-Score, and (c) Precision”.
Performance evaluation of the developed ensemble learning-based deep learning lung cancer detection model among different existing classifiers in terms of “(a) Accuracy, (b) F1-Score, and (c) Precision”.
5-fold analysis of the designed ensemble learning-based deep learning lung cancer detection model with existing algorithms in terms of “(a) Accuracy, (b) F1-Score, and (c) Precision”.
In a convolutional block, the proportions of output and input are not the same. Linear projections are performed by shortcut connections to alter the proportions. The structure of the convolutional block is similar to the identification block but in the convolutional block, a 2D convolutional layer is added in a shortcut way. In this way, the size of the input changes in order to match the main path. At last, the altered shortcut is merged with the resultant output from the main path. The disappearing gradients are controlled in the modified shortcut, and it is the most beneficial process of the convolution block. Finally, the lung cancer prediction score is obtained at the final layer.
The upgraded version of the Long Short-Term Memory (LSTM) neural network is known as the GRU neural network model. It optimizes the LSTM’s network structure, and it reduces three gating units into two gating units of LSTM. The reset and update gates are the two gating units. Moreover, the characterization of the GRU model is low, and also it maintains the information reliance over a long distance. GRU model is composed of three layers such as implicit layer, input layer and output layer. The collection of GRU neurons forms the hidden layer. Time series data are the input of the GRU model. If the input is
Here, at the time
Hence, the GRU model reduces the gating units to preserve the main information and deduct the unnecessary information. For storing the information, it uses the hidden layer. Then, the lung cancer detection score is attained at the output layer.
The proposed ensemble-based deep learning model is used to detect lung cancer that may prevent the people from death. Some deep learning approaches that are used in the suggested lung cancer detection model are is VGG-16, ResNet and GRU. In this ensemble approaches, the parameters like epochs in VGG-16, hidden neuron count and epochs from ResNet, hidden neuron count and epochs from GRU are tuned by the IHHO algorithm. VGG16 is useful for detecting diseases from images. It consumes more time to get trained, and also it needs more bandwidth and space. Object separation and identification of images is done by using ResNet. More number of layers can be trained easily. It consumes more power. GRU takes less amount of time to get trained. Modifying GRU is simple. Learning efficiency is low in GRU. The objective function of this parameter optimization is the maximization of accuracy and precision. Here, the objective function is given in the Eq. (28).
In the above equation, accuracy is defined as Acy, and precision is defined as
Here, the term TPS defines the true positive value, the term TNG refers the true negative, FPS defines the false positive, and the term FNGdefines the false negative value. The formula for precision is given in Eq. (30). The schematic diagram of the designed EDLM-based lung cancer detection model is depicted in Fig. 6.
The result section shows the description related with the implemented platform, and also the various evaluation metrics are evaluated and described in all sections.
Experimental setup
5-fold evaluation of the developed ensemble learning-based deep learning lung cancer detection model among various existing techniques in terms of “(a) Accuracy, (b) F1-Score, and (c) Precision”. 
Performance analysis of the designed ensemble learning-based lung cancer detection model
The ensemble learning-based deep learning model for lung cancer identification was implemented using Python. The outcome of the designed ensemble-based deep learning model for the identification of lung cancer was evaluated ‘among various algorithms to ensure the progress of the recommended algorithm. The experiments are organized with a population rate of 10, chromosome length of 7 and maximum iteration of 25. The optimization algorithms like Deer Hunting Optimization Algorithm (DHOA) [34], Tree–Seed Algorithm (TSA) [35], Salp Swarm Algorithm (SSA) [36] and HHO [28] and techniques like CNN [20], SVM [19], VGG16 [31], and VGG_ResNet_GRU [37] are used for the evaluation process.
The evaluation metrics are performed to improve the performance of the developed model as follows.
The performance evaluation of the designed IHHO-ELDM-based lung cancer detection model is shown in below Fig. 7. As well as, Fig. 8 shows the performance evaluation of the implemented ensemble learning-based deep learning lung cancer detection model among various existing techniques. The accuracy of the proposed EDLM-based detection of lung cancer model has improved than HHO, SSA, TSA, and DHOA with 0.9%, 1.36%, 1.89% and 2.0% at a learning percentage of 60. Moreover, the graph analysis shows equivalence performance. Here, the existing HHO-EDLM algorithm attains second better performance concerning accuracy. However, it tends to enhance the system performance. Additionally, the TSA-EDLM algorithm achieves low performance in terms of accuracy, precision and F1 score. Moreover, it degrades the system’s performance. Thus, the training of the network becomes high, and also it is unsuitable for effectively detecting lung cancer. While considering the classifiers, the SVM model provides a second better performance. On the contrary, CNN does not provide effective performance other than the existing models. Therefore the proposed model has provided a better performance rate in accordance with precision, accuracy and f1-score when comparing with other algorithms.
Effectiveness analysis using 5 fold
The 5-fold evaluation of the offered ensemble learning-based lung cancer detection model among various algorithms is given below in Fig. 9. As well as, Fig. 10 shows the 5-fold evaluation of the suggested ensemble learning-based lung cancer detection model with diverse existing approaches. The precision of the developed EDLM-based lung cancer detection model is enhanced than VGG16-ResNet-GRU, VGG16, SVM and CNN with 2.7%, 3.9%, 3.09% and 5.02% with the 5-fold value of 5. In K-fold analysis, the given data is splitted into five datasets. While considering the accuracy based analysis, the TSA-EDLM shows a second better performance in terms of 1-fold. The developed method is performed well than the conventional prediction model by analyzing the precision, F1-score and accuracy.
Overall analysis of the developed lung cancer detection model
The overall computational analysis of the developed ensemble learning-based deep learning detection for lung cancer is shown in Table 2. The suggested ensemble learning-based lung cancer detection model has given improved F1-Score of 0.5%, 1.0%, 1.77% and 1.2% than HHO-EDLM, SSA- EDLM, TSA- EDLM and DHOA-EDLM. Furthermore, the developed method attains high performance than prior algorithms.
Conclusion
An ensemble learning-based lung cancer detection using CT images was implemented to identify lung cancers for humans at an early stage. For performing this, a set of sample CT images were collected from a traditional database. Here, the input images were pre-processed to enhance the image quality. In this model, preprocessing has been done by using two approaches like, weighted mean histogram equalization and mean filtering. Then the preprocessed images were segmented using adaptive fuzzy clustering and region growing techniques, where the parameter optimization taken place via the IHHO. The outputs of the segmented images were then detected using the ensemble-based classifiers such as VGG-16, ResNet and GRU. Moreover, parameters like epochs in VGG-16, hidden neuron count and epochs in ResNet, hidden neuron count and epochs in GRU were tuned using IHHO. These classifiers gave the predicted outcome, and the values were averaged to get the final output. The result of the developed model has been ensured among various classifiers, and the outcome was shown that the proposed method attained with 12.3%, 13.3%, 12.4% and 19.79% higher F1-score than VGG_Resnet_GRU, VGG16, SVM and CNN. Therefore, the experimental outcomes show the developed ensemble-based lung cancer detection model has high performance with respect to accuracy and precision among various algorithms and classifiers. The scientific explanation for attaining the best performance of the developed model is shown below. The designed IHHO-EDLM model tends to optimize complex parameters in order to improve system performance. Thus, it can to increase the accuracy rate in the lung cancer detection model. Owing to these, the radiologist can provide the appropriate treatment for the patients who are suffering from lung cancer. Moreover, the result shows that the developed model provides the significant performance when compared with the existing approaches. In future work, a novel deep learning technique will be adopted to evaluate lung cancer detection using PET/MRI images.
