Abstract
Medical image segmentation is a key step to assist diagnosis of several diseases, and accuracy of a segmentation method is important for further treatments of different diseases. Different medical imaging modalities have different challenges such as intensity inhomogeneity, noise, low contrast, and ill-defined boundaries, which make automated segmentation a difficult task. To handle these issues, we propose a new fully automated method for medical image segmentation, which utilizes the advantages of thresholding and an active contour model. In this study, a Harris Hawks optimizer is applied to determine the optimal thresholding value, which is used to obtain the initial contour for segmentation. The obtained contour is further refined by using a spatially varying Gaussian kernel in the active contour model. The proposed method is then validated using a standard skin dataset (ISBI 2016), which consists of variable-sized lesions and different challenging artifacts, and a standard cardiac magnetic resonance dataset (ACDC, MICCAI 2017) with a wide spectrum of normal hearts, congenital heart diseases, and cardiac dysfunction. Experimental results show that the proposed method can effectively segment the region of interest and produce superior segmentation results for skin (overall Dice Score 0.90) and cardiac dataset (overall Dice Score 0.93), as compared to other state-of-the-art algorithms.
Keywords
Introduction
Medical image segmentation delineates the lesions or organs from the background images like in Magnetic Resonance Imaging (MRIs), Computed Tomography (CT) scans, Ultrasounds. In different medical imaging modalities, the segmentation results are highly affected due to the presence of different artifacts such as intensity inhomogeneity, noise, low contrast, ill-defined boundaries, and many more. These challenging artifacts make the image segmentation task very difficult. Segmentation accuracy highly affects the subsequent medical treatment and diagnosis [1]. State-of-the-art techniques for medical image segmentation involve manual segmentation, which is performed by relevant experts and it is a cumbersome and extremely time taking process [2]. The manual segmentation of medical images also suffers from inter-observer or intra-observer variability [3].
Different medical imaging modalities (such as Cardiac Magnetic Resonance Imaging (CMR), Ultrasound, X-rays, CT scans, Positron Emission Tomography (PET), Skin lesion images) are suffered from different challenges. Some problems are common in all medical imaging; however, few are specific for some imaging types, for example in skin lesion segmentation, the main challenges are irregular and fuzzy lesion borders, intrinsic skin-related features such as blood vessels, skin lines, hairs, and distortion [4]. Similarly, for Left Ventricle (LV) segmentation in CMR, low contrast and inhomogeneity are present but some specific challenges of LV segmentation are variability in types of slices, different phases of cardiac cycles [5].
In recent years, several full/semi-automated segmentation methods for medical images have been published. Though, there is no single algorithm that is suitable for different types of medical images. This is due to the variations in anatomic structures, and each structure has its own specific challenges/artifacts. Some automated segmentation methods are proposed for a specific imaging modality and may be it would be more accurate for one type and less accurate for the other imaging type [1, 6].
Over the past decades, many supervised and unsupervised methods have been proposed for medical image segmentation. Supervised segmentation methods such as the multi-atlas method [7], convolutional neural network [8, 9], random forest [10], and many more are used for the segmentation, however, they generally require large amounts of annotated datasets, which is a very time-consuming procedure [8, 12]. These methods require several stages of intensity standardization and normalization. In addition, these models are very sensitive to the variations in the scanner, acquisition protocols, and optimization of a large number of hyper-parameters. Recently, deep learning networks have achieved great attention of researchers due to significantly improved segmentation results, but they usually require huge datasets to be annotated, which is a tedious and expensive procedure in medical images. However, data augmentation and transfer learning have been used to overcome the requirement of large annotated datasets but they have their own issues and problems and they are also suffered from the overfitting problem [12]. On the other hand, unsupervised learning algorithms do not require labeled datasets for medical segmentation; these methods use image-guided features to delineate Region of Interest (ROI). These characteristics make unsupervised algorithms more robust and applicable to the medical image segmentation [13]. Some common models of unsupervised learning include K-means clustering, self-organizing maps and multi-thresholding [14, 15] to name a few.
In last few years, Thresholding for image segmentation is very common and has attracted the attention of many researchers in the last decade [6, 14]. Thresholding is good for segmentation when image has distinct homogeneous regions. To date, many researchers have proposed different methods for optimal gray-level threshold such as using Kapur and Otsu but they only work well in the case of the bi-level thresholding method. Bi-level thresholding is useful if ROI is clearly distinguished from the remaining image using a single threshold value [17], which is generally not true in the case of medical imaging modalities due to the presence of noise, low contrast, and other artifacts problems. Multilevel thresholding techniques are more accurate than the traditional bi-level thresholding for the medical image segmentation because a diverse number of intensities are used to represent distinct regions in the image. For medical image segmentation, thresholding is mostly used as the first stage, which is followed by a series of image processing steps that make it computationally expensive [14]. Many algorithms have been proposed to get the optimal thresholding value for image segmentation [14–18]. Although an exhaustive search is required for finding the optimal thresholds, which results in the application of swarm intelligence and evolutionary algorithms [14, 16]. Many researchers have used different entropies based multilevel thresholding (like Shannon [16], fuzzy [14], Tsallis [19]) using evolutionary algorithms and nature-inspired algorithms for the image segmentation [18].
The key limitation of the thresholding method is that it does not consider the spatial characteristics of the image, which makes the segmentation results very sensitive to intensity inhomogeneity and noise. ACMs are widely used methods for medical image segmentation being robust to intensity inhomogeneity and low contrast artifacts [5, 21]. ACMs can be categorized into edge-based [22], and region-based methods [23, 24]. Edge-based models utilize the gradient to stop the evolving segmenting curve near the ROI’s boundary. Though these techniques produce poor segmentation results for regions with weak boundaries and they are sensitive to noise in the gradient estimates [22]. Region-based models separate the image into the foreground and background, based on local/global intensity-guided features [24, 25]. These models [23–26] are more robust to noise as compared to edge-based methods. Local intensity information is very useful in handling the intensity inhomogeneity and noise problems but they are very sensitive to the contour initialization and poor initialization can easily trap into a local optimum [23, 26].
For medical image segmentation, many algorithms based on ACMs have been proposed which are robust, but they are semi-automatic methods and require human intervention to provide the proper initialization for active contour models [21]. Similarly, thresholding methods are also considered as a good choice for segmentation, but they failed to produce if the ROI contains inhomogeneous regions or it has ill-defined boundaries. To combine the features of the afore-mentioned segmentation methods, we propose a method for medical image segmentation; which overcomes the intrinsic problem of thresholding (inability to handle inhomogeneous regions) and active contour models (initialization problem). The proposed method uses Harris Hawk optimization, which is used to produce an initial contour using an optimal threshold value. This contour is further refined using an active contour model. Harris hawk algorithm [18] considers the Minimum Cross Entropy Thresholding (MCET) as a fitness function to calculate multilevel thresholding values. To get one preeminent threshold value out of multiple values, we define an objective function, which is used to get the initial segmenting curve. The initial contour is further refined using an ACM, which is based on a spatially varying Gaussian kernel [26] to get the final segmentation for medical imaging.
The main contributions of this paper can be summarized as follows: A new initialization scheme using Harris hawk optimizer, is proposed, which is used to obtain the initial contour for the active contour model. An automated framework is developed, which is able to handle intensity inhomogeneity, low contrast images, noise, weak edges, and other artifacts that are usually present in medical images. To the best of our knowledge, this is the first method which incorporates a metaheuristic algorithm with an active contour model to solve segmentation problems effectively.
The proposed framework is compared with different state-of-the-art thresholding algorithms and segmentation methods for skin and LV (deep-learning and non-deep learning methods).
The remaining paper is presented as follows: Section II describes the background and related work of medical image segmentation; Section III presents the proposed model for medical image segmentation. In Section IV, a set of experiments are performed, which empirically evaluate standard thresholding algorithms, evolutionary method, deep learning models, and different active contour models. Section V and VI present discussions and conclusions, respectively.
Background
Notations
In this paper, we use following notations as described below: pixel coordinates of the input Image I are denoted by x, y ∈ Ω (imagedomain ⊂ R2). φ (x, t) is a zero-level-set function at a certain time t and it denotes a closed curve C, which can be defined as: {x∈ Ω|φ (x, t) = 0 }. ACMs define curves that move within input image to find object boundaries. ACMs are applied for separating the image into distinct regions; foreground and background. Foreground. Image foreground and background regions are described as Ω1 (C) ={ x ∈ Ω : Φ (x) < 0 }, and Ω2 (C) ={ x ∈ Ω : Φ (x) > 0 } respectively. α, γ1 and γ2 are penalized parameters which are used in active contour model (ACM), they influence the contribution of each energy term in the minimization process of ACM.
Related work
Image segmentation is required as a preliminary phase for examining the medical images. Because of the complex nature of medical images, segmentation itself is a very challenging and difficult task [27]. Different segmentation methods have been proposed for different types of medical images [20, 28]. We can categorize them as supervised and unsupervised learning methods. Thresholding is one of the most popular method for image segmentation [16]. However, bi-level thresholding produces the desired segmentation results if the object has distinct homogeneous regions, but medical images have main challenge of intensity inhomogeneity and low contrast. Many multilevel thresholding methods have been proposed in the last few years for the medical image segmentation [29, 30] however they are still failed to produce good segmentation results due to severe inhomogeneity and ill-defined boundaries. Recent algorithms are based on the use of entropies as objective functions in evolutionary methods to select the optimal threshold values for segmentation [16, 19].
The main problem with thresholding method is that it does not consider the spatial characteristics of the input image, due to which it is not a good choice to use them solely for the segmentation of medical images because the results are very sensitive to intensity inhomogeneity and noise. For medical images, ACMs are considered as a good choice for medical image segmentation being robust to intensity inhomogeneity and low contrast artifacts [20, 21]. ACMs can be categorized into edge-based [22], and region-based methods [23, 26]. The main problem with ACM is dependency of each model on the initial placement of the evolving contour [24, 26]. In last decade, different hybrid and two stage ACMs have been proposed to overcome the initialization problem [25], but they are computationally very expensive. In different ACMs which are based on the spatial information of the input image, local intensity information is very useful in handling the intensity inhomogeneity and noise problems but they are very sensitive to the contour initialization and poor initialization can easily trap into a local optimum [26, 27].
Different supervised and un-supervised algorithms have been proposed for different types of medical images. An unsupervised technique was proposed for skin lesion segmentation which was based on the sparse coding method [31], the method outperformed state-of-the-art methods but still couldn’t attain very promising results (DSC = 0.80) on ISIC 2016 dataset due to presence of ill-defined boundaries of skin lesions. In 2019, a hybrid approach was proposed using classical gradient-based histogram and clustering-based methods, it outperformed the previously defined methods but still, it was able to achieve DSC < 0.80 for the Dermofit skin dataset [32]. In another paper, two adaptive approaches were used to identify the global threshold for skin lesion segmentation, they produced good segmentation results, but they tested the proposed approach only on 25 skin images [33].
In different medical imaging modalities, CMR plays a vital role in the diagnosis of several congenital heart diseases [34]. Segmentation of the Left Ventricle (LV) is very important in the assessment of ejection fraction, regional function parameters like end-systolic and end-diastolic volumes, these parameters are very significant to detect the abnormalities of the heart [7]. Many supervised and unsupervised algorithms have been proposed for endocardial segmentation. In the last decade, different active contour models (edge-based, region-based, or hybrid) have been proposed for LV segmentation. Hybrid of edge-based and region-based models are good for segmenting the LV in the presence of intensity inhomogeneity and low contrast. A two-stage active contour model was proposed by Soomro et al. [25], which was computationally expensive as the contour evolution was performed in each stage of the model. Active contour models are very sensitive to the initialization, to have good initialization, many semi-automated techniques and few fully automated algorithms have been published [20]; though these models are based on some computationally expensive pattern matching [35] or some complex preprocessing pipelining [5]. Many deep learning networks have been proposed in the last few years for LV segmentation but the main problem is that they require either a very large annotated dataset or need to combine deep learning with some deformable model for LV segmentation as proposed by Avendi et al. [36]. Several evolutionary algorithms have been proposed for image segmentation [19, 29], but no nature-inspired algorithm produces better results than ACM for LV segmentation. Many recent LV segmentation methods are not fully automated and these methods require human intervention or ground truth to initialize the contour [21].
As mentioned above, different methods have been proposed for different types of medical images, but they are specific for one type of imaging modality and fail to produce on some other medical imaging type. Some segmentation methods are good for LV segmentation [35, 36], some methods are good for Brain MRIs [28], some are good for segmenting skin lesions [32, 37], some produce good segmentation for liver on CT scans [38], however, There is no method that can be adapted for different types of medical images.
Methodology
Data
This paper focuses on two datasets from two different medical imaging types; namely the ISIC 2016 skin database and the ACDC database. The skin data consists of 900 skin lesion images with different artifacts. In 2016, the skin dataset was released by the International Skin Imaging Collaboration (ISIC) at the International Symposium on Biomedical Imaging (ISBI) challenges [39]. A manually drawn ground truth is provided with each skin image.
The second image modality which we use in this study is CMR and the dataset is taken from Automated Cardiac Diagnosis Challenge (ACDC) MICCAI 2017 [40]. CMR is acquired in breath-hold with a retrospective or prospective gating in short-axis orientation (SAX), The data covers images from different phases (systolic and diastolic) and types of slices (Apical, Left Ventricular Outflow Tract (LVOT), Basal). The pixel spacing varies from 1.37 to 1.68 mm2 per pixel. All images are acquired over a 6-year time period using MRI scanners of different magnetic strengths (3.0 T and 1.5 T).
Proposed method
We can precisely define our segmentation method in six steps as follows: Automatically determine multilevel threshold values for the input image using the Harris hawk algorithm. Using an objective function, automatically calculate one optimal threshold from multiple threshold values. Convert input image to binary image using optimal threshold value. Remove objects that are touching the image boundary using imclearborder (MATLAB function) as shown in Fig. 1. Determine ROI among different connected components, using prior information of the input image. Calculate the final segmenting contour using an active contour model.

Overall diagram of proposed segmentation methodology.
The overall process of proposed method is shown in Fig. 1. Further details of thresholding and active contour model is given below:
For the input image, we search for the optimal threshold values for multilevel thresholding. We use Harris hawk optimization algorithm [18], which uses cross entropy as the fitness function. This function is quantitatively (as fitness score) used to evaluate different threshold values for the given input image. Cross
Entropy
is the entropy between the input image and its segmented image, it calculates the information theoretic distance between the probability distributions of input image I ={ I1, I2, … I
n
} and segmented image S ={ S1, S2, … S
n
}, which can be defined as:
Smaller value of the cross-entropy depicts greater homogeneity and less uncertainty.
First, the image is converted into gray scale and then the histogram of the converted image is acquired. We used Harris hawk method [18] to obtain the vector of optimal threshold values, for clarity, all the steps of Harris hawk algorithms are described in Algorithm 1.
The inputs to the proposed method are the number of the Harris hawks in the population, the number of thresholds and the maximum number of iterations to execute the algorithm.
We use another objective function to get one final threshold value from the vector Pbest which consists of multiple thresholding values. This final threshold value is used to get the initial segmenting curve for the final segmentation. For this, we calculate the minimum score of the objective function using Eq. (2). The generalized objective function that can be used for different types of medical imaging modalities is defined below:
Cross Entropy calculates the entropy between the input image and its segmented image, it is defined in Algorithm 1 (Equation 3).
We use two penalty terms penalty1 and penalty2 which are calculated using Eq. (4) and (5) respectively. They are same for any medical imaging modality. B(x,y) denotes value of the pixel located at row x, column y of the thresholded binary image array. An M × N has total M rows and N columns in the image. penalty1 is used to determine if the final output of a thresholded image is all white region, using the following condition:
Similarly, penalty2 is used to determine if the final output of a thresholded image is all black region, using the following condition:
Though the first three terms in Eq. (3) are the same for any medical imaging modality, however, the last term/s prior
modalitydependent
is solely dependent on the type of the medical image. In Eq. (2), prior
modalitydependent
is one or more priors that can be used to segment the ROI, for instance in LV Segmentation, we have prior knowledge that LV is usually most circular, next to Right Ventricle (RV) and located close to the center of the CMR as shown in the Fig. 3. We incorporate the priors using this term as a prior
modalitydependent
, for instance, we add normalized Euclidean distance of each object from the center of the image and circularity of each object (we prefer circular objects over non circular), the prior
LV
comprises of the following terms:

Binary images produced using different threshold values. Fitness scores calculated using objective functions is shown for skin dataset.

Binary images produced using different threshold values. Fitness scores calculated using objective functions is shown for CMR dataset.
Similarly, for skin lesion segmentation, we use prior knowledge such as skin lesions are usually present close to the center of the image and some properties of the markers and hair artifacts removal are also used. Marker or borders are present in some skin images, so we create masks to identify them and hair artifacts are identified using the circularity of the objects. The prior
skin
comprises of the following term:
This objective function is calculated for each value in the vector Pbest which was obtained from Algorithm 1. This step is explained in Algorithm 2.
We use region-based ACM to refine the segmentation result, which we have obtained in the previous step using the optimization algorithm. Region-based models [23, 26] are considered to be better as compared to the edge-based models [22] due to the global nature of these methods. We use an ACM, which utilizes a region-based energy term EReg augmented with two regularization energy functional RL and RD i.e., a length penalty term and a distance regularization term, respectively. The total energy functional is described as follow:
We use a Local Gaussian Distribution Fitting Energy (E
Reg
), which utilizes a spatially-varying Gaussian kernel to weight the pixel intensities in the ROI. Using local statistics of the image, this region term (E
Reg
) is capable to handle the intensity inhomogeneity artifacts and variable contrast problems. The model partitions image into two distinct regions; foreground and background [26] which are denoted by Ω1 and Ω2 respectively. E
Reg
minimizes the following energy functional:
where K is a truncated Gaussian kernel that assigns larger weights to nearby pixels i.e.
ui(x) and
where H1 (φ) represents regularized Heaviside function for background and H2 (φ) represents Heaviside function for foreground using (1 - H1 (φ)).
In addition to the region term (E
Reg
), we used two regularization terms to have smooth and better contour evolution. The first regularization term RL is used, which penalizes the length of the contour for smooth contour evolution [23, 27] and can be described as follows:
The second regularization term RDev (φ) is used to obtain an accurate and stable level set function at each iteration to penalize the deviation of the contour from a Sign Distance Function (SDF) [41]:
Iteratively, the curve evolution is performed by minimizing the following functional:
Using the gradient descent algorithm, we obtain the following gradient flow equation minimizing
All hyper parameters (α, γ1 and γ2) are set using grid search method, however we used same values for both datasets and they give good segmentation results.
All the experiments were performed on a processor with a 2.5 GHz Intel Core i5-7200 U processor and 8 GB of RAM. We performed all the experiments in MATLAB R2017a. We used two different types of medical images, both are publicly available; one dataset is from ISIC2016 for segmentation and the other dataset is obtained from an automated cardiac diagnosis challenge (ACDC). All the results are evaluated and compared with other state-of-the-art methods.
Performance measures
The experimental results are compared with the segmented ground truths, which are determined manually by experienced cardiologists and dermatologists for ACDC and ISIC, respectively. We quantitatively analyzed the segmentation obtained from different algorithms and our proposed method. We used Dice Score (DSC), Jaccard Index (JI), Sensitivity (SE), and Specificity (SP) to compare our method with other algorithms. The formulas for these evaluating metrics are given below:
where A and B are manual and automated segmentations, respectively
where Good Contours (GC) are calculated with Average perpendicular distance (APD)≤5 mm and APD is the distance from the ground-truth (manual) segmentation to the automated segmented contour averaged over all contour points [42].
In recent years, many deep learning [43–45] and non-deep learning algorithms [33, 46] have been proposed for skin lesion segmentation. We compared skin lesion segmentation with standard thresholding (Otsu), evolutionary method (Harris Hawk) [18], ACM [26] and some recent deep learning methods [4, 47]. Table 1 shows that our proposed algorithm outperforms non-deep learning methods, and it has produced comparable results to the deep learning methods. This table also shows that ACM, which is used to refine the segmentation results, significantly improves the dice score (from 0.83 to 0.90 with p-value≤0.001) Fig. 4 presents a qualitative analysis of proposed method for skin lesion segmentation.
Comparison of proposed method with state-of-the-art methods for skin lesion segmentation
Comparison of proposed method with state-of-the-art methods for skin lesion segmentation
Comparison of proposed method with state-of-the-art methods for LV segmentation

Different types of skin images, with irregular boundaries, intensity inhomogeneity, variable contrast, presence of markers, hair artifacts. Segmenting curves are shown with each image where green contours are manually determined skin lesions and red contours are generated using the proposed approach.
We used 40 random subjects from the ACDC dataset and found that our proposed algorithm outperformed other state-of-the-art algorithms p≤1 × 10–8 [18, 26]. The proposed method also produced a very high percentage (92%) of good contours (GC) (APD≤5 mm). This method yielded superior segmentation results from non-deep learning methods and comparable results to the deep learning methods. We have analyzed the results using different evaluating measures. Figure 5 presents a qualitative analysis of LV segmentations obtained from proposed method using different types of slices.

Different types of Cardiac MRIs, with severe intensity inhomogeneity, presence of different organs, variation in LV sizes. Segmenting curves are shown with each image where green contours are manually determined skin lesions and red contours are generated using the proposed approach.
Figures 4 and 5 show the robustness of our proposed algorithm for different types of images (skin lesion and LV). It is very challenging to produce good segmentation results in the presence of markers, hair, intensity inhomogeneity, different organs, low contrast, and ill-defined boundaries.
In this study, 500 dermoscopic images were taken from another publicly available ISIC archive to validate the proposed method and priors which we used for ISIC 2016 dataset. Our proposed hybrid segmentation model achieved an average Jaccard index score of 0.81. This results to an improvement of 10.5% over the accuracy of the winner of the ISBI 2017 challenge (Jaccard Index = 0.765).
Time complexity
In medical image segmentation, accuracy and efficiency are both very important evaluating measures. Figure 7 shows our method produces superior segmentation results as compared to other state-of-art algorithms. Figure 8 shows Harris Hawk optimization method is most efficient in segmenting LV boundary and our proposed method is the second most efficient method; however. Figure 7 depicts that the Harris Hawk fails to produce good segmentation results as compared to our proposed hybrid model.

Overestimated, underestimated and estimated segmentation results produced by thresholding, ACM & proposed methods, respectively.

Dice Score of different methods for LV segmentation.

Computational efficiency of different methods for LV segmentation.
Our method takes approximately 1.5 seconds to segment one CMR (2D) image, which is more efficient than other state-of-the-art methods.
Medical image segmentation is the key to the diagnosis of several diseases and accurate segmentation plays an important role for the assessment of several skin, lungs, brain and cardiac diseases [21, 48]. Many methods have been proposed to get ROIs; however, there is no single framework that can be adapted and applied to different modalities of medical imaging. Recently, deep learning networks have outperformed the image segmentation, but they all require a large size of annotated data for training, tuning of a large number of hyper-parameters and requirement of Graphics Processing Units (GPUs) to solve the segmentation task. Data augmentation has resolved the problem of large datasets, however, overfitting can still cause misclassification on the unseen data examples [12]. Thresholding is suitable for those types of images where we can distinctly divide the image into n groups; however, the traditional thresholding algorithm (Otsu method) produces ineffective and undesired segmentation results in many cases, especially for images with low-intensities and ill-defined boundaries. Similarly, the Histogram Thresholding algorithm (HT) is also good to produce accurate delineation if there is a sharp difference between the ROI and the surrounding region [32].
ACMs are very robust to inhomogeneity and low contrast artifacts, which make them a suitable choice for medical image segmentation [20, 35]. However, ACMs are very sensitive to initialization [23, 27], which makes ROI segmentations mainly dependent on the initial placement of the contour and the initial contour can easily be trapped into a local minimum [24, 26]. Several semi-automated methods and fully automated algorithms have been proposed to have an appropriate contour initialization, though they involve computationally expensive pattern matching [35] or complex preprocessing pipeline [5], which may not generalize to multi-center datasets. Moreover, in the past recent years, some two-stage ACMs [25] have been published in which the first stage is used to get the initial curve (or contour), which is utilized as the initial placement of the contour in the second stage for the final segmentation. However, these two-stage methods are computationally very expensive because there is a complete execution of ACM in each stage [25].
In order to overcome the limitations of ACMs and thresholding methods for medical image segmentation where one may cause overestimation or underestimation of the ROI [32] as shown in Fig. 6, we need to design a framework which incorporates the advantages of both techniques to achieve a better estimation. In the proposed method, we have used a metaheuristic algorithm to get the optimal threshold value to get the initial segmentation, which is further refined by a region-based ACM to produce the final segmenting curve.
The experimental results in the last section validate our method using two publicly available medical imaging datasets. Figures 4 and 5 show the detailed qualitative analysis of our approach and proved that this proposed method is able to accommodate the artifacts present in the medical images and it is capable of handling the challenges like variable contrast, indistinct boundaries, shape differences and undesirable residues (markers, borders and many more) to get the ROI [45]. The accurate delineation of any method is a relevant achievement to produce particular features for machine learning procedures. One more advantage of our proposed method is that we have a new framework for medical image segmentation which can be adapted for different types of medical images, we have validated our approach on two entirely different datasets (a skin dataset and a CMR dataset). As given in Eq. 3, we can define prior that is mainly dependent on the type of the input image, this method can be applied to any medical dataset. In the future, we plan to apply this framework to other medical imaging modalities as well.
Conclusion
We propose a new framework for medical image segmentation, consisting of an evolutionary method and an active contour model. The evolutionary algorithm is used to obtain the optimal threshold value to get the initial segmentation, which is further refined by the evolution of an active contour model. The region-based active contour model includes a Gaussian based spatially-varying, local statistical model to handle intensity inhomogeneity and other artifacts. A length penalty term and a distance regularization term are incorporated to ensure the smooth evolution of the segmenting curve. The model was robust to low contrast, ill-defined boundaries, intensity inhomogeneity, and other artifacts as exhibited in challenging medical images. It also showed that the proposed method was able to handle diverse types of images using one generalized algorithm and it can be adapted for other types as well.
