Abstract
This study proposes a new predictive segmentation method for liver tumors detection using computed tomography (CT) liver images. In the medical imaging field, the exact localization of metastasis lesions after acquisition faces persistent problems both for diagnostic aid and treatment effectiveness. Therefore, the improvement in the diagnostic process is substantially crucial in order to increase the success chance of the management and the therapeutic follow-up. The proposed procedure highlights a computerized approach based on an encoder–decoder structure in order to provide volumetric analysis of pathologic tumors. Specifically, we developed an automatic algorithm for the liver tumors defect segmentation through the Seg-Net and U-Net architectures from metastasis CT images. In this study, we collected a dataset of 200 pathologically confirmed metastasis cancer cases. A total of 8,297 CT image slices of these cases were used developing and optimizing the proposed segmentation architecture. The model was trained and validated using 170 and 30 cases or 85% and 15% of the CT image data, respectively. Study results demonstrate the strength of the proposed approach that reveals the superlative segmentation performance as evaluated using following indices including F1-score = 0.9573, Recall = 0.9520, IOU = 0.9654, Binary cross entropy = 0.0032 and p-value <0.05, respectively. In comparison to state-of-the-art techniques, the proposed method yields a higher precision rate by specifying metastasis tumor position.
Introduction
In all over the world, liver cancer presents one of the foremost causes of death. The manual cancer tissue detection is a problematic aspect and time consuming. For accurate recognition and suitable therapy, a computer-aided diagnosis [1] is appreciated in decision making procedure. Accurate liver cancer detecting is consequently the chief aim using automated system.
Numerous medical imaging techniques such as ultrasonography (US), computed tomography (CT) and magnetic resonance imaging (MRI) are used for liver lesions examination. CT stills the vigorous imaging method to identify the cancerous tissue. Although, the qualified radiologists need to invasive methods, in some complicated cases, for confirming their diagnosis. In general, principal tumors for instance colon, pancreas and breast cancer frequently spread metastases to the liver in the anomaly development. Premature discovery of new-fangled liver metastases is still key because it can enhance the patient consequence. To explore the potential of computer-aided analysis, useful image processing techniques can be employed for liver tissue classification in order to assist the expert in diagnosis and decision-making method [2].
Artificial intelligence methods [3] have significance in several research applications for optimal characterization of the liver tumor. Regarding liver cancer detection, numerous systems are realized containing machine learning methods, watershed transform and region-based methods. A computer aided diagnosis (CAD) method proposed by Huang et al. [4] for segmenting and identifying the liver tumors using CT images. 81.7% of tumor classification precision is achieved in the extension work based on the auto-covariance texture features [5]. In clinical diagnosis of hepatocellular carcinoma, Zhou et al. [6] projected an important computational method focused on a particle swarm optimization technique. Newly, deep learning; in which neural networks and images are respectively processed of numerous layers and multiple types of filters; has been drawing attention as regards effective image pattern recognition [7–12]. Deep convolutional neural network (CNN) method permits all of the information limited in the input image to be employed depending on the feature set picked out in convolutional machine learning. These methods [13–15] achieve a pertinent performance in the pattern recognition of CT images which can successfully distinguish liver masses without radiologist intervention. In different computer vision domains such as region detection [16, 17], semantic segmentation [18] and image classification [19], deep learning schemes have reached massive achievement. Using CNN approach, alternating convolutional and pooling layers are provided to automatically extract multiple-level visual characteristics. This allows to obtain important progress in automated medical image diagnosis [20]. Melendez et al. [21] carried out many- instance learning focused on chest X-rays in order to detect the tuberculosis. It attained an AUC of 0.86. In the work of Hu et al. [22], a liver segmentation is appreciated based on CNN and globally optimized surface evolution, having 97% of mean Dice similarity coefficient (DSC). To classify skin cancer, Esteva et al. [23] trained a CNN and completed advanced accuracy compared to dermatologists. Besides, CNN has been applied in the segmentation process of different substructures for instance nuclei [24], neuronal structures [25], brain [26], cells [27], ventricles [28], and liver [29]. Still CNN presents the finest of used deep learning methods. In this context, Elshaer et al. [30] decreased the computation time of a great slices number based on two trained deep CNN models: The first one the liver region obtaining and the second for averting mistiness from image re-sampling and missed small lesions. In the work of Ma et al. [31], the CNN system is utilized in image patches; considering an image patch for each pixel (pixel of interest is in the patch. center). The patches are separated into two categories: normal or tumor liver. Least 50 percent or more of tumor tissue confirms that the patch is labeled as a positive subject. The authors report that the resulting accuracy reached 80.6%. ResNet, Alex Net, VGG-Net, etc. present different architectures of CNN model [32]. Additional researches [33, 34] have employed two-dimensional (2D) U-Net, intended mostly for medical image segmentation task.
In this work, two deep neural networks focused on U-Net and SegNet constructions [13, 14] were proposed to effectively resolve the image segmentation problems. In Table 1, an overview of previous approaches using CNN for different medical image segmentation applications. Using for increasing in attractiveness as relevant implementation of predicted segmentation learning systems, the U-net and Seg-net models present two important deep learning methods that have simpler models recognized as convolutional networks. This unsupervised learning offers a multilevel structure layer by-layer, automatically selecting increasingly more representations from the layers. The CNN Encoder-Decoder architecture mostly averts the gradient problem that can occur when training a standard neural network (without pre-initialisation). U-net pre-training increases the performance of the model by optimizing the CNN architecture and avoiding overfitting compared to others learning methods [14].
Previous approaches using CNN in medical image segmentation problems
Previous approaches using CNN in medical image segmentation problems
Due to the deep learning methods need an important data base, these two architectures can supply accurate models even with moderately small datasets. An enhanced fully convolutional neural network (CNN) concept is obtained by the U-Net method [41] via CNN layers followed by up-sampling ones. In this paper, an automated segmentation is established using CT liver cancer images. The included lesions are in low-contrast heterogeneous medical volumes. The proposed contribution is divided into three folds. A fully convolutional CNN is firstly trained in order to prove the adaptability to tricky metastasis liver lesions segmentation. To automatically identify benign and malignant tumors of the liver images, this paper reports these outcomes and elaborates a CAD scheme. The proposed process is applied in order to compare the extracted data by means of their ability for differentiating between malignant and benign tissue. Performance analysis process is completed through means of accuracy and other measures using several experimentations.
This paper is organized as follows: Section 2 describes the CAD system, Section 3 deals with results and discussion of the algorithm and Section 4 concludes the work.
CAD model called watershed Gaussian based deep learning (WGDL) is proposed. This technique consists of intensity-based-segmentation to capably describe the tumor lesion in CT liver images. The proposed segmentation process is revealed in Fig. 1. In this paper, the proposed work is split into two important stages: the first step is introduced by testing some segmentation methods such as U-Net and SegNet. After that, the next step is affected to demonstrate the effective obtained results regarding to qualitative and quantitative results by computing the performance model.

The proposed segmentation process.
In this work, an overall of 400 images collected from diverse CT scan machine including 200 healthy and additional 200 malignant topics. The CT images considered were collected from the Rabta Hospital of Tunis. We have used the DICOM images from 2D axial slices. The exported data are in JPG format and 512×512 size. Total 200 patients with pathologically confirmed metastasis cancer were used to realize this current study. In this dataset, each patient has 1 tumor measured 2 cm or less; considered in stage 0 (very early). The tumor has not invaded the large blood vessels in the liver. About 500 slices for each patient are extracted. A total of 8,297 slices were used to train and validate the proposed model. In effect, we were selected the pertinent slices by removing the overlapping slices. The datasets affected by liver lesions are divided into two groups: 170 training and 30 testing datasets. In this study, the examined malignant tumor is hepatic metastases. An entire of liver metastases images were also collected from the CT imaging center of Charles Nicole Hospital. The patients’ median age was 54 years (between 30 and 79). An influential patient number were employed to directly train the proposed Encoder–Decoder CNN model.
Data augmentation
Data augmentation is carried out to the training dataset focused on random acts in terms of rotations, translations, scale and flips. Novel training examples were produced for each training epoch. Diverse examples were generated for 500 epochs in order to effectively train the deep neural network system. In fact, only linear transformations were employed for averting non-real cancers in the validation.
Hepatic metastases tumor segmentation phase
The cancer segmentation phase supplies two phases procedure as defined in [8]. It includes firstly an automated liver localization and secondly segmentation of metastasis tumor region. Due to CT tumor images are tissues external to the liver with an analogue intensity to the cancers, its segmentation is so difficult. The original input is simplified based on the prior location information and the likelihood intensity variety in liver region via histogram equalization. Similar region intensity to the liver refers the appearance of other organs in the simplified image can disturb the process of tumor segmentation. Actually, the deep learning techniques can resolve the difficulty of the medical image treatment. It leads to effectively ovoid the classical segmentation approaches such as mathematical morphologies, actives contours. The centroid of the largest connected region in the treated CT image is defined as the starting point for region development.
Encoder–Decoder Architecture
The Encoder–Decoder method remains a deep completely convolutional neural network structure [44]. The proposed architecture contains firstly an encoder and secondly a corresponding decoder networks attended by a pixel-wise classification process. Figure 2 shows the structure of the mentioned approach. In effect, the decoder layers are employed to delineate the low resolution from the encoder layers for a whole input resolution. This leads to improve a 2D mask integrating the tumor segmentation. The input image and the used mask are in the same size. In this paper, we propose an advanced method based on two dissimilar encoder–decoder structures. These architectures are Seg-Net [45] and U-Net [46].

Encoder-Decoder structure.
Seg-Net technique was considered to well-organized CNN structure with regard to pixel-wise semantic segmentation. The used method is well known in different road scene understanding applications such as building, cars, and etc. Investigated by the CNN layers in VGG16 [47] structure, the encoder layers in Seg-Net architecture is employed. Hence, the decoder provides max-pooling indices stored and forwarded on behalf of the corresponding encoder’s layers. This technique achieves non-linear image upsampling of their input characteristic schemes. In the decoding procedure, the benefit of reemploying max-pooling indices is to enhance the process of boundary delineation. This method can be combined into any encoder–decoder structure having negligeable changesets. The sigmoid activation function is carried out in the last layer which can efficiency classify each pixel as a cancerous or background area. The applied Seg-Net architecture is exemplified in Fig. 3.

Seg-Net architecture.
Suppling a rapid and exact 2D-3D image segmentation [48], U-Net [49] is an entirely CNN architecture. The main advantage of this theory raised in its forcefulness and efficiency results even when used small training data set. U-Net architecture is separated into two matches: primary down-sampling (SAMPLING) and additional up-sampling (UP-SAMPLING). As shown in Fig. 4, the upper part is a standard completely CNN. It includes a blocks sequence of 3×3 convolutions layers presented as follow: A rectified linear unit (ReLU) activation function. A max pooling with a 2×2 filter and two strides for down-sampling.

The proposed U-Net architecture (AF: Activation Function, ReLU: rectified linear unit).
The feature number is doubled after each block. Regarding to the down part, we employed a 2×2 up-sampling. The feature number of maps is split into two after each block and enchained with the corresponding feature map of the left side which presented by: A 3×3 convolution A ReLU activation function.
In the last layer, a one-by-one convolution is appreciated using a sigmoid activation function. This later is applied for connecting each 32-component feature vector to the desired class. In our work, 26 convolutional layers along an entire number of 9, 239, 681 training parameters were achieved. The used U-Net method is revealed in Fig. 4.
Different statistical measures can be used to control and evaluate the correctness and the effectiveness of the analysis results. In this study, the CNN architecture is appreciated to get a hopeful and truthful detection ratio. The segmentations accuracy is computed focused on the resemblance degree between the ground truth and the resulting outputs projected by the two applied architectures (U-Net and Seg-Net). Hence, the included evaluation measure shown by the mean IoU is exposed by the following equation:
In our experiments, the evaluation of the proposed method is carried out using a statistical analysis in terms of Recall ‘Rec’, precision ‘PC’ and F1_score ‘F1’, of the hepatic metastasis’s tumor images. These criteria are frequently used for the evaluation of analytical tests:
TP and TN: the true positive and true negative numbers, FP and FN: the false positive and false negative numbers, respectively.
Comparing actual performance in controlling tumor process, the one-sided Mann–Whitney U test is applied. This is a non-parametric test by verifying the null hypothesis. If the resulting segmentations and the ground truth derived from the same group, an important performance is achieved. In effect, a significant threshold is set to 0.05 which correspond to 95% of confidence for accepting or not the null hypothesis.
In this study, we displayed an advanced experimentation to model hyper-parameter tuning based on two dissimilar structures of Encoder–Decoder architecture. Table 2 demonstrates the greatest significant parameters in terms of the obtained optimal values focused on the proposed Seg-Net and U-Net. A CNN (Seg-Net, U-Net) can vigorously segment hepatic metastases tumor lesion even if data augmentation placed in dissimilar orientations in order to have the property (invariance). Different experiments are realized to choose the optimum selected hyperparameters. In order to minimize the cost function, the stochastic gradient descent training method is applied to reduce the error rate between the actual and the desired output. In Table 3, all tested parameters are proved for final choice rationale.
Optimum results according the used hyper-parameters via Seg-Net and U-Net models
Optimum results according the used hyper-parameters via Seg-Net and U-Net models
Tested hyper-parameters
*Linear “L”, No-linear “nL”, Normal “N”, uniform “U”, Glorot uniform “GU”, RMSprop “RMS”, Adam “A”, Adagrad “A”, Stochastic gradient descent “SGD”.
Training process was proved the highest data (85% of the database), the proposed techniques were validated on the remaining CT data (15% of the remaining database) by computing the precision score (mean IoU). The mean IoU (accuracy) attained 96.12% and 93.70% using U-Net and Seg-Net, respectively. Besides, F1-score, Recall and precision are respectively 95.73%, 95.20% and 96.52% for U-Net method compared to Seg-Net architecture 92.18%, 93.26% and 92.45%. The resulting loss error focused on binary cross entropy are 0.0032 and 0.048 for U-Net and Seg-Net, respectively. The segmentation results in term of qualitative evaluation of Seg-Net and U-Net methods are exhibited in Fig. 4. In this context, we report that U-Net segmentation is closer to the ground truth compared to the obtained results of Seg-Net. 200 CT images including cancer lesions are tested in the proposed liver tumor detection approach. The projected scheme splits into two phases: liver separation and tumor lesion detection. We applied the statistical analysis to effectively demonstrate the relevant difference between the predicted segmented region and the ground truth labels supplied by the two models Seg-Net and U-Net. Simulation results illustrated in Fig. 5 highlight the robustness of the proposed method using large CT images. The projected approach was able to estimate metastasis tumor segmentation with an interesting accuracy result in the total database. Using U-Net model, experimental results are analyzed by computing the different statistical measures: the mean IoU, Recall ‘Rec’, precision ‘PC’ and F1_score ‘F1’. From the obtained results in Fig. 5, the performance of a segmentation algorithm usually depends on sensitivity and robustness; corresponding to the highest value of these statistical measure.

The obtained results in view of U-Net and Seg-Net architectures.
The p-values are computed between the ground truth and the two predicted segmentations. After several experiments, we can notice that is no important difference between the U-Net segmentation and the ground truth one which reaches a p-value of 0.035. Compared to Seg-Net architecture, a failed result is attained (p-value = 0.057) in order to determine the difference between the ground truth labels and the Seg-Net’s segmentation. The two target segmentations furnished by U-Net and Seg-Net methods are different depending on U-test. For the validation results, we applied a statistical analysis using Student’s t–test in order to evaluate the difference between the predicted segmentation provided by the two architectures and the ground truth. In fact, if, p-value is inferior to 0.05 (p-value <0.05) then, it was considered significant. Besides, for each experiment, the pertinent binary cross entropy is depended on the lower concerned value. Figure 5 shows the obtained results using the U-Net and Seg-Net methods.
In this paper, we introduced an advanced approach based on two different fully CNN encoder–decoder architectures. These U-Net and Seg-Net structures were effectively realized to accomplish hepatic metastases tumor segmentation. As shown in Fig. 6, the U-Net segmentation results displayed better accuracy compared to the manual segmentation of the experts. It is clear that Seg-Net segmentation results are not highly near to the ground truth. In fact, the foremost variation between the two applied models is presented in their connection type in view the convolution and deconvolution. Seg-Net employed merely the kept pooling indices through convolution process compared to U-Net is focused on features maps linking in terms of convolution and deconvolution. In this work, the attendance of tumor lesion can be exposed in rapports of analysis reliability of the hepatic metastasis’s cancer localization by different evaluations. Three radiologists, with varied expertise degrees, contributed in specifying the liver tumor ground truth, via the attentive lesion detection on each patient.

Quantitative segmentation results using U-Net and Seg-Net architectures.
The majority of current approaches used segmentation improvement techniques based on statistical form and emergence models. In the work of Lu et al. [50], authors proposed an advanced method focused on multiple features for training the kernel SVM via RBF kernel. This method can vigorously distinguish the hepatic lesion and the surrounding tissues through their corresponding statistical features. Still, this study requires an enhancement characterization stage by reducing the feature number. Besides, an important approach for metastases liver detection is applied in the work of Albishri et al. [51] based on multimodal U-net model. The authors notice that the segmentation results are efficiency improved than those obtained by atlas-based methods. Still, deep learning methods confirm the high potential in the medical segmentation applications. But the learned features require to be trained on a greater data base. For an improved qualitative analysis, a statistical measurement could be used to compute the quality of image segmentation. Table 4 reveals that the segmentation accuracy of the proposed method demonstrates operative segmentation precision using our dataset with a dice average of 93.54%, compared to other tested state-of-the-art techniques [50] and [51]. Finally, the proposed CT image segmentation approach can faithfully be exploited for the analysis of metastases tumor in precocious stage and used consequently as a further tool to assist radiologist in the diagnosis process.
Summary of some studies reporting some liver lesion segmentation approaches.
Summary of some studies reporting some liver lesion segmentation approaches.
In this study, we proposed and tested a new automated tumor segmentation approach based on two different encoder–decoder architectures: SegNet and U-Net. In each architecture, all layers and parameters are rectified in order to repeat the deep neural network training based on computed tomography data. The U-Net model highlights the finest segmentation results. To enhance the forcefulness of the proposed model, a novel data can be added that is still following annotation. We plan to use the segmented cancer approach for advanced tumor stages to efficiency improve prediction system and provide a prospective validation. Besides, the proposed segmented hepatic metastases method can be deployed to other lesion organs such as breast cancer response.
