Abstract
Recent reports indicate a rise in retinal issues, and automatic artery vein categorization offers data that is particularly instructive for the medical evaluation of serious retinal disorders including glaucoma and diabetic retinopathy. This work presents a competent and precise deep-learning model designed for vessel segmentation in retinal fundus imaging. This article aims to segment the retinal images using an attention-based dense fully convolutional neural network (A-DFCNN) after removing uncertainty. The artery extraction layers encompass vessel-specific convolutional blocks to focus the tiny blood vessels and dense layers with skip connections for feature propagation. Segmentation is associated with artery extraction layers via individual loss function. Blood vessel maps produced from individual loss functions are authenticated for performance. The proposed technique attains improved outcomes in terms of Accuracy (0.9834), Sensitivity (0.8553), and Specificity (0.9835) from DRIVE, STARE, and CHASE-DB1 datasets. The result demonstrates that the proposed A-DFCNN is capable of segmenting minute vessel bifurcation breakdowns during the training and testing phases.
Keywords
Introduction
Glaucoma and diabetic retinopathy are two eye conditions that seriously affect the optic nerve and eventually result in blindness. Patients with glaucoma experience intra ocular pressure in their eyes, some of whom may feel it and others who may not. Glaucoma’s irreversible harm has a significant impact on the number of blind people worldwide. Through the analysis of fund us images, it is possible to determine how many people are blind worldwide due to these serious eye disorders [1]. Quick recognition and regular treatment will reduce optic diseases and avoids growth in the blindness population. Manual segmentation includes Optic Disc (OD) Segmentation, after separation of the red and blue channel inputs by pre-processing. A dynamic intensity threshold is employed to determine the dissimilar threshold values from areas. Then identification of all the objects is made using eccentricity. Thus, it holds only the value of <0.9 to remove semi-circular reflections at the border of the images.
Blood vessels Removal and In-painting, blood vessels are removed using blue and red channels as input. Equation (1) is used to create a brightness channel to luminance blood vessels.(1). Then the bottom-hat morphological operation is applied with the help of a disk-shaped framing component with 26 as its size.
The binary images of the segmented optic disc and the blood vessel are multiplied pixel-wise by using Equation (2). Finally, blood vessels are removed.
Optic Cup Segmentation, the variance maximization method is used to repaint the eliminated areas in the optic disc. In this method region bounding box is used. The pixels in this region are to be centred with a 7×7 window which is equivalent to the blood vessels present in the optic disc. Then rotate the 7×7 window in 12 different directions to predict the disparity of pixels in this domain. The direction with the highest discrepancy is anticipated, and that greater variance value replaces the central pixel. By following this procedure, a green channel that has been in-painted is obtained. The disc cup is then segmented using an intensity-based threshold.Image created with threshold replaces each pixel if intensity x (i, j) is less than A. It is defined as:
An automatic segmentation method is of three different types: i) optic disc segmentation ii) optic cup segmentation and iii) vessel segmentation [2]. An algorithmic way of segmenting vessels in the retina will be an advantage to a great extent by reducing time and minimizing complexity. The evolution of CNN-based vessel classification is done with connectivity patterns in neurons and with multiple convolutional layers [3, 4]. Encoder-improved Unet enhances network performance without computational burden. In the Encoder additional layer is carried out after each convolutional layer except the first stage. A new layer is introduced between feature maps of the first convolution layer output and feature maps of the max pooling output. In between obtained output and the second convolutional layer, the feature maps second addition is included.
The addition of residual or attention blocks also improves performance [5, 6]. Proposed fully convolutional neural network to segment vascular tree is done on colour fundus images i) Includes attention mechanism into the Unet model architecture ii) Discussed loss function and parameter used during training process iii) Appliance of a trained network [7]. Fully convolutional networks as semantic segmentation with skip connections works in deep shallow architecture. The Unet model [8] was designed with a contraction path and expansion path that will localize the image, applied in major bio-medical segmentation tasks. Proposed dense U-net with the patch learning process, undergoes patch training process of fundus images. In general, all the deep CNN-based structures are moving towards automated applications of medical imaging. Fully automated segmentation of retinal vessels follows the contraction path and expansion path. The model is assisted with skip connections to pass hierarchical representation between different layers [9]. The performance of the article’s suggested SineNet model is assessed using well-known public datasets as DRIVE, STARE, and CHASEDB.The cross-database study is done for vessel segmentation which may be helpful in clinical applications.
The contribution of this article includes, the proposed A-DFCNN model is consequent from existing Unet utilizing subsequent changes: Downsampling and upsampling stages remain with a similar number of kernels when stages get increased. Assign the same weight for down-level and high-level features. ii) Uncertainty quantification research is done for the AV classification. The construction of the paper is as follows: Section 2 expresses the state of art comparison with other deep learning models. The section 3 describes the proposed approach with AV classification and architecture model. The section 4 comprises the output attained, and the performance of the proposed work. Section 5 concludes the overall work done in fundus images.
Fully convolutional neural networks (FCNN) are used for vessel segmentation with stationary wavelet transform (SWT) exposing the structure of the retinal vascular map. Additional channels are incorporated into the fully convolutional neural network structure. SWT conserves pixels by enriching the input of FCNN. The segmentation of vessels can be carried out either globally or locally. In some circumstances, performance can be enhanced by region-wise semantic segmentation combined with transfer learning. Fundus image segmentation using upsampling and downsampling techniques in FCNN will reveal thin and thick vascular characteristics [10, 11]. In fundus image segmentation usage of moderate vessel features by the decoder can represent tiny objects pixel-wise. There are unfair pixels between vessels and pixels in the surroundings. Accordingly, it is proposed to use a round-wise feature aggregation method with a low stride value that has rich features with tiny blood vessels [12]. ResNet with scale space approximation layers is used for the multi-scale representation of fundus images, with down-sampling and up-sampling processes [13].
New Cross connected CNN model developed for vessel segmentation improves the overall performance of segmentation, as well as a strong segmentation algorithm, was proposed. Performance was assessed in terms of accuracy, prediction speed, and specificity. Feature fusion with skip connection produces great results in DRIVE and STARE datasets [14]. Retinal vessel segmentation in the Dense-Unet model using a random transformation-based training approach was proposed. Random transformation applied for image augmentation and patch-based learning is done to improve the training effectiveness. The dice loss function in fundus image segmentation addresses the data imbalance problem between segmented and ground truth images. An attention mechanism is proposed in future work for the segmentation of tiny vessel features [8, 15]. Skip connection-based residual deep learning networks combine features of various methods. Skip connections extract necessary information from fundus images with a 10-fold validation strategy. SCB (skip connection block) has a fully connected layer followed by ReLU activation. It has advantages in the feed-forward network and weight-updating method of back propagation [16, 17]. Modified encoder-based Unet architecture is proposed with two additional layers at each stage which extracts useful edge information of tiny vessels [6, 18]. The addition of attention blocks and residual blocks also improves performance comparatively while using Unet architecture alone. Attention ResUnet can be seen as an upsampling/ downsampling method that builds by attention blocks. The performance is improved by adding a new type of BSE block (Attention mechanism) with residual structures.Skip connection in residual blocks plays a major role in the modification of the depth of a network without a change in the performance of a network [19, 20]. Table 1 shows the various methods of proposed work.
Different method of the retinal blood vessel segmentation
Different method of the retinal blood vessel segmentation
Materials
The proposed system is assessed using DRIVE, CHASE_Db1, and STARE fundus image databases. The DRIVE record [21] comprises 40 RGB retinal images with a 584×565 pixel resolution. It uses half of its fundus images for testing and training processes respectively. The STARE database [21] has 20 retinal fundus images with a 605×700 pixel resolution. The CHASE_Db1 record [22] has 28 fundus images taken from an NM-200 fundus camera with an image resolution of 999×960 pixels. Figure 1 shows the proposed work and Fig. 2 are depicted as a DRIVErecord of input fundus image and artery vein reference.

Proposed workflow diagram.

DRIVE Record: (a) Input fundus image and (b) artery vein reference model.
Let model with input image x and parameters θ to a function that maps probabilities with spatial extent as the source image:
Probabilities of the mapping function
A classification pixel of artery and vein is significant for a multi-label system [23, 24]. The training dataset has an image mask of segmented vessels.
For classification networks uncertainty of loss, and attenuation occurs in the neural network. Input-reliant uncertainty is common in many classification processes [26]. The defined regular classification technique helps to influence transitional hetero regression. This chosen model works well with neural networks for heterogeneous categorization. Uncertainty is detained by adapting the innovative technique to calculate average and dispersed data of logits:
To evaluate dispersed data, logits of the subsequent layer are included in the amount of produced dataset [27–38]. An activation function is included in the resulting layer to ensure evaluated dispersed data is positive or not. The likelihood of samples is evaluated as follows:
Where, ⊙ is set as the product and ∈ are labelled in the conclusion.
The main implication method for AV remains the same with the exemption. The loss is evaluated from models NA labels, minimum loss is nothing but a mean of individual labels. Uncertainty is identified in the view of parameters as random variables
Where D is a pair of input and output, intractable parameter X (θ | D) is used in fully connected deep neural networks.
The structure used is A-DFCNN. A-DFCNN outperforms other fully convolutional network architectures in AV vessel structure prediction. Layer Input is added pixel-wise with an output of the previous layer in dense convolutional blocks shown in Fig. 3. The model has normalization, activation function, convolution operation, and dropout to produce feature maps. It is similar to other downsampling and upsampling structures where downsampling will compress the input and the transition of skip connection is done between the encoder and decoder. The up-sampling process has dense convolutional blocks and transitions.

Attention-based dense fully-convolutional neural network model.
Skip connection: Skip connections will combine high-level and low-level DFCNN characteristics that perform well in terms of sensitivity and accuracy. By modulating the skip concatenate across the contracting path and the expansion process, the DFCNN architecture encourages information synthesis among local and global features. It aids in emphasising the vessel’s characteristics. By using the skip-connections shown in Fig. 3, the dense block Db 1 is joined to the dense block Db 5 for the sample.
Attention mechanism is the process of including parameters in every channel of a convolutional block, so that the method can adaptively fine-tune the weighting of each feature map. Figure 4 consists of five basic units of operations: i) Input added to the convolutional block and the number of channels used for future intervention. ii) Squeeze channels into a single value by average pooling. iii) ReLU function next to a fully connected layer will avoid nonlinearity. iv) Smoothing operation is given by a second fully connected layer. v) Based on the result produced in the network feature maps are weighted.

Attention mechanism.
The training of the proposed A–DFCNN model is done on NVIDIA Tesla k40c GPU, 32GB RAM, and Intel E5-2630 processor. Four training groups are assigned with 30 epochs, with a learning rate of 0.0001. For pixel-wise vessel segmentation, it produces four measures: TP, TN, FP, and FN viewpoints for the number of true positive and negative samples and the number of false positive and negative samples, which is depicted as Equations 11 to 16. The following metrics are used for assessment: Accuracy, sensitivity, specificity, and area under the curve.The A-DFCNN model is compared on the following datasets CHASE_DB1, STARE, and DRIVE. Every dataset is divided into the training phase, validation, and testing phase. The addition of the Adam optimizer and the binary cross entropy loss function. The stochastic descent method’s Adam optimizer is an enhancement that works well for deep learning applications in image processing and computer vision.
Proposed DFCNN algorithm test image and input model of corresponding segment image pseudo code described. Let H, W is image width and height, P is patch size, Op is output patch size, Os is output stride size, and S is sampling step size. The usage of attention mechanism segments tiny vessel features precisely concerning the ground reality images revealed in Fig. 5. Output images are compared with ground reality images in Fig. 6. Attained results from section (d) show that the Attention-based DFCNN network performs well as compared to DFCNN from section (c). The proposed technique’s effectiveness depends on attention maps produced from fundus images. The self-attention mechanism includes different comparison functions such as multiplicative attention and addictive attention.

Rows 1, and 2 of DRIVE and STARE datasets show a partial view of attention results (a) Input fundus image (b) partial views (c) ground truth (d) A-DFCNN.

Rows 1, 2 and 3 are DRIVE, STARE and CHASE_DB1 datasets respectively. (a) Input fundus image (b) Ground truth (c) DFCNN (d) A-DFCNN.
Numerous works has been done on the segmentation of retinal vessels based on STARE, DRIVE, and CHASE_DB1 datasets depicted in Table 2. Since the proposed technique depends on a self-attention mechanism which provides better results in all the datasets. Individual database results: The DRIVE dataset hathe most challenging dataset which provides less accuracy and sensitivity. STARE dataset has low-contrast images which affect vessel structure prediction. It has high accuracy and specificity but less sensitivity. It is found that the CHASE_DB1 dataset produces better results concerning all aspects. Combined dataset results: In the overview, the combined images from DRIVE, STARE and CHASE_DB1 used for training andthe STARE dataset’s challenging images which affect the segmentation result. Figure 7(a&b) depicted as individual/combined dataset training of proposed model and result image presented.
Performance comparison of proposed model to show how it is varied from existing models in DRIVE, STARE and CHASE_DB1 database

Individual and combined dataset training of A-DFCNN model. A row belongs to DRIVE, STARE and CHASE_DB1 datasets, respectively. Columns shows individual and combined dataset training results.

Result image.
Table 3 shows the Sensitivity, Specificity and Accuracy obtained on each fold and its comparisons. Each is found to be closer in the STARE dataset compared to the DRIVE and CHASE_DB1 datasets. In k-fold cross-validation, there is not much variation in the performance metrics of all three datasets.
Comparison of Spc, Sen and Acc on each fold
The ROC values obtained after training aregiven in Figs. 8–10. The common issue in this method is to identify the vessel patterns for segmentation. Figure 11 depicted as visualize F1 score and all three datasets comparison of image segmentation for each image in test case dataset.

Receiver operating characteristic curve (ROC) showing the maximum result of a proposed model at all segmentation thresholds in STARE dataset.

ROC in the DRIVE dataset.

ROC in CHASE_DB1 dataset.

Comparison of image segmentation results for each image in the dataset a) DRIVE, b) STARE, and c) CHASE_DB1.
In this study, the proposed attention mechanism with a dense fully convolutional neural network shows better results in segmentation of optical vessel. The segmentation of vessels can be done with uncertainty evaluation and it is also compared with state of art methods for vessel segmentation. The public datasets CHASE DB1, STARE, and DRIVE were all evaluated to the suggested methodology. In this situation, the A-DFCNN outperforms other ground-breaking models across several datasets. By using skip connections and the attention method, it is proved that A-DFCNN method is achieved better performance in segmenting the low and high-resolution fundus images. This paves the way for novel area of research in the domain knowledge, in addition to the FCNN model.
Conflict of interest
No, “The authors declare no conflict of interest.”
