Abstract
With the recent advancements in technology, there has been a tremendous growth in the usage of images captured using satellites in various applications, like defense, academics, resource exploration, land-use mapping, and so on. Certain mission-critical applications need images of higher visual quality, but the images captured by the sensors normally suffer from a tradeoff between high spectral and spatial resolutions. Hence, for obtaining images with high visual quality, it is necessary to combine the low resolution multispectral (MS) image with the high resolution panchromatic (PAN) image, and this is accomplished by means of pansharpening. In this paper, an efficient pansharpening technique is devised by using a hybrid optimized deep learning network. Zeiler and Fergus network (ZF Net) is utilized for performing the fusion of the sharpened and upsampled MS image with the PAN image. A novel Dingo coot (DICO) optimization is created for updating the learning parameters and weights of the ZF Net. Moreover, the devised DICO_ZF Net for pansharpening is examined for its effectiveness by considering measures, like Peak Signal To Noise Ratio (PSNR) and Degree of Distortion (DD) and is found to have attained values at 50.177 dB and 0.063 dB.
Keywords
Introduction
Recent development in the field of remote sensing has made it an effective tool in many applications, such as defense, security, precision agriculture, environmental management, and so on. In such applications, there exists a need to accurately discriminate land covers as well as the definition of shapes and textures. For this purpose, it is highly essential to capture images with high spectral and spatial resolutions [1]. The deployment of satellites, like WorldView-2, Pleiades, IKONOS, Kompsat, QuickBird, and WorldView-3 has two space-borne sensors, namely the MS and PAN sensors. MS sensors acquire images with high spectral information but lack of spatial information. PAN sensors have the ability to capture PAN images that have clear texture and edges. However, these images fall short of spectral information. The sensors do not have the ability to provide images with high spectral as well as spatial resolution instantaneously, owing to technical limitations [2, 3]. Thus, to acquire an image of rich spectral as well as spatial resolution, pansharpening is employed. Pansharpening has been a hot research topic in the field of remote sensing and targets at attaining an MS image with a rich spatial resolution by performing the fusion of MS image with PAN image. Basically, it can be defined as the enhancement of spatial resolution in an image by injecting it with spatial information extracted from another image [4]. This step is highly required in several remote sensing image analysis processes, like image correction [5], registration [6], terrain classification [7], image segmentation [8], object detection [9], and so on.
Pansharpening aims at developing an MS image that contains both the spectral and spatial information of the MS and PAN image of the scenario acquired by a single satellite through performing fusion. Image fusion is the process of combining two or more images to generate a high-quality image [10, 11]. Image fusion performs the incorporation of several images acquired from multiple sensors to improve the quality of image, thereby enabling efficient decision-making. Moreover, it makes the image appropriate for easy perception in humans as well as machines. Image fusion helps to preserve the significant details and eliminate any discrepancies in the image by extracting all vital details contained in the image [12]. Furthermore, it can be effectively employed to solve the issue concerning missing data, which may occur due to the contamination of the time-series images captured by satellite because of shadow or cloud. The techniques used in pansharpening can be broadly classified as decision, feature, and pixel levels [13]. The decision-level approaches involve separate processing of input images followed by decision extraction, and then the extracted decisions are then fused. The feature-level techniques consider the fusion of the extracted spectral, structural, and geometrical features, whereas the pixel-level fusion technique performs the image fusion pixel-by-pixel [10].
Over the last few years, there has been an increased focus on developing various methodologies of pansharpening. Generally, the prevailing methods of pansharpening can be classified as, traditional methods as well as deep-learning techniques [14]. The traditional techniques of pansharpening can be further classified as Variational Optimization (VO)-based techniques, Multiresolution analysis (MRA)-based techniques, as well as Component Substitution (CS)-based techniques. The VO-based approach comprises two processes namely devising energy function as well as optimizing solutions. MRA-based techniques decompose the image into a series of bandpass channels, and the high-frequency channel of PAN is inserted into the MS channel. The CS-based techniques project the MS band into a new space, and the component representing spatial information is injected into the PAN image. Finally, a fused image is obtained by performing inverse projection [15]. The recent progression of deep learning approaches has led to remarkable accomplishments in various computerized visual processes and has inspired the use of Deep Neural Networks (DNN) in the field of pansharpening [16]. The DNN-based approaches have the capability to achieve an enhanced fusion effect by learning complex features from a number of samples [17]. With the advent of Convolutional Neural Network (CNN), utilization of deep learning techniques in multiple fields has increased tremendously [34, 35]. CNN has gathered huge interest and has been successively used in the pansharpening. A number of CNN-based techniques have been devised for enhancing the spatial resolution of the image acquired from sensors and also for effectively performing fusion and effectively monitoring the health also [18, 32, 33]. Several deep learning techniques, such as the CNN-based pansharpening method (PNN) [19], a Generative Adversarial Network (GAN) for pansharpening (PSGAN) [20], PanNet [21], etc., have been developed for performing pansharpening [14].
In this paper, a hybrid optimized deep learning method named DICO_ZF Net is devised for fusing the MS and PAN images. The MS images are initially subjected to the sharpening process, where they are sharpened using Contrast-limited adaptive histogram equalization (CLAHE), and later, they are upsampled. These images are then fused with the PAN images by employing ZF Net for obtaining the pansharpened image. Here, the fusion process is carried out by ZF-Net, and it is optimized by means of the DICO algorithm to enhance the effectiveness of pansharpening process.
The key contributions of this research are highlighted as follows;
Devised DICO_ZF Net for pansharpening: A innovative network named DICO_ZF Net is developed for performing the process of pansharpening. ZF Net is employed to fuse the MS and PAN images, wherein the developed DICO algorithm is employed for modifying the learning parameters and weights of network. Here, the DICO algorithm is developed by modifying the Dingo Optimization Algorithm (DOX) as per the COOT optimizer for achieving enhanced performance.
The rest of the article has the following organizational structure: Section 2 details the related works in pansharpening, Section 3 elaborates on the introduced pansharpening technique in detail, and in Section 4, experimental results and the assessment of the method are detailed. The conclusion of the research article is provided in Section 5 with the further scope.
Motivation
Pansharpening is a highly essential process in remote sensing and it is also a crucial pre-processing phase in many image processing applications, such as classification, segmentation as well as feature extraction. The information content of the MS as well as PAN images are complementary to each other and comprise redundant data, which makes it difficult to build an effective technique of pansharpening. This section briefs the existing pansharpening methodologies and the issues encountered that motivated the development of the introduced technique.
Literature review
A large number of research have been carried out for developing effective pansharpening techniques. Here, a few of the devised methodologies are considered, and they are shortly explained with their advantages and demerits. Luo et al. [18] developed an unsupervised CNN-based technique for performing a fusion of the MS and PAN images. Here, features were extracted and fused from the input images by utilizing an iterative network based on the similarity between the MS and PAN images. The technique was extremely effective in generating full-resolution images, but it failed to reduce spatial and spectral distortions. The drawback in [18] was overcome in [22], where Wang et al. developed a Dual-Path Fusion Network (DPFN) for performing pansharpening. The DPFN was implemented using two components, namely the Local Sub-Network (LSN) as well as the Global Sub-Network (GSN). GSN was employed for capturing the global spatial, and textural features, and the global contour was obtained by performing the fusion of PAN and MS images. Further, the spatial information in MS images was enhanced by the use of LSN. This technique successively restored the edges and fine detail for improving the visual effects. However, it was unsuccessful in employing an optimization method for accelerating the process. The drawback listed in [22] was overcome in [17], where Yang et al. devised a low-rank fuzzy fusion model for carrying out pansharpening. The pansharpening method was implemented in two phases, wherein the low-rank fuzzy fusion technique was utilized for fusing high-frequency information of PAN and MS images, and later, an adaptive detail supplement was employed for obtaining the final injection information, which was injected to bands of MS images. This method was highly effective in extracting detailed information without tampering with spectral resolution. However, the computational complexity of this technique was high. An improved tradeoff between computational complexity and performance was achieved by the context-based generalized Laplacian pyramid (GLP), developed by Vivone, G in [23] for pansharpening. Here, the injection coefficients were estimated by using regression-based techniques, which combine the GLP technique and the M-estimators. Moreover, homogenous clusters were created by the application of K-means for segmenting the MS images. The technique can be effectively employed in cases where the images contain vegetated areas, but it was unsuccessful in improving the performance of system.
An enhanced performance approach was introduced in [24], wherein Lai et al. presented a Multi-Scale Fusion Network (MSFN) for fusing the MS as well as PAN images. Extraction of the multi-scale features was effectively carried out by using an encoder-decoder structure, and the information was preserved in an information pool, and later, both were subjected to the multi-scale feature fusion module for reconstructing the MS image. The SFN technique was extremely efficient for real-time implementation, but this method failed to consider the effectual spatial enhancement process. The spatial detail was enhanced by considering an adaptive details injection method in [25], where Wu et al. created a multi-objective decision model for performing pansharpening. Here, the pixels in MS images were intrinsically adapted for eliminating the effect of injected details by using the multi-objective optimization model. Later, the spectral modulation coefficient was computed by a multi-objective decision algorithm, and later the spatial information was enhanced by the adaptive detail injection technique. This technique was highly effective in enhancing the texture and edge of the image, although it suffered from low performance owing to various filter requirements corresponding to various fusion issues. The problem of dedicated filters was overcome by the usage of dynamic filters in Multiscale Dynamic Convolutional Neural Network (MDCNN) [16], developed by Hu et al. for pan-sharpening. Here, the network adaptivity was strengthened by the utilization of filters that were dynamically generated using convolutions. The PAN image features were extracted by dynamic convolutions at various scales and the relationship between the features was adjusted using the weights acquired with the help of the weight generation network. MDCNN was successful in overcoming the gradient vanishing and degradation issues, although it endured high computational time. Reduced time consumption was attained in [14], where Zhou et al. presented an unsupervised generative adversarial network (UCGAN). Fusion was carried out by utilizing a two-stream generator for extracting the features related to modalities of the MS as well as PAN images. Moreover, the output image quality was boosted by the hybrid loss function. The UCGAN was capable of reducing the memory cost as well as time, but it failed to employ an optimization technique for enhancing the performance of the network.
Challenges
The prevailing methodologies of pansharpening encountered the following challenges:
An unsupervised CNN was developed in [18] for pansharpening, which offered pansharpened images with high spectral and spatial quality. Although, this method did not consider the utilization of single image spatial resolution techniques or deep image priors for minimizing the spatial as well as spectral distortions in images ahead of fusion. The spatial, as well as spectral distortions, were minimized by the DPFN developed for pansharpening in [22]. This approach was highly effective for enhancing the high-frequency information of MS bands, although it failed to minimize the computational complexity, and it remained the main challenge. The drawback in [22] was overcome by the MSFN [24] proposed for pansharpening, which can be successfully employed in real-time applications. However, this method was unsuccessful in considering the gradient characteristics of the image in order to enhance the spatial resolution of the image. The spatial resolution was improved by the multi-objective decision model in [25], which was developed for attaining a high-quality fused image. The main challenge encountered by this method was that it did not consider addressing the high dimension image fusion issue by investigating image reduction. The prevailing pansharpening methodologies encounter huge challenges as they need ideal high-resolution MS images for making them applicable in real-time scenarios. Moreover, these techniques suffer from degraded performance when they are applied to full-scale images rather than down-scaled images.
This section describes the process of pansharpening by using devised DICO_ZF Net. Pansharpening is highly critical and is considered a vital pre-processing step in many applications, which require images with high spectral as well as spatial resolution. The spectral details in the low-resolution MS image are integrated with the spatial information of PAN images for obtaining a high-resolution MS image. The devised technique of pansharpening is implemented using the following steps, such as data acquisition, sharpening, upsampling, and fusion. The primary process is the acquisition of MS and PAN images from datasets. The MS image is sharpened using CLAHE [26], and then the sharpened images are upsampled using the upsampling process in Red Green Blue (RGB)-Guided Hyperspectral Image Upsampling algorithm [27]. Later, the PAN image and the upsampled MS image are fused by using the ZF Net, wherein the learning parameters and the weights of ZF Net [28] are modified according to devised DICO algorithm. The devised DICO algorithm is developed by adapting the encircling behavior of Dingoes in DOX [29] algorithm with respect to Coot [30] algorithm. Figure 1 displays the schematic representation of the introduced DICO_ZF Net for pansharpening. The entire process is elaborated in the ensuing subsections.
Schematic representation of the introduced DICO_ZF Net for pan sharpening.
The initial phase in the process of pansharpening is the acquisition of images. Here, pansharpening is carried out using the PAN and MS images, and these images are acquired from datasets.
MS image acquisition
Consider a database
Here,
Consider a database
Here,
The MS image acquired
Here,
Where,
Here, pdf represents the probability density function, Interpolation of pixels is carried out from the mapping functions for preventing block artifacts. Bilinear interpolation is performed for finding the remapped pixel and is represented as,
Here,
Thus, the block artifacts are eliminated by interpolation. Moreover, as the CLAHE performs independent processing of blocks, the computational complexity is minimized. The sharpened image is represented by
The captured MS images have very low spatial information owing to the limitation of sensors and so, for improving the spatial resolution of MS image, sharpened image
Learning the high resolution-low resolution (HR-LR) exemplar
The image
HR hyperspectral image reconstruction
Assume
Here,
Once,
After estimating
Here,
The process of fusion of the MS images with the PAN image using the ZF-Net is elaborated here. The upsampled MS image
ZF-Net for image fusion
Here, fusion of the MS image with the PAN image is carried out by means of the ZF Net, wherein the upsampled MS image
Architecture of ZF Net.
The output obtained from the ZF Net is the pansharpened image
An innovative DICO algorithm is proposed in this paper for optimizing the fusion process, by adjusting hyperparameters of ZF Net, which minimizes the loss and achieves desired outcomes. Here, the DICO is developed by altering the encircling behavior of dingoes, while hunting in DOX [30] with respect to COOT algorithm [29]. DOX is a metaheuristic optimizer, which is developed from the inspiration of the hierarchical conduct of dingoes in pack. The optimizer is created by considering the way in which dingoes target prey. The dingoes normally perform an investigation of area for prey, followed by encircling and then attack, based on these processes, DOX is formulated. The optimizer is extremely effective for finding solutions with less effort in real-time problems. Although, this optimizer did not take into account solving multi-objective matters. On the other hand, COOT is a swarm-based algorithm devised by considering the conduct of coot birds as a swarm on the surface of water. The birds undertake two different kinds of motion on the face of water, such as irregular and regular. In the initial phase, birds fly towards leaders for reaching food supply in an irregular manner and later, every bird fly behind following bird in front thereby generates the coot chain. The coot algorithm is realized by considering different movements, like random motion, chain motion, adjust the position with respect to the group leader, and chaperoning the group to the optimal region. The COOT algorithm is highly efficient for solving engineering optimization issues and it is effective for determining the solutions to unknown search spaces. However, this algorithm suffers from a low convergence rate. Hence, by amalgamating two optimization algorithms, the devised DICO optimizer effectively combines the merits of both algorithms and improves the optimization process of ZF Net. The devised DICO is implemented by means of the following steps.
Step 1: Initialization
The population of dingoes is initialized primarily in the search space, and it can be expressed by using the following equation.
where,
Step 2: Estimation of Fitness
The optimal solution is computed with some flaws, and hence the optimization is regarded to be a minimization problem and so, the objective is to find a solution with minimal error. Thus, the fitness function chosen here is mean square error (MSE), and it is represented by,
where, the obtained output of ZF Net is designated as
Step 3: Encircling
Dingoes are proficient in determining the position of prey. As the position of prey is traced, the entire pack is led by alpha to the position, and pack begins to encircle the prey. The hierarchical conduct of dingoes is modeled based on the assumption that the prey is located by considering the best search method as quest space is not known ahead. Concurrently, other search agents keep refreshing searching methods in subsequent possible strategies. The conduct of dingoes can be formulated using the equations given below.
Here,
Substituting Eq. (14) in (15),
Now assuming
Based on the arbitrary motion of Coot birds, position of bird can be determined from the following equation,
where,
The arbitrary location
Here,
Rewriting Eq. (23), we get,
Now assume,
Substituting Eq. (29) in (22), we get,
Step 4: Hunting
The dingoes do not generally have an estimation about the optimal location of prey position, but the hunting plan is modeled by considering that all dingoes in the pack have data regarding prey position. Normally, alpha directs hunting, but in some scenarios, other dingoes might be participating in the hunting process, and hence, two of the best values are considered. The rest of the pack also has to keep updating their position as per the best search agents, and this can be modeled as,
Moreover, for every dingo, the intensity has to be computed and is expressed as,
Here,
Step 5: Attacking prey
When no position is available for updating, it indicates that the hunting process is completed by dingoes by carrying out an attack on prey. The attacking prey method can be formulated by decreasing the value of
Step 6: Feasibility re-computation
The optimal solution is estimated by taking into account the fitness of the solution using Eq. (13), when the exiting solution has higher fitness than the current solution, then current one replaces the prevailing one.
Step 7: Termination
The entire operation is reiterated for determining the optimal solution till the stopping criterion is attained. Table 1 displays the pseudo-code of devised DICO algorithm.
Pseudo-code of devised DICO algorithm
The integration of DOX and COOT algorithm has resulted in a highly efficient DICO algorithm, which improved the performance of ZF Net for obtaining fusion of two images to attain enhanced Pansharpening.
The section elaborates on the analytical evaluation of devised DICO_ZF Net for pansharpening. The developed DICO_ZF Net for pansharpening is evaluated in comparison to the state-of-art methodologies, and the same is detailed here.
Experimental results of the devised DICO_ZF Net for pansharpening. (a) Input images 1, 2, 3 and 4; (b) sharpened images 1, 2, 3, and 4; (c) upsampled images 1, 2, 3, and 4; (d) fused images 1, 2, 3, and 4.
The implementation of the devised DICO_ZF Net for pansharpening is carried out using PYTHON tool on a system with the following specifications: Windows 10 OS, Intel i3 processor, and 8 GB RAM.
Dataset description
For demonstrating the performance of the devised DICO_ZF Net for pansharpening, the devised technique implemented four datasets, namely Indian Pines, Salinas, Pavia Centre, and University [31].
Indian Pines: The dataset contains scenes, which are collected from North-western Indiana comprising the Indian Pines test location. The data is collected by the Airborne Visible/Infrared Imaging Spectrometer (AVIRIS) sensor and comprises of 16 classes and 224 spectral reflectance bands with a wavelength ranging from (0.4–2.5) Salinas: This dataset contains images captured from the Salinas Valley, California using 224-band AVIRIS sensor. The dataset contains 16 classes comprising areas, such as vineyard fields, vegetables, as well as, bare soils and the region encompasses 512 lines by 217 samples. The images have a spatial resolution of 3.7-meter pixels. Paviacentre: This dataset comprises images captured over Pavia, Northern Italy using ROSIS sensors. It comprises of 9 classes, with 102 spectral bands, and the images captured have a dimension of 1096
Comparative assessment of devised DICO_ZF Net using Indian pines by varying number of bands (a) PSNR and (b) DD.

Here, the experimental outcomes of the proposed method are portrayed using Fig. 3. In Fig. 3a, the input images are portrayed. Figure 3b displays the sharpened image, Fig. 3c and d corresponds to the upsampled and fused images, respectively.
Evaluation metrics
Two evaluation measures are utilized for illustrating the effectiveness of the developed DICO_ZF Net for pansharpening, such as PSNR and DD. These measures are detailed in the following subsections.
PSNR: PSNR is a metric used to compare the pansharpened image with the original image and it is computed by finding the average of all bands and is given by,
DD: DD can be defined as the relative degree by which the object can be misrepresented or distorted and is expressed as,
where,
The devised DICO_ZF Net is tested for its effectiveness by comparing it with traditional techniques, like CNN [18], low-rank fuzzy fusion model [17], Context-based GLP [23], Competitive Multi-Verse Feedback Artificial Tree (CMVFTA)-based Deep Maxout network (DMN), Fractional CMVFTA (FrCMVFTA)-based DMN.
Comparative assessment of the devised DICO_ZF Net using Indian pines by varying neta value (a) PSNR and (b) DD.
Comparative assessment of the devised DICO_ZF Net using Salinas by varying number of bands (a) PSNR and (b) DD.
The introduced DICO_ZF Net is examined for its efficacy using the comparison with prevailing approaches considering the four datasets by varying the number of bands and neta value.
Assessment using Indian pines
The proposed DICO_ZF Net is evaluated using the Indian pines dataset by varying neta value number of bands, which is illustrated in this section.
i) Assessment by varying number of bands
Figure 4 portrays the assessment of the introduced DICO_ZF Net using Indian Pines by varying number of bands. In Fig. 4a, the comparative analysis is portrayed using PSNR. For 20 bands, the PSNR computed by FrCMVFTA-based DMN, CMVFTA-based DMN, low-rank fuzzy fusion model, CNN, Context-based GLP, and devised DICO_ZF Net is 47.917 db, 43.829 db, 19.278 db, 10.666 db, 27.211 dB, and 51.655 db. The evaluation with respect to DD is displayed using Fig. 4b. The value of DD computed by the prevailing methodologies, such as the FrCMVFTA-based DMN, CMVFTA-based DMN, low-rank fuzzy fusion model, CNN, and Context-based GLP is 0.067 db, 0.071 dB, 0.419 db, 0.193 db, and 0.099 db, whereas devised DICO_ZF Net attained a low value of DD is 0.064 db.
ii) Assessment by varying neta
Comparative assessment of the devised DICO_ZF Net using Salinas by varying neta value (a) PSNR and (b) DD.
The assessment of devised DICO_ZF Net is elaborated in Fig. 5 by varying neta value. In Fig. 5a, comparison of various techniques of pansharpening is portrayed based on the PSNR value. When value of neta is 30, PSNR value computed by prevailing techniques, like FrCMVFTA-based DMN, CMVFTA-based DMN, low-rank fuzzy fusion model, CNN, and Context-based GLP is 43.093 dB, 41.944 db, 22.106 db, 11.718 db, and 28.256 dB, respectively, but the devised DICO_ZF Net attained a higher value of PSNR at 45.679 dB. The analysis with respect to the DD is shown in Fig. 5b. The value of DD attained by the methods, like Context-based GLP is 0.106 db, CNN is 0.172 dB, low-rank fuzzy fusion model is 0.417 dB, CMVFTA-based DMN is 0.081 db, and FrCMVFTA-based DMN is 0.077 db, while the neta value is kept at 40. However, a less value of 0.074 db DD is computed by devised DICO_ZF Net.
The proposed DICO_ZF Net is evaluated using Salina dataset by varying the neta value and number of bands and the same is illustrated in this section.
Comparative assessment of the devised DICO_ZF Net using Pavia centre by varying number of bands (a) PSNR and (b) DD.
i) Assessment by varying number of bands
The analysis of devised DICO_ZF Net is displayed using Fig. 6. In Fig. 6a, the effectiveness of the developed approach with respect to PSNR is shown. When 30 rounds were considered, value of PSNR computed is 42.774 db by the introduced technique, which is better than the values of PSNR at 41.177 dB, 39.877 dB, 17.902 db, 20.054 db, and 37.283 db computed by other methods, such as FrCMVFTA-based DMN, CMVFTA-based DMN, low-rank fuzzy fusion model, CNN, and Context-based GLP. The DD-based assessment is demonstrated in Fig. 6b. The developed DICO_ZF Net attained a low value of DD at 0.063 db, whereas the state-of-art methodologies, like FrCMVFTA-based DMN, CMVFTA-based DMN, low-rank fuzzy fusion model, CNN, and Context-based GLP calculated higher values of DD at 0.066 db, 0.071 db, 0.402 dB, 0.183 db, and 0.097 db for 10 rounds.
ii) Assessment by varying neta
The assessment of the proposed DICO_ZF Net by varying the value of neta is displayed using Fig. 7, wherein Fig. 7a shows the evaluation with respect to PSNR and Fig. 7b portrays the assessment considering DD. When the neta value is fixed at 10, the PSNR values attained by the existing methodologies, like FrCMVFTA-based DMN, CMVFTA-based DMN, low-rank fuzzy fusion model, CNN, Context-based GLP, and the developed DICO_ZF Net is 47.432 db, 44.511 db, 25.863 db, 10.402 db, 26.870 db, and 49.273 db respectively for neta value is 10, which shows improved performance. Similarly, when the value of neta is 40, proposed DICO_ZF Net attained a low value of DD at 0.074 db. Although, value of DD computed by the techniques, like FrCMVFTA-based DMN, CMVFTA-based DMN, low-rank fuzzy fusion model, CNN, and Context-based GLP is 0.077 db, 0.081 db, 0.395 db, 0.152 db, and 0.102 db.
Comparative assessment of the devised DICO_ZF Net using Pavia centre by varying neta value (a) PSNR and (b) DD.
Comparative assessment of the devised DICO_ZF Net using Pavia university by varying number of bands (a) PSNR and (b) DD.
Comparative discussion
Comparative assessment of the devised DICO_ZF Net using Pavia university by varying neta value (a) PSNR and (b) DD.
The analysis of the devised DICO_ZF Net is carried out to reveal its effectiveness using the Pavia centre dataset by considering various values of rounds and neta.
i) Assessment by varying number of bands
This section elaborates the assessment of the introduced DICO_ZF Net by considering different rounds and the same is demonstrated in Fig. 8. In Fig. 8a, the PSNR based evaluation is displayed in Fig. 8a. For 40 rounds, the value of PSNR computed by the prevailing methodologies, namely FrCMVFTA-based DMN, CMVFTA-based DMN, low-rank fuzzy fusion model, CNN, and Context-based GLP is 41.009 db, 39.877 db, 17.902 db, 20.054 db, 37.283 db, for 40 rounds. But, the devised DICO_ZF Net computed a higher value of PSNR at 42.601 db. The assessment using DD is portrayed in Figure 8b. The existing methodologies, such as FrCMVFTA-based DMN, CMVFTA-based DMN, low-rank fuzzy fusion model, CNN, Context-based GLP, and the devised DICO_ZF Net computed values of 0.066 db, 0.072 dB, 0.473 dB, 0.091 db, 0.078 db, and 0.063 db for 10 rounds, which shows improved performance.
ii) Assessment by varying neta
The evaluation of devised DICO_ZF Net is carried out by varying the neta values and this is illustrated using Fig. 9. Figure 9a and b displays the assessment with respect to PSNR and DD respectively. When neta value is kept at 30, value of PSNR achieved by proposed DICO_ZF Net is 42.938 db, which is higher than the values of PSNR at 41.335 db, 38.775 db, 18.564 db, 20.568 db, and 37.859 db calculated by prevailing FrCMVFTA-based DMN, CMVFTA-based DMN, low-rank fuzzy fusion model, CNN, and Context-based GLP. While the neta value is 20, DD computed by the introduced DICO_ZF Net is 0.074 db, however, DD calculated by FrCMVFTA-based DMN, CMVFTA-based DMN, low-rank fuzzy fusion model, CNN, and Context-based GLP is, 0.077 db, 0.082 db, 0.459 db, 0.112 db and 0.088 db.
Assessment using Pavia university
The assessment of the devised DICO_ZF Net using Pavia university dataset is evaluated in this section by considering various neta values and rounds.
i) Assessment by varying number of bands
Figure 10 portrays the assessment of the introduced technique by varying the number of rounds. In Fig. 10a, assessment of the devised DICO_ZF Net with PSNR is displayed. The prevailing approaches, like FrCMVFTA-based DMN, CMVFTA-based DMN, low-rank fuzzy fusion model, CNN, and Context-based GLP computed a value of PSNR at 42.800 db, 40.146 db, 19.759 db, 18.404 db, and 35.437 db. However, the devised DICO_ZF Net attained a higher value of PSNR at 44.461 db for 10 rounds. Likewise, the evaluation of the devised approach using DD is depicted in Fig. 10b. For 30 rounds, the devised DICO_ZF Net computed a low value of DD at 0.064 dB, when compared to the values of DD at 0.067 db, 0.072 db, 0.449 db, 0.110 db, and 0.081 db, achieved by the existing approaches, such as FrCMVFTA-based DMN, CMVFTA-based DMN, low-rank fuzzy fusion model, CNN, and Context-based GLP.
ii) Assessment by varying neta
The proposed DICO_ZF Net is evaluated using Pavia university dataset by varying the neta values and the same is illustrated in Fig. 11. In Fig. 11a, the value of PSNR achieved by the approaches, like FrCMVFTA-based DMN, CMVFTA-based DMN, low-rank fuzzy fusion model, CNN, and Context-based GLPforneta value 20 is 43.562 db, 39.072 db, 20.456 db, 18.654 db, and 35.725 db. But the devised DICO_ZF Net attained a higher value of PSNR at 45.253 db, which revealed improved performance. The DD-based assessment of the developed DICO_ZF Net is represented in Fig. 11b. When the neta is set at 40, the DD value attained by the state-of-art approaches, such as FrCMVFTA-based DMN, CMVFTA-based DMN, low-rank fuzzy fusion model, CNN, and Context-based GLP is 0.078 db, 0.082 db, 0.438 db, 0.130 db, and 0.091 db. However, the developed DICO_ZF Net calculated a lesser value of DD at 0.075 db, revealing improved performance.
Comparative discussion
This section deals with a comparative discussion of the devised DICO_ZF Net for pansharpening. The developed DICO_ZF Net is compared to the techniques, such as FrCMVFTA-based DMN, CMVFTA-based DMN, low-rank fuzzy fusion model, CNN, and Context-based GLP by considering parameters, such as PSNR and DD, by varying the number of rounds and neta. The Table 2 displays the values of PSNR and DD computed by the various approaches using Indian Pines, Salina, Pavia centre, and Pavia University datasets corresponding to the value of neta at 50 and 50 rounds. From the table, it can be revealed that the developed DICO_ZF Net computed a value of PSNR at 50.177 dB and DD at 0.063 dB.The high value of PSNR is accounted by the usage of the hybrid optimization algorithm during the training of ZF Net. Further, the utilization of CLAHE for sharpening the MS image has led to the low noise in the image, thereby decreasing the DD.
Conclusion
In this article, a novel framework for pansharpening is introduced with the help of devised DICO_ZF Net. Owing to technical limitations, the sensors mounted in the satellite do not have the ability to capture images of high spectral as well as spatial resolutions. Thus, to improve the quality of images captured, pansharpening is needed. Here, pansharpening is performed by using three processes, namely sharpening, upsampling, and fusion. The MS images are sharpened by using CLAHE and then they are upsampled using the upsampling process utilized in the RGB-Guided Hyperspectral Image Upsampling algorithm. ZF Net is employed for fusing the upsampled MS image and the PAN image, wherein an innovative DICO algorithm is utilized for optimizing the fusion process. Further, the devised DICO_ZF Net is examined for its effectiveness by considering, parameters, such as PSNR and DD. The devised DICO_ZF Net attained a higher value of PSNR at 50.177 dB and a lower value of DD at 0.063 dB, thereby revealing enhanced performance. The proposed technique is used in various applications, like maps updating, change detection, and accurate land use classifications. Also, the comprehensive chromatic and spatial information offered by the pan-sharpened image helps territorial transformation identification, extraction of features, and photo interpretation. The CLAHE process of upsampling is restricted by the appearance of casting shadows, which is a major limitation, and to overcome this, future work focuses on using other effective upsampling techniques.
