Abstract
Hyperspectral (HS) images contain rich spatial and spectral information. Due to its large size, it is difficult to store, process, analyze, or transmit the critical information contained in it. The compression of hyperspectral images is inevitable. Many transform based Hyper Spectral Image Compression Algorithms (HSICAs) have been proposed in the past that work for both lossy and lossless compression processes. The transform based HSICA uses linked lists or dedicated markers or array structure to keep track of significant and insignificant sets or coefficients of a transformed HS image. However, these algorithms either suffered from low coding efficiency, high memory requirements, or high coding complexity. This work proposes a transform based HSICA using a curvelet transform to improve the directional elements and the ability to represent edges and other singularities along curves. The proposed HSICA aims to provide superior quality compressed HS images by representing HS images at different scales and directions and to achieve a high compression ratio. Experimental results show that the proposed algorithm has a low coding memory requirement with a 2% to 5% increase in coding gain compared to the other state of art compression algorithms.
Keywords
Introduction
Unlike human eyes, which can only be exposed to visible light, hyperspectral imaging is an imaging technique that gathers and processes data across the entire range of the electromagnetic spectrum, from visible to near-infrared region [1]. This is in contrast to how human eyes can only detect visible light [2]. The integration of imaging and spectral detection technologies is the key component of hyperspectral imaging [3]. Each spatial pixel in a hyperspectral image is scattered to create dozens or even hundreds of small spectral bands for continuous spectral coverage while imaging the target’s spatial features [4]. As a result, hyperspectral images have a significant ability to identify material/object that appear similar to humans based on their spectral diagnostic capabilities. However, due to the restrictions on the amount of incident energy, the hyperspectral imaging system is frequently degraded. The spatial and spectral resolution of the actual imaging process are always trade-offs. If all other elements are maintained constant to ensure a good signal-to-noise ratio (SNR), the spatial resolution will unavoidably suffer as spectral characteristics rise. Consequently, obtaining a trustworthy hyperspectral image with high resolution continues to be a very difficult challenge [5–7].
Compression is a required step to conserve onboard sensor memory, power, processing time, and computational complexity because of the high volume and correlation of the data [8]. In addition, the process of compression reduces the amount of bandwidth used for data transmission. The onboard sensors work significantly better after the compression procedure. Spatial redundancy and spectral redundancy are the two types of redundancies that exist in a single HS image. While spectral redundancy exists for a single pixel (same spatial location in the frames) between the continuous spectral frames of an HS image, spatial redundancy exists for a frame and is caused by statistical interdependence among the pixels of a frame [9].
On the basis of the data loss and the coding process, the HyperSpectral Image Compression Algorithm (HSICA) for HS images can be categorized [10]. On the basis of data loss, compression algorithms are further classified as lossless, lossy, or near-lossless, while on the basis of coding process, they are classified as Transform Coding (TC) [11], Predictive Coding (PC) [12], Vector Quantization (VQ) [13], Compressive Sensing (CS) [14], Tensor Decomposition (TD) [15], Learning-based Compression (LC) [16], and Neural Network (NN) based compression [17]. Data loss is avoided by lossless HS image compression methods, whose reconstructed images are nearly identical to the originals, but those of lossy HS image compression algorithms are not. For lossy compression techniques, the compression ratio (CR) is high, while for lossless compression, the coding gain is large [18].
In both lossy and lossless compression settings, transform coding is the most accepted compression method for HS image compression process. Mathematical transforms are used by these compression algorithms to move the image from time domain to the frequency domain. Most commonly used mathematical transforms for HS image compression include the Discrete Fourier Transform (DFT), Discrete Cosine Transform (DCT), Dyadic Wavelet Transform (DWT), Discrete Wavelet Packet Transform (DWPT), Integer Wavelet Packet Transform (IWPT), and Karhunen-Loeve Transform (KLT) [19]. In order to compress data, these mathematical transformations eliminate spectral and spatial correlation of HS image. The performance of these algorithms is superior even at relatively low data rates. The main drawback of the TC based Spectral Image Compression Algorithms subclass is the high complexity it entails due to the large number of mathematical computations it performs. The wavelet transform is used by some compression algorithms to attain a high compression ratio. These algorithms are also work with the predictive coding, vector quantization, compressive sensing, tensor decomposition, and learning-based compression. Popular state-of-the-art TC compression techniques include 3D-Set Partitioning in Hierarchical Trees (3D-SPIHT), 3D-Set Partitioning Embedded Block (3D-SPECK), 3D-Listless SPECK (3D-LSK), and 3D-No List SPIHT (3D-NLS) [20–28].
Compression using the predictive coding algorithm is easily implemented in hardware. In order to anticipate the value of a pixel in the pth frame, the PC compression algorithm first removes the spatial association between pixels in the previous p frames using low level mathematical calculations. The PC compression method offers superior performance on average while yet maintaining low complexity and a high bit rate requirement. Compression with no loss of quality, or nearly as good, is possible using a personal computer [12].
The training (code book generation) and coding process (matching of code vector) of the vector quantization based HS image compression technique are the two processing phases. Through an ideal approach, X code words are used to represent Y training vectors, resulting in the formation of code words (designing of the code book). A block cube representation of the input HS image is then translated into an n-dimensional vector space. The least-destructive vector in the codebook is identified, and the index to that entry is transmitted to the receiver. At the decoder end, the index is used to look up a matching codebook, and the code-vectors are regenerated to recreate the original image. High-quality lossless and near-lossless compression can be achieved with the VQ-based HSICAs. The fact that the complexity is a huge negative point for the VQ-based HS image compression algorithms [13].
The coding difficulty is moved from the encode end (the satellite HS image sensor) to the decoder end (earth station receiver) using the CS based HS image compression algorithms. Real-time applications of these algorithms require little memory resources during compression. The compression algorithm receives one slice of high-resolution (HS) image data, reduces its size, and sends it to a ground station. The main benefits of CS-based compression algorithms are their high compression performance, minimal encoder complexity, and modest coding memory required, while the main drawbacks are their high cost and complexity of decompression [14].
The Tensor Decomposition based HS image compression algorithms applied with the TC, PC or LC based HS image compression algorithms. Tensors can be simply divided into n-by-n matrices. A compression procedure is applied to the HS picture stored in the tensor, and the resulting low-dimensional tensor is then decomposed. The channel receives this encoded low-dimensional tensor. These algorithms are data-dependent, require a manual parameter update technique, and are complicated to code, yet they run faster and use less code than others [15].
Machine learning and deep learning are used in the learning-based compression methods for HS image compression. The LC technique is used in conjunction with the PC. Although a high compression ratio had been attained, it came at a significant complexity and resource expense [16].
The numerous layers of the neural network based HS image compression methods are (input, hidden, and output layers). The HS image is broken up into tiny cube sub-blocks. Neurons in the NN network are the size of sub-block cubes. The weights and biases of the NN are used to determine the output with the assistance of other layers. These algorithms featured intricate hardware architecture and very high coding complexity [17].
This article is described in various sections. Section 2 describes the related work associated to the proposed compression algorithm. Section 3 cover the detail description of the proposed compression algorithm 3D Memory Efficient Listless SPIHT (3D-MELS). Section 4 provide the comparative analysis between the 3D-MELS and other set partitioned compression algorithms on different image quality metrics includes coding efficiency, coding memory and coding computational complexity. Finally, Section 5 concludes this manuscript with the future scope.
Related work
The memory of any HS image sensor is constrained. An individual HS image might be up to 100 MB in size. Thus, the onboard HS sensor with a memory of 128 GB can save, on average, 900 to 1280 HS photographs. The HS image compression becomes a crucial step in order to conserve memory space, reduce sensor power consumption, and shorten processing time. The TC based HS image compression algorithms main advantage is that they may be used for any sort of compression (lossy, lossless and near lossless) [27]. This section includes a brief summary of the related research for the suggested compression algorithm.
3D set partitioned transform based HS image compression algorithms
When compared to other types of transform compression algorithms, the 3D set partitioned transform based HS image compression algorithms have multiple advantages in areas like embeddedness, low computational complexity, low coding memory demand, and good coding efficiency [28]. These algorithms are further classified into four distinct categories based on the set partitioning rule.
Apart from the sub categorization of set partition HSICAs based on set partitioning rule, it can also have classified on the basis of the lists type such as list based HSICA [20–22], listless HSICA [23–27] list and marker based HSICA [28] and array based HSICA [18]. Table 1 gives a short overview of state of art set partitioned HSICAs.
Set partitioned transform based HS image compression algorithms
Set partitioned transform based HS image compression algorithms
In the last decade, there are many transform based HS image compression algorithms have been proposed. These compression algorithms use the properties of the wavelet transform to achieve compression by aggregating a large number of insignificant coefficients either in a spatial zero tree [21] or spatial zero block cube [22] or spatial zero block cube tree [20]. The curvelet transform, enables the directional analysis of an image in different scales. Whereas the wavelet transform may identify point similar singularities and detect scale and position information, the curvelet transform has curve similar singularities and gives information about scale, location, and orientation [11]. Thus, it is used in multiple application ranging from HS image denoise, HS image classification to HS image compression [31–35].
Proposed 3D memory efficient listless SPIHT (3D-MELS)
The HS image sensors have limited memory and power resources. Therefore, it is necessary to design a less sophisticated compression technique that requires little memory for implementation on HS image sensors with high coding gain and low coding complexity. The proposed 3D-MELS is a unique listless version of the 3D-SPIHT that only needs a small amount of memory to keep track of the sets of coefficients. The 3D-MELS, 3D-NLS, and 3D-SPIHT both adhere to the same partitioning rule. Generates same amount of output bits but slightly different bit stream. The 3D-MELS does not required any linked list or state table or marker for the tracking of the state (significant/insignificant) of coefficients. The 3D-MELS uses markers only for the tracking of the state of sets or subset of the zerotree.
Using the fact that seven-eighths of the total coefficients are located in the wavelet pyramid’s bottom seven sub-bands and do not have any descendants, therefore no SMZ (State Marker of Zerotree) value is required to be assigned. That is, just one-eighth of the zerotree’s nodes should have an SMZ value assigned. The proposed compression algorithm has two passes, Initialization Pass (IP) and Bit Plane Pass (BPP), just like previous listless set partitioned HSICA 3D-LEZSPC [11] and 3D-LCBTC [28].
Initialization pass (IP)
The proposed HSICA starts from the top bit plane ‘n’ and encoding process runs till the last bit plane or bit budget exhaust. Morton mapping or linear indexing scheme is then used to convert the transform HS image (often recorded in raster format) into a 1D linear index. Linear indexing’s key feature is that it requires only a single operation to provide the operations on coefficient locations necessary for both tree-based and block-based set division HSICA. Each zerotree node present in the LLL sub-band is allotted the SMZ marker as ‘2’ while the other zerotree nodes allotted the SMZ marker as ‘1’.
Bit plane pass (BPP)
The BPP starts from the testing of the significance of the zerotree node (against the current threshold) present in the LLL sub-band for highest bit plane. If the Type A zerotree is found significant, it gets partitioned into the eight children and corresponding grand descendant sets. If any grand descendant found significant then eight new Type A zerotree formed and allotted the SMZ marker as ‘1’ while original parent (root) node reassign the SMZ value from ‘1’ to ‘3’. After that, immediate offspring nodes are checked for significance for current threshold and coded accordingly.
For the next subsequent bit planes, the immediate offspring of a parent node of a significant Type A set may be insignificant, significant or requirement for refinement bit only (previously significant coefficient). The 3D-MELS scan the zerotree nodes having SMZ value ‘2’ (Type B) and also check the significance status of eight children of the parent node for their insignificant, significant or requirement for refinement bit. If any Type B set is significant, then eight new nodes will be assigned Type A and root node assign the new value of SMZ marker as ’3’, otherwise it remains as SMZ equal to ‘2’. If we do not check the status of eight offspring of the parent node before significant test of Type B set, then the immediate eight offspring of the parent node will remain unchecked in the consequent passes until Type B set become significant.
So, in the event where both Type A and Type B are important, we have already used the marker SMZ=’3’ at the parent node. In this example, the suggested technique assessed the significance/insignificance/refinement of the parent node’s immediate four children throughout the search of the tree node in the subsequent passes if the node had SMZ=’3’. Important zerotree is determined for each node in their turn.
The 3D-NLS uses four markers to define the state of the coefficients, and two different markers to define the state of the zerotree (Type A or Type B). Beside these markers, skip markers are also used to define the state of the zerotree node at the lower level of an insignificant zerotree. The count of skip marker in 3D-NLS depends on the level of decomposition of HS image. But, 3D-MELS does not use any skip markers or use markers to define the state of the coefficients. 3D-MELS uses markers to define the state of the zerotree only. The detail description of the use of the markers are covered in the Table 2.
Detail description of the SMZ markers used in proposed algorithm
Detail description of the SMZ markers used in proposed algorithm
The decode of the proposed HSICA follows the similar procedure as the encoder. The encoding algorithm of 3D-MELS is given in Table 3. The process flow of the 3D-LMZC is shown in Fig. 1.
Pseudo code of 3D-Memory Efficient Listless SPIHT (3D-MELS) encoder

Flow of transform based 3D-MELS.
We evaluate the performance compression algorithms on four standard benchmark HS image dataset named as Washington DC Mall, Cuprite, Urban, and Jasper Ridge. All compression algorithms are simulated on the simulation software Matlab 2019A (64 bit windows 11 operating system) and hardware platform (20 GB RAM, 11th generation i5 system). Hyperspectral Digital Imagery Collection Experiment (HYDICE) HS image sensor captures Washington DC Mall (HS Image I) and Urban HS image (HS Image II) while Cuprite (HS Image III) and Jasper Ridge HS image (HS Image IV) were captured by the airborne visible/infrared imaging spectrometer (AVIRIS) HS image sensors. For the simulation, the HS images are cropped from the left top corner to the size of a cube and zero padding is done if it is required. The description of HS images used for the experiment is shown in Table 4.
Description of HS images used for the experiment
Description of HS images used for the experiment
The five level curvelet transform is applied to the HS image. The transform HS image is converted to the 1D array through the Morton mapping (linear indexing) [28, 32]. The coefficients (transform) present in the 1D array are encoded by the compression algorithm accordingly and generates the embedded bit stream for the transmission process. The coding efficiency (image reconstruction quality) is measured by the peak signal to noise ratio (PSNR), Bjøntegaard delta peak signal-to-noise rate (BD-PSNR), Structural Similarity Index Measure (SSIM), and Feature Similarity Index Measure (FSIM). The PSNR is calculated in decibel (dB) while other parameters are unitless [28]. The coding memory is the memory space which is use by the compression algorithm during the encoding or decoding process. It is calculated as kilobyte (kB) or megabyte (MB). The computational complexity of the compression algorithm is measure as the time taken by the encoder (for the encoding process or generation of the output bit stream) and decoder (for the decoding process or reconstruction of the HS image from the received bit stream) for the achieving the compression of the HS image. Performance parameters such as coding efficiency, coding memory, and computational complexity are described below.
In order to reach a minimum acceptable quality of the reconstructed HS image, coding efficiency is typically expressed in terms of the average number of bits per pixel in the coded bit-stream. Peak-Signal-to-Noise-Ratio (PSNR) is used to assess the objective quality of the reconstructed image and mathematically it is represented as Equation 1. The MSE is calculated through Equation 2.
The proposed HSICA 3D-MELS uses set partitioned rules, so comparison of state of art transform based HSICAs coding efficiencies is something that is interesting to look at. Table 5 give the comparative performance of the HSICA with the proposed 3D-MELS. It has been observed that 3D-MELS has high coding efficiency than other HSICA. This is because use of the curvelet transform. When it comes to the compression of HS images, the performance of curvelet transform-based compression algorithm is superior to that of wavelet transform-based compression algorithm.
Reconstructed quality of HS images at different bit rates for the 3D-MELS and other state of art HSICA
Since the curvelet transform is superior to the wavelet transform in terms of its ability to rebuild an HS image and eliminate redundancy from HS image, it is advantageous for use in image compression if the coefficient of neglect is large [33–35]. The 3D-MELS follows the same partition rule as 3D-SPIHT and 3D-NLS. It has been clear from the Table 5 that 3D-MELS outperform almost all bit rates. The variation between the PNSR of 3D-MELS and 3D-SPIHT exists in the range of –0.08 dB 0.84 dB for HS Image I, 0.53 dB 1.67 dB for HS Image II, 0.2 dB 0.66 dB for HS Image III and 0.23 dB 0.76 dB for HS Image IV. In the same way, the variation between the PNSR of 3D-MELS and 3D-NLS exists in the range of 0.45 dB 0.99 dB for HS Image I, 0.55 dB 1.78 dB for HS Image II, 0.22 dB 0.67 dB for HS Image III and 0.12 dB 0.86 dB for HS Image IV.
It has been known that a bit corresponding to a new significant coefficient improves image quality (PSNR) more than a bit used to tweak an existing coefficient. This means that a lower number of decoded new significant coefficients means a lower PSNR. Image Quality of 3D-SPIHT, 3D-NLS, and 3D-MELS for two HS images is tabulated in Table 6. It has been also clear from the table that 3D-MELS has higher number of significant bits (newly) and refinement bits for almost all bit rated which let it to be high coding efficiency. The SSIM is a way to measure the quality of an image by figuring out how similar two images are (original HS image and reconstructed HS image) [36]. Table 7 shows that the SSIM index is a bit higher than the other HS image compression algorithms. The SSIM is calculated through Eq 3.
Image quality of 3D-SPIHT, 3D-NLS, and 3D-MELS for two HS images
The SSIM performance comparison of 3D-MELS with recent state-of-the-art HSICAs
Where C1 and C2 are constant while mean averages are represented by the μA (input HA image) and μB (reconstructed HS image). In the same way, variance is represented as
The Feature-Similarity (FSIM) index maps the features and measures how similar the original HS image and the reconstructed HS image are [37]. FSIM is covered in the Table 8. The BD-PSNR calculation for the Bjontegaard metric is done for each of the four HS images under test [38]. The 3D-MELS has a superior performance with reference to other compression algorithms as covered in Table 9.
Feature-Similarity (FSIM) index performance comparison of 3D-MELS with recent state-of-the-art HSICAs
Bjøntegaard Delta PSNR gain of 3D-MELS with other HSICAs for 15 different bit rates
On-board memory (RAM) on the order of several gigabytes is standard for many of the low-cost sensor nodes [39]. The amount of memory that must be available to an image coder with sequential transform and coding stages is equal to the maximum amount of memory that must be available to either stage individually [40]. It has been known that the listless based compression algorithms have fixed memory demand regardless of the data rate and it is constant through out the compression process while the list based compression algorithms have varied coding memory demand depend upon the bit rate of the compression process [41]. As the bit rate increases, the requirement of the coding memory also increases and it increases exponentially as it is clearly observed in the Table 10. The 3D-ZM-SPECK and 3D-BCP-ZM-SPEC has no coding memory requirement while other listless compression algorithms have constant coding memory requirements due to the use of different markers in compression process. 3D-NLS uses the three different type of markers (define the state of coefficient, define the state of zerotree and skip marker). 3D-MELS uses only one type of marker to define the state of the coefficient (single zerotree node) and state of the zerotree. 3D-MELS outperform with the other compression algorithm but has inferior performance with the 3D-ZM-SPECK and 3D-BCP-ZM-SPECK.
Coding memory performance comparison of 3D-MELS with recent state-of-the-art HSICAs
Coding memory performance comparison of 3D-MELS with recent state-of-the-art HSICAs
The proposed compression algorithm’s computational complexity is evaluated by estimating the amount of time needed for encoding the transformed coefficients and for decoding the generated bit-stream at each bit-rate. This evaluation determines the algorithm’s overall level of computational difficulty [40, 42]. The decoding process is faster than the encoding process for all HSICAs for the following reasons: There is no requirement for the comparison operation in the decoding process. In the decoding process, effective skipping of the sub-bands, block cubes, or coefficients is performed rather than during the encoding process.
Because a greater number of operations are necessary when attempting to achieve a higher target rate, the complexity of an algorithm is also influenced by the target rate. The processing of the lists is responsible for a significant amount of the overhead computations in the list-based HSICAs. In order to reconstitute a coefficient, these compression techniques demand many passes over bit planes. The list of significant pixels (LSP) in 3D-SPIHT has coefficients that are subsequently processed by the 3D-SPIHT algorithm, which accesses those coefficients many times through lists in order to reconstruct itself. It has been clear from the Tables 11 12 that 3D-MELS outperform to the 3D-SPECK, 3D-SPIHT, 3D-WBTC, 3D-NLS, 3D-LMBTC and 3D-ZM-SPECK. It gives the similar complexity performance with the 3D-LSK but has inferior performance to the 3D-BCP-ZM-SPECK. The 3D-BCP-ZM-SPECK uses the functionality of the parallel processing to achieve the low complexity.
Encoding time (coding complexity) performance comparison of 3D-MELS with recent state-of-the-art HSICAs
Encoding time (coding complexity) performance comparison of 3D-MELS with recent state-of-the-art HSICAs
Decoding time (coding complexity) performance comparison of 3D-MELS with recent state-of-the-art HSICAs
The visual representation of the HS images (before compression & after compression process) for the four sub bands (frame 30, 60, 90 & 120) of Urban HS image and Washington DC MALL HS image is shown in the Figs. 3. In the same way, visual illustration for the Cuprite and Jasper Ridge hyperspectral images for four different frequency frames (20, 40, 60 and 80) is covered in Figs. 5.

Original Urban HS image (a) Frame 30 (b) Frame 60 (c) Frame 90 (d) Frame 150. Reconstructed Urban HS image with CR 10 (e) Frame 30 (f) Frame 60 (g) Frame 90 (h) Frame 150.

Original Washington DC MALL HS image (a) Frame 30 (b) Frame 60 (c) Frame 90 (d) Frame 150. Reconstructed Washington DC MALL HS image with CR 14 (e) Frame 30 (f) Frame 60 (g) Frame 90 (h) Frame 150.

Original Jasper Ridge HS image (a) Frame 20 (b) Frame 40 (c) Frame 60 (d) Frame 80. Reconstructed Jasper Ridge HS image with CR 13 (e) Frame 20 (f) Frame 40 (g) Frame 60 (h) Frame 80.

Original Cuprite HS image (a) Frame 20 (b) Frame 40 (c) Frame 60 (d) Frame 80. Reconstructed Cuprite HS image with CR 16 (e) Frame 20 (f) Frame 40 (g) Frame 60 (h) Frame 80.
The onboard HS image sensor only has a limited amount of resources to use for processing and transmitting HS images to the earth station. In this research, a unique technique is proposed that outperforms conventional transform-based HS image compression algorithms that are currently considered state of the art in terms of coding efficiency and computational complexity. The coding gain has been increased across virtually the whole range of bit rates, and it has been demonstrated that the proposed 3D-MELS has a positive Bjontegaard Delta PSNR gain across all HS images.
The curvelet transform requires a lower number of coefficients in order to accurately represent the edges in the HS image, which results in an increase in the coding gain. When compared to other zerotree HSICAs, such as 3D-SPIHT and 3D-NLS, the coding memory consumption is greatly decreased. The intricacy of the computational complexity just a little bit increased. The low coding memory needs of 3D-MELS more than make up for the algorithm’s slightly higher level of complexity. The coding gain can be improved with the application of the bandlet and ripplet transforms. By utilizing a lower total number of markers, the load placed on coding memory can be decreased.
Declarations
Footnotes
Acknowledgments
We are sincerely thankful to the anonymous reviewers for their critical comments and suggestions to improve the quality of the paper. The authors want to express his gratitude to Integral University, Lucknow for providing manuscript number IU/R&D/2023-MCN0001906 for the present research work.
Authors’ contributions
Chandra developed the algorithm, algorithms simulation, prepare the manuscript, Bajpai supervised & Alam, Chandel, Panday and Pandey reviewed the manuscript.
Competing interests
The authors have declared that no competing interests exist.
Ethical approval
Not applicable
Consent for publication
All authors agreed on the final approval of the version to be published.
Funding
The authors receive no funding for the publication of the manuscript.
Data availability
Data is available within the manuscript.
