Abstract
As a new tool of signal and image analysis, the dual-tree complex wavelet transform (DTCWT) has been found very useful for a lot of applications in machine learning and pattern recognition. In this paper, we present a method for denoising of images corrupted with additive white Gaussian noise, which employs DTCWT on noisy image to obtain complex coefficients with properties of approximate shift invariance and directional selection. Furthermore, our method adopts block thresholding scheme in denoising procedure, which can empirically choose the optimal block size and threshold at each resolution level by minimizing Stein’s unbiased risk estimate. Experimental results on several benchmark images show that our method outperforms most conventional term by term or block thresholding based methods for eliminating different levels of noise. Moreover, its denoising result is compared to state-of-the-art denoising methods with finer structures preservation.
Introduction
Denoising is often used as a crucial pre-processing step in image processing, which aims to preserve important structures of the image while removing noise. So far, wavelet denoising methods with thresholding or shrinkage scheme have received a lot of attention from many researchers. Such methods perform well under a number of applications since the interest information about a signal is essentially contained in relatively small number of large wavelet coefficients and the rest of small wavelet coefficients can be set to zero. The classical thresholds include Universal threshold [1], SureShrink [2] and BayesShrink [3]. However, these methods sometimes exhibit visual artifacts since they assume that the wavelet coefficients are independent, which is not always realistic [4, 5]. Therefore, wavelet thresholding rules considering statistical dependencies among wavelet coefficients were present to solve the problem. Wang et al. [6] proposed an adaptive shrinkage denoising method by using neighbourhood characteristics, which can obtain better results than Universal threshold and SureShrink based methods. Sendur et al. [7] proposed non-Gaussian bivariate distributions by considering the dependencies between the coefficients and their parents, and the corresponding nonlinear threshold functions were derived from a Bayesian estimation based model. Experimental results validated the effectiveness of the bivariate shrinkage rules. Chen and Han [8] proposed a new threshold by capturing the dependency of inter-scale wavelet coefficients, whose values are adaptive to the changing statistics of images while maintaining the denoising efficiency. Chen et al. [9] proposed a thresholding image denoising method by considering a small neighbourhood around the customized wavelet coefficient.
Block thresholding proposed by Cai et al. [10, 11] is another scheme considering the influence of neighbourhood wavelet coefficients for denoising. In such method, the wavelet coefficients are considered within overlapping blocks. If the sum of squared wavelet coefficients in a block is larger than a given threshold, the coefficients should be retained; otherwise all of them are set to zero. Block thresholding can achieve the exact optimal rates of convergence over a range of Besov bodies. The superiority of block thresholding method has been proved via the minimax approach under the Lp risk [12, 13]. The adaptivity of block thresholding, however, depends on the choice of block size and threshold level [14]. Cai and Zhou [15] proposed a data-driven block thresholding method, which empirically chooses the block size and threshold at each resolution level by minimizing Stein’s unbiased risk estimate. The procedure is easy to implement and sharply adaptive over a class of Besov bodies. Experimental results show that it has significant advantages over the most conventional wavelet thresholding methods with fixed block sizes. Chesneau et al. [16] also proposed a similar Stein based block denoising method, and compared it with state-of-the-art denoising methods on a large set of test images. Moreover, the combination block thresholding rules were proposed by Autin et al. [17, 18], which can generate larger maxisets and achieve better denoising performance.
Note that the discrete wavelet transform (DWT) has been most commonly used in thresholding denoising methods mentioned above. However, DWT is not shift invariant and lacks phase information. To overcome the problem, Kingsbury proposed a new form of DWT, called dual-tree complex wavelet transform (DTCWT), which can provide properties of approximate shift invariance and directional selection while preserving computational efficiency [19–21]. Since the magnitudes of complex coefficients are more strongly dependent among interscale and intrascale neighborhoods, substituting DTCWT for DWT can improve the performance of thresholding denoising method [22]. Several successful DTCWT based signal and image denoising methods can be referred to [23–25].
Main contributions of the paper include: (a) A Stein’s unbiased risk estimate (SURE) based block thresholding image denoising method has been proposed. Shift invariance wavelet coefficients on six directional subbands have been obtained by using DTCWT. After that, the block size and threshold at each subband can be determined by minimizing SURE. (b) The proposed method has been tested on several benchmark images, and compared with some state-of-the-art methods for effectiveness. The rest of this paper is organized as follows. Section 2 reviews the theory of 2-D DTCWT. Section 3 describes SURE based block thresholding in DTCWT domain, and presents the corresponding image denoising method. Section 4 analyzes the experimental results and section 5 addresses the conclusions.
2-D dual-tree complex wavelet transform
The DTCWT calculates the complex transform of a signal using two separate DWT decompositions (tree a and tree b). If the filters used in one tree are specifically designed different from those in the other, it is possible for the first DWT to produce the real coefficients and the second DWT to produce the imaginary coefficients. Figure 1 shows the block diagram for a 3-level DTCWT decomposition with a different set of filters at each stage.
Note that good shift invariance of DTCWT requires proper filter design. In this paper, (13, 19)-tap biorthogonal filter set is adopted at the first decomposition level, and 14-tap linear-phase Q-Shift filter set [21] is adopted at the other decomposition levels. Given two signals with 5 samples shift (see Fig. 2a), the real coefficients computed by using DWT with Daubechies14 filters and the magnitude of the complex coefficients computed by using DTCWT filters mentioned above are shown in Fig. 2b and c respectively. The total energies at scale 5 of the two signals are 0.1979 and 0.2292 for DWT, whereas the total energies at scale 5 of the two signals are 0.9435 and 0.9446 for DTCWT, which is nearly constant.
In 2-D DTCWT, an image signal f (x, y) is decomposed by using a complex scaling function and six complex wavelet functions as follow
Considering the 2-D wavelet φ (x, y) = φ (x) φ (y) associated with the row-column implementation of the wavelet transform, if φ g (t) is approximately the Hilbert transform of φ h (t), i.e., φ g (t) ≈ H {φ h (t)}, one can obtain the following six wavelets [22]:
for i = 1, 2, 3, where the two separable 2-D wavelet bases are defined as follows:
In fact, the 2-D DTCWT is implemented by taking sum/difference of two separable wavelet filter banks. Since each of the above six wavelets are aligned along a specific direction, the 2-D DTCWT can capture more image information than the conventional wavelet transform.
We first decompose a noisy image by using the complex scaling function and six complex wavelet functions defined in Equations 2–6. Suppose we obtain the complex wavelet coefficients , where j denotes the resolution level (1 ≤ j ≤ j0, j0 is the number of decomposition level), and k denotes the orientation (k ∈ α = {±15°, ± 45°, ± 75°}). For the sake of simplicity, we index the wavelet coefficients using one number, writing d l in place of , where l = 1, 2, …, N.
Considering the problem of estimating an image corrupted by additive white Gaussian noise of variance , it can be turned into the problem into the wavelet domain: estimate y
l
given d
l
= y
l
+ η
l
, . Now we estimate y = (y1, y2, … y
N
) based on the coefficient observations d = (d1, d2, … d
N
) under the average mean squared error as follow
Assume that n (n ≤ N) is the number of 2-D DTCWT coefficients at given resolution level and oriented direction, and it divisible by L for simplicity. Let m = n/L be the number of blocks, where L is the block size. Then the sum of squared empirical wavelet coefficients in the i-th block, denoted as B
i
, can be calculated as follow
where .
In our block thresholding procedure, if is larger than given threshold λ, all the coefficients in B
i
are retained; otherwise all the coefficients in B
i
are set to zero. For a given block B
i
the threshold estimator can be defined as follow
Furthermore, a weakly differentiable function, denoted as , can be defined as follow
According to Stein’s theory, one can obtain
Therefore, the total risk can be calculated as follow
where is an unbiased estimate of risk.
Thus, the optimal block size and threshold, denoted as L
S
and λ
S
, can be derived by minimizing SURE.
Now, our proposed block thresholding image denoising method with dual-tree complex wavelet transform can be described as follows. Perform 2-D DTCWT on the noisy image according to Equations (2–6), and obtain the complex coefficients at all directional subbands and resolution level, At each resolution level and oriented direction, select the block size and threshold according to Equation (13), Estimate all complex coefficients obtained in step (a) by using the hybrid Block James-Stein rule [15], Perform inverse 2-D DTCWT to obtain the denoised image.
In this section, we perform extensive experiments on several benchmark images (see Fig. 4) to validate the effectiveness of our proposed method. All experiments are performed on the platform of Lenovo M7100 with Dual core 2.83 GHz CPU and 4G RAM.
Decomposition level of DTCWT
We first determine the optimal number of decomposition level for DTCWT through experiments. All six original images are added with white Gaussian noise of mean zero and variance 900 (σ n = 30). The PSNR results of the proposed denoising method with different DTCWT decomposition level are show in Fig. 5. Taking Baboon as an example, the PSNR is 30.39 at decomposition level 1, and the PSNR improves to 30.77 at decomposition level 2. The PSNR is 32.05 at decomposition level 3, which obtains a significant improvement. The PSNR decreases to 31.60 at decomposition level 4, and further decreases to 31.37 and 31.12 respectively at decomposition levels 5 and 6. The similar results can be obtained on Barbara, Lena, Peppers, Boat and Hill. It is illustrated that the proposed method can obtain the best denoising performance when the number of DTCWT decomposition level is three.
Comparison of thresholding scheme
In this sub-section, the performance of four thresholding procedures, i.e., SureShrink [2], Bayesshrink [3], DenBlock [11] and our proposed block thresholding procedure are compared for DTCWT based image denoising. The former two procedures compute thresholds through term by term scheme, whereas the latter two procedures compute thresholds through block scheme. In the experiment, the parameters of SureShrink, Bayesshrink, and DenBlock are set as configured by their authors. While implementing our proposed method, the threshold is calculated at each resolution level, and used to process the high frequency DTCWT coefficients of images. The image denoising results of DTCWT+SureShrink, DTCWT+Bayesshrink, DTCWT+DenBlock, and our proposed method are recorded in Table 1. Note that all six original images are added with white Gaussian noise of mean zero and variance 900 (σ n = 30), and the DTCWT decomposition level of all methods are set to 3. As is shown in the table, our proposed method indeed achieves the best PSNR results on all of the test images. This is because our proposed method determines the block size and threshold at each DTCWT resolution level by minimizing SURE. DTCWT+DenBlock obtains slightly lower PSNR since it chooses the fixed block size for thresholding, which lack of local adaptive. As for the two term by term thresholding based methods, DTCWT+ Bayesshrink outperforms DTCWT+ SureShrink on most cases, since it aims to achieve the global minimax rate of convergence, while considering the local density in a Besov class.
Comparison of conventional methods with different noisy level
In this sub-section, we implement our proposed method on the six test images varying the level of Gaussian white noise (σ n = 10, 20, 30, 40, 50). Four other conventional denoising methods are performed for comparison: (a) the thresholding method using contourlet transform [26], (b) the multiple local thresholding method using curvelet transform [27], (c) the interscale dependency based thresholding method using DTCWT [28], (d) the thresholding method using ridgelet transform and DTCWT [29]. All methods estimate the noise variance by median estimation and implement three-level decomposition on input image to give a fair comparison.
Table 2 lists the corresponding results of the five denoising methods on six test images. It can be seen from the table that methods of [26] and [27] yield relatively lower PSNR values. This is because these two methods assume that the coefficients in transform domains are independent, which is not suitable for describing the features of most natural images. Since the method of [26] uses a single value as the global threshold, its denoising performance is worse than that of method of [27]. Compared to the former two methods, method of [28] can obtain better PSNR values except on very low noise level (σ n = 10). This may be due to the fact that it considers the dependencies between wavelet coefficients and their parents in detail for threshold selection, and implements DTCWT to keep the shift invariant property of input images. Our proposed method is nearly 3.99 dB better than method of [26], nearly 1.67 dB better than method of [27], and nearly 0.84 dB better than method of [28] for Baboon at the average of all noise levels. The similar results can be obtained on the other five test images. It is validated that the effectiveness by combining block thresholding and dual-tree complex wavelet for image denoising. Especially when the noise level is high, the advantage of our proposed method is more obviously (see Figs. 6 and 7 for example). Note that although method of [29] yields the best PSNR values, the whole process is the most complicated since it need to partition the image into a number of overlapping squares and compute the 2-D FFT of the image. Our proposed method, however, can approximate the performance of method of [29] with less execution time.
Visual results for the six test images are shown in Figs. 8–13. Although contourlet and curvelet have certain advantages over the classical wavelet, methods of [26] and [27] introduce lower denoising performance due to the diverse and irregular patterns contained in the natural images. Obviously, methods of [28] and [29], and our method can better recover the periodic textures at different orientations in Baboon and Barbara, and provide quite a good visual quality due to the direction selection provided by DTCWT. Our method is better than method of [28] in smoothing the Lena and Pepper, which shows that the block thresholding is a good choice for preserving the smooth regions in denoising procedure. While comparing method of [29] with our method from the results on Baboon, Barbara, Lena and Peppers, we cannot easily say which of them is the winner.
Conclusions
This paper presents an image denoising method based on DTCWT and block thresholding. Since DTCWT can use six wavelets aligned along different directions to capture more image information than the classical wavelet transform, we first employ DTCWT on the noisy image to obtain complex coefficients with properties of approximate shift invariance and directional selection. After that, block thresholding is applied in shrinkage phase so as to fully consider the influence of neighbourhood wavelet coefficients at each subband, and the optimal block size and threshold can be empirically choosed by minimizing Stein’s unbiased risk estimate. Compared to conventional term by term or block thresholding methods, such as SureShrink, Bayesshrink and DenBlock, experimental results indicate that our method can achieve comparable better denoising effectiveness. Furthermore, our method outperforms most state-of-the-art denoising methods both in terms of signal-to-noise ratio and visual quality. Future works include combining statistical model into our proposed denoising method.
Footnotes
Acknowledgments
We are grateful to the referees for their valuable comments. This work is financially supported by the National Natural Science Foundation of China (61363050, 61272077, 61563037), the Natural Science Foundation of JiangXi Province (20142BDH80026), and the Ph.D. Programs Foundation of Nanchang Hangkong University (No. EA201504141).
