Abstract
To solve the shortcomings of traditional guided image filtering (GIF) in edge preservation and denoising performance, this study describes a novel generalized guided image filtering method, which integrates an artificial swarm optimization algorithm. A locally adaptive weighting based on monogenic phase congruency and chaotic swarm optimization is used to produce a more robust method. Since the fixed regularization parameter cannot adapt to the grayscale difference between flat and edge patches, the box filter radius and regularization parameter of guided image filtering have significant influences on image-denoising effects. The chaotic swarm optimization algorithm, which is an improved optimization algorithm with a self-adapting search space, is adopted to find their optimal values for the best denoising effects. Compared with traditional guided image filtering for image denoising and other state-of-the-art methods with image quality as a performance metric, experimental results showed that the proposed denoising algorithm can not only remove noise efficiently and reduce halo artifacts, but can also preserve the edge texture well.
Keywords
Introduction
Digital images are often subjected to noise in image acquisition, transmission, or storage. For this reason, the presence of noise usually decreases the accuracy of various applications in image processing; therefore, image denoising is one of the important steps in image processing and analysis. A wide range of techniques have been developed to improve the quality of images by human vision in image restoration-enhancement applications. However, denoising is an ill-conditioned problem. Noise reduction and edge preservation are the two main metrics for measuring the effectiveness image-denoising algorithms. Creating an algorithm that simultaneously performs well for both remains a challenge.
Many image-denoising methods have been proposed in [1–3]. Generally, there are two types of edge-preserving, image-smoothing techniques to reconstruct images via spatial or frequency domain. The algorithms with Gaussian filters have a good smoothing effect for Gaussian noise, but image structure and edge are severely destroyed. On the other hand, algorithms with anisotropic diffusion filters excel at preserving edge; however, halo artifacts emerge.
Recently, a novel filtering method denoted guided image filtering (GIF) was proposed [4]. GIF is a both an effective and efficient linear translation-invariant filter for image denoising, and it fulfills the requirements of edge preservation and low complexity for noise reduction by means of a guidance image. It is an edge-preserving smoothing algorithm for simultaneous sharpening and denoising. Owing to GIF’s superior performance, it has been applied to the fields of computational photography and image processing. Typical examples include image sharpening [5], image fusion [6], and single image haze removal [7]. Unlike the popular bilateral filter [8], independent of window radius and intensity range, GIF is one of the fastest edge-preserving filtering methods for image denoising. In a general way, a noisy input image is also used as the reference image for GIF’s method of weight calculation, which can transfer the structures of the guidance image to the filtering output. Unfortunately, it is possible for the structure of the noisy image to be lost, which causes low accuracy in the denoising output and suffers from halo artifacts.
In order to address the above problems, an edge-aware weighting using local variances of all pixels in a 3×3 window was used in Ref. [9] to penalize the fixed regularization parameter to produce a more robust method. A gradient-domain guided image filter was proposed by Fei et al. [10] using an explicit first-order, edge-aware constraint to prevent the loss of texture and edge information. However, local variances and a first-order gradient in image noise cannot objectively represent the information of edge and texture area that changes greatly and that is disturbed by noise. The edge-aware weighting factor using local variances or first-order gradients in image noise is inaccurate. Therefore, it is sensible to seek a method that is efficient in precision edge detection and noise suppression to penalize the fixed regularization parameter.
Swarm intelligence methods comprise an area of bionic, intelligent technology that is seeing wide use in image processing and that has attracted the attention of researchers [29–33]. With the development of many methods using a swarm intelligence algorithm, image processing, is more effective; for example multiwavelet image denoising based on the artificial bee colony algorithm [28]. This type of method mainly includes the particle swarm optimization (PSO) algorithm, the fish Swarm optimization, ant colony optimization algorithms, the swarm algorithm, etc. In recent years, the artificial colony algorithm simulates the process of honey bees using local optimization and one-worker individual behavior, and highlights the global optimal value. The artificial swarm optimization algorithm has the advantages of high precision and fast search convergence, which can effectively avoid local minima problems. As a result, the artificial colony algorithm can be used for parameter selections in guided image filtering for further optimization.
Phase congruency measures the significance of feature invariance to contrast, such as edge and corner features. However, an impediment to applying phase congruency to the detection of image features has been its sensitivity to noise. Monogenic phase congruency, which is an improved version of phase congruency that overcomes noise sensitivity, was introduced to construct the structure similarity feature in Ref. [11]. As the regularization parameter of GIF uses the fixed value regardless of the image structure, halo artifacts appear in the image’s edge patch. A novel weighted guided image filtering algorithm using edge-aware weighting by monogenic phase congruency that takes edge and texture information into consideration is proposed in this paper. The proposed filter also has a linear computational cost, which is the same as that of GIF [4]. Experimental results for image denoising show that the proposed denoising algorithm has visual quality and removes noise efficiently.
The traditional GIF result relies on the choice of parameters when GIF is used for image denoising. In order to reduce difficulties in parameter selection, the chaotic swarm optimization algorithm is adopted to adaptively find the optimal parameter values.
The chaotic swarm optimization algorithm is an efficient, simple, and versatile optimization algorithm. It has already been applied to many of the optimization problems in science and engineering, e.g., individualized modeling, project scheduling, optimization of oil- and gas-field development planning, and image segmentation for determining thresholds.
The rest of this paper is organized as follows. Related works and techniques on the GIF are summarized in Section 2. Section 3 includes details on adaptive weighted (AW) GIF using monogenic phase congruency and parameter selections optimization with artificial colony algorithm. Experimental results and analysis of AWGIF are given in Section 4. Concluding remarks are provided in Section 5.
Related works on GIF
The guided image filter discussed in Ref. [4] uses a linear transform of the guidance image to preserve preferable edges independent of the filter radius and the range of gray values, and it outperforms the bilateral filter [8]. Guided by different guided images, GIF has been widely applied to the fields of image processing and computer vision, including enhancing image sharpness without noise amplification, high-dynamic-range compression, image matting or feathering, dehazing, and image fusion.
The guided filter is a general linear translation-variant filter. It is assumed that the output image is a linear transform of a guidance image and a filtering input image. The filtering output at a pixel i is given as
It is assumed that is a linear transform of G
k
in a window w
i
centered at a pixel i. A linear transform of G in the window w
i
is expressed by
Obtaining two coefficients a
i
and b
i
requires a solution that minimizes the following cost function E (a
i
, b
i
) in the window w
i
using the linear ridge regression model:
Here, N is the total number of pixels in the window w
i
, and λ is a regularization parameter penalizing a large a
i
. The solution of Equation (3) is given by
For the same reason, it was proven in Ref. [4] that GIF is also a weighted averaging filter, and the weighting kernel function W ij can be explicitly written as
The guided filter approach has an edge-preserving smoothing capability and low computational complexity. It has been widely applied to enhance the sharpness of, and reduce the noise of, blurred, noisy images [12], in real-time local stereo matching [13], and in interactive computation of global illumination [14].
It is well known that the guided-filter kernel weights can flexibly recognize underlying geometric structures in accordance with the performance of the guide image. Unfortunately, although GIF has numerous advantages for computer vision and graphics applications, the fixed regularization global smoothing parameter λ cannot adapt to the grayscale difference between flat and edge patches. Specifically, it performs poorly in image denoising of low signal-to-noise (SNR) images. It is difficult for it to find the noiseless image of the input image as the guide image, and it also lacks the ability to select optimal; parameters, which seriously affects its practicality. In general, the guide image in GIF for image denoising is identical to the input image. Therefore, the performance of the restored image is seriously affected.
To overcome the flaws of the GIF parameters and obtain an excellent result, integrating the shift-variant technique and a Laplacian of Gaussian (LOG) filter response output for pixel classification into the guided filter results was proposed in Ref. [12] to avoid halo artifacts or noise amplification.
In Ref. [4], the regularization parameter λ is a constant, and thus halo artifacts near the edge are caused without distinguishing differences in the image structure. To solve this problem, in Ref. [9] the regularization parameter of weighted GIF (WGIF) is defined by another function that is an edge-aware weighting. The regularization parameter λ (i) = λ/Γ
G
(i) is defined by a weighted factor using local variances of all pixels in the 3×3 window, which is expressed by
Recently, gradient-domain GIF was proposed in Ref. [17] that uses an edge-aware weighting-defining variable window to measure the importance of some pixels with respect to the entire guidance image. Through a reasonable analysis of the filter’s window size, the proposed filter handles images with better visual appearance than the existing guided-filter-based algorithms, especially around edges. However, the definition of the filter’s window size for different images is inextricable.
In the next section, a novel adaptive weighted GIF, which includes an explicit monogenic phase congruency, weighted for an edge-aware constraint to remove noise while preserving the detail information, is introduced. The new constraint using monogenic phase congruency can preferably detect the edge and be seamlessly integrated into the GIF. The monogenic phase congruency described in Ref. [11] not only captures the edge exactly, but also detects perceptually significant image features.
In this section, a novel measure of phase congruency called monogenic phase congruency, which uses the monogenic signal to construct phase congruency is first introduced, followed by proposal of a new edge-aware weighting that is incorporated into GIF in Ref. [4] to form AWGIF. The optimal parameters using artificial swarm optimization are approached in parallel via the division of labor, cooperation, and information sharing of employed bees, onlookers, and scouts.
Monogenic phase congruency
Phase congruency (PC) is a perceptually significant image-feature-detection method and reflects the behavior of the image in the frequency domain. It is applied to construct an edge-detection method that is particularly robust against changes in illumination and contrast. The phases in the frequency domain have maximal congruency at the edges, which corresponds to the human-perceived edges in an image where there are sharp changes between light and dark areas.
The monogenic signal was introduced in Ref. [16] and is considered to be a multidimensional extension of the Riesz transform. The monogenic signal
Monogenic phase congruency (MPC) can be expressed as
,γ is a gain factor for the sharpness of the cutoff, s is the cutoff value of the filter response spread, and c (x) is a fractional measure of spread. The value of c (x) is obtained by taking the sum of the amplitudes of the responses and dividing by the highest individual response; namely, c (x) = A′ (x)/(N * (Amax (x) + ɛ)), where N is the total number of scales, and ɛ is a gain factor approximately from 1 to 2. T compensates for the influence of noise, and is set at a fixed threshold according to empirical estimation.
MPC is fast and possesses greatly reduced memory requirements compared to the other phase-congruency model. Using T to compensate for the influence of noise, the MPC response is not sensitive to noise. From Fig. 1, it is not difficult to find that MPC correlates well with the human vision system (HVS) and provides good localization of features similar to human vision perception. The image called Harbour in Fig. 1(a) is from the LIVE image database and the Harbour image in Fig. 1(d) is from the A57 image database. Therefore, edge detection by MPC is a good choice for edge-aware weighting.
Instead of local variance, a new edge-aware weighting using monogenic-phase-congruency-based edge-detection methods is proposed. The proposed edge-aware weighting is defined by using the monogenic phase congruency of 3×3 windows of all pixels as follows:
Artificial swarm optimization is an optimization algorithm based on the intelligent foraging behavior of honey bee swarms, proposed by Karaboga in 2005. Since the box filter radius r and regularization parameter λ have a significant impact on image denoising performance, in order to make GIF achieve its best effect, a chaotic swarm optimization algorithm is introduced to the box filter radius and the regularization-parameter optimization. Based on the product of the local contrast and sharpness as the fitness function for evaluation, the optimal parameter values are obtained by use of an improved chaotic mutation swarm optimization algorithm.
The fitness function F is defined by
An artificial swarm optimization algorithm consists of three groups of bees; namely, employed bees, onlookers, and scouts. In other words, the number of employed bees in the colony is equal to the number of food sources around the hive. The optimum GIF parameters are found by searching for the best food source location via loop iteration.
The complete steps of the algorithm are as follows.
In this section, in order to test and compare the performance of our proposed algorithm with other algorithms in the literature, we conducted experiments on 10 512×512-pixel images from Ref. [18] that are widely available, such as Baboon, Barbara, Peppers, House, etc. The noise we use is normal, with standard deviation σ = 10, 15, 20, 25, 30, 35, and 40.
Image-quality evaluation
For the comparison we used both qualitative and quantitative evaluations to compare our proposed AWGIF method with other methods. Quantitative evaluations were performed using structural similarity (SSIM) [19] and peak signal-to-noise ratio (PSNR) for full-reference cases in which original high-quality images on large benchmark datasets are available. SSIM is a new criterion of image-quality evaluation that considers the image-structure information. It is easy to calculate and applicable to various image-processing applications. The image-quality index of SSIM is defined as
We compared our proposed AW GIF to the other five different filtering methods: the non-local means algorithm (NLM) [1], median filtering (MF) [21], bilateral filtering (BF) [22], the original GIF [4], and gradient-domain GIF (GD GIF) [17]. GD GIF is representative of the improved version of GIF and gives excellent performance. All algorithms in the experiments were implemented using MATLAB 2010b software, and all the simulations were carried out on an Intel Core 2 Duo T5850 processor with a frequency of 2.16 GHz and 4 GB of DDRII memory running the Windows 7 operating system.
The performance of different algorithms were measured quantitatively using PSNR and SSIM. Figures 2 and 3 list the denoising performance of the different methods for comparison.
First, the original GIF, our proposed GIF, and GDGIF produced outputs of reasonable quality for images with low noise levels from PSNR and SSIM indexes. However, when the noise level increased, the performance of both the original GIF and GDGIF deteriorated drastically. GD GIF performed better than the original GIF because of its better edge-preserving capability. Our proposed GIF did not suffer from noise amplification, and its performance was better than that of the original GIF and GDGIF when the noise level increased. Our proposed GIF outputs were comparable to those obtained using BF and superior to those obtained using any of the other algorithms. Although GDGIF produced images with slightly higher PSNR indexes than our proposed GIF did, our proposed GIF outperformed GDGIF in terms of SSIM indexes. We will discuss these characteristics in more detail from the perspectives of detail comparison and execution time.
From the detail shown in Fig. 4, it is seen that the result of our proposed GIF is better than that achieved using MF and the NLM algorithm. There are more edges in the detail region of our proposed GIF than in the others.
Finally, we compare execution times. Comparison of the computation times of our proposed GIF, GDGIF, the NLM algorithm, and BF are shown in Table 1 using three images, each of different size. The table shows that the processing times of our proposed GIF are very low. This is due to the measure of phase congruency based on the monogenic signal and the reduction of orientation and the noise threshold calculation, and the avoidance of dot and cross products. The execution time for phase congruency of our proposed GIF is lower than that of GDGIF for edge detection. BF is fast, however, as the image size increases; the execution time of BF is significantly longer than GIF.
Conclusions
A novel denoising algorithm based on the guided image filter is proposed. The novel weighted guided image filter based on monogenic signal theory and a chaotic swarm optimization algorithm is introduced in this paper. The adaptive regularization parameter of GIF can adapt to the grayscale differences between flat and edge patches to avoid halo artifacts for image denoising. The chaotic swarm optimization algorithm is adopted to find the optimal values for the best denoising effects in the proposed method. The usefulness of chaotic swarm algorithm optimizing parameters can effectively remove Gaussian white noise, and provide a good denoising effect. The improved GIF version can not only can keep the image edge and texture, but can also improve the adaptability of the algorithm in dealing with different images, as it chooses the optimal parameters automatically to achieve the best effect. Compared to the existing state-of-the-art approaches, the proposed method exhibited improved performance in terms of PSNR and SSIM indexes. The method also demonstrated better generalization performance and lower execution time than its nearest competitors.
Footnotes
Acknowledgments
This work is partially supported by the doctoral development Foundation of Panzhihua University and scientific Foundation of department of education of sichuan province under grant No. 15ZB0425.
