Abstract
BACKGROUND:
Low-dose computed tomography (CT) has been successful in reducing radiation exposure for patients. However, the use of reconstructions from sparse angle sampling in low-dose CT often leads to severe streak artifacts in the reconstructed images.
OBJECTIVE:
In order to address this issue and preserve image edge details, this study proposes an adaptive orthogonal directional total variation method with kernel regression.
METHODS:
The CT reconstructed images are initially processed through kernel regression to obtain the N-term Taylor series, which serves as a local representation of the regression function. By expanding the series to the second order, we obtain the desired estimate of the regression function and localized information on the first and second derivatives. To mitigate the noise impact on these derivatives, kernel regression is performed again to update the first and second derivatives. Subsequently, the original reconstructed image, its local approximation, and the updated derivatives are summed using a weighting scheme to derive the image used for calculating orientation information. For further removal of stripe artifacts, the study introduces the adaptive orthogonal directional total variation (AODTV) method, which denoises along both the edge direction and the normal direction, guided by the previously obtained orientation.
RESULTS:
Both simulation and real experiments have obtained good results. The results of two real experiments show that the proposed method has obtained PSNR values of 34.5408 dB and 29.4634 dB, which are 1.2392–5.9333 dB and 2.828–6.7995 dB higher than the contrast denoising algorithm, respectively, indicating that the proposed method has good denoising performance.
CONCLUSIONS:
The study demonstrates the effectiveness of the method in eliminating strip artifacts and preserving the fine details of the images.
Keywords
Introduction
In the field of medical imaging, the reconstruction of computed tomography (CT) images is a critical area of research. CT employs X-rays to create cross-sectional images of the body, which are useful for diagnosing a variety of health issues. The technique of CT imaging involves reconstructing an image by applying the Radon and inverse Radon transforms to data collected from the transmission of an X-ray beam. The quality of reconstructed CT images is influenced by various factors, including blurring, field of view (FOV), artifacts, and visual noise. An artifact refers to a distortion or inaccuracy in an image that does not accurately represent the object being imaged. While reducing the radiation dose in computed tomography (CT) scans lowers the risk of cancer, it is important to note that low-dose imaging is usually accompanied by the appearance of strip artifacts.
In the literature, numerous denoising techniques have been proposed for CT image denoising, which can generally be broadly classified into two categories: spatial domain filtering and transform domain filtering. Spatial filters, which can be broadly classified as linear and nonlinear filters, serve the purpose of noise reduction by directly filtering the original noisy image. To denoise the image in the spatial domain, linear filters such as the Gaussian filter [1] were initially used. However, these filters did not effectively preserve image details. To address this issue, nonlinear filters like the Wiener filter [2] and bilateral filter [3] were employed. In recent years, researchers have also explored the concept of least squares fidelity minimization [4] to remove Gaussian noise. Various regularization functions, including Tikhonov, anisotropic diffusion, and Total variation methods, have been investigated. The Tikhonov method [5] is the simplest approach using L2 norm for regularization, but it tends to oversmooth image details [6]. On the other hand, anisotropic diffusion-based methods [7] can enhance image details by preventing diffusion at the edges, but they may blur the edges. To overcome the smoothness issue in denoised images, Total variation (TV) based regularization was developed in parallel with anisotropic diffusion. However, TV techniques often encounter problems with stair-artifacts [8]. To address these limitations, the anisotropic total variation method [9] was introduced, combining the advantages of TV and anisotropic diffusion. Bayram et al. [10] proposed a directional TV method for image denoising, which is based on images with a single major direction and utilizes the directional derivative of the image. This method was further improved in [11], where it became spatially adaptive through pixel-specific angle estimates. Non-local means (NLM) [12] filters have also gained popularity in recent years for image denoising. The Wavelet Transform [13] is the most investigated transform in denoising. It has been widely utilized in scientific and engineering fields [14]. Among the various methods, the wavelet threshold atrophy method [15] has been extensively studied. Currently, BM3D [16] is considered the state-of-the-art denoising method, surpassing other existing methods in terms of denoising performance. Dictionary learning is a commonly used method for image denoising. It involves learning a sparse dictionary representation of an image to achieve denoising. An improved deep convolutional dictionary learning algorithm [17] was proposed for Low-Dose computed tomography image processing and denoising.
The EADTV method in [11] utilizes a directional parameter based on the spatial variation of image edge directions. It estimates the edge direction from either the noisy image itself or a noiseless reference image. In CT reconstruction images, obtaining a noiseless reference image for estimating image edge orientation is typically challenging. The edge direction was originally estimated using a noisy image that had been preliminarily smoothed with a Gaussian filter. To obtain a more accurate directional information, we propose using kernel regression to fit the original noisy image. The kernel regression method [18] is a non-parametric estimation method that aims to estimate the unknown regression function [19] based on observational data. It can be extensively applied in various types of image and noise processing, preserving image details while requiring minimal information about the image or noise distribution. Additionally, the EADTV model does not consider the specific characteristics of strip artifacts in CT images, it may mistakenly treat strip artifacts as image edges and preserve them instead of removing them. To address this issue, we have used the EADTV model to perform denoising along the edge direction and the normal direction.
Based on the discussion above, we propose a new CT denoising method called AODTV. This method involves performing kernel regression on the original image, which is reconstructed using the FBP algorithm [20]. The first step of AODTV is to approximate the image and its first and second order derivatives using kernel regression. These derivatives are then updated through further kernel regression. In order to create an image included more accurate directional information for the AODTV model, the reconstructed image, regression function estimate, and updated derivatives are combined using weights. The second step of AODTV is to denoise the reconstructed image along the orthogonal direction, namely along the edge direction and the normal direction. This approach effectively preserves edge details and reduces stripe artifacts. To evaluate the performance of our method, we compared it with the FBP algorithm and several established denoising algorithms, including Guided Image Filtering [22], the L0 smoothing algorithm [23], the WLS algorithm [24], the TV algorithm, the EADTV algorithm and the Truncated TV algorithm [25]. The comparison was conducted using both simulated and real data. The results demonstrate that our AODTV method outperforms the classical algorithms in terms of preserving details and eliminating artifacts.
With the wide application of CT imaging technology in clinical practice, low-dose CT imaging has received increasing attention in order to reduce the harm to the human body. However, the images obtained from low-dose CT reconstruction will be seriously disturbed by noise and produce strip artifacts. Therefore, it is necessary to suppress the noise in the image. EADTV is a denoising method that can handle images with multiple dominant orientations and it relies on a noisy reference image, which is pre-smoothed by Gaussian filtering, to estimate the edge orientation [11]. Gaussian filtering is a linear smoothing technique that assumes that the relationship between pixels and neighboring pixels is linear, which may not be accurate enough in real images, especially in complex texture regions or noise types that are not simple Gaussian, such as bone trabeculae, which have rich texture structure in medical images. Kernel regression is a non-parametric nonlinear method, which can build more complex mapping relations according to the training data, so as to better capture the local change trend and structure of the image, and provide more accurate recovery for noisy image. In theory, kernel regression can better control the prediction results by selecting the appropriate kernel function and bandwidth parameters, so as to better protect the image details and edge information to a certain extent. In order to obtain higher quality reference images and thus more accurately estimate the edge orientation of CT images, we introduced kernel regression. Since there is still noise in the information of the first and second derivatives of the fitted image obtained by kernel regression, we run kernel regression again on the fitted image. For multi-directional medical images, such as the reconstructed image of bone trabecular projection data used in our experiment, the EADTV method only denoises along the edge direction of the image, which is easy to identify the strip artifacts in the image as texture structure, resulting in incomplete denoising. Therefore, we use the EADTV method to denoise along the edge direction and its normal direction, so as to achieve better denoising effect.
The structure of this work is as follows: Section 2 provides a brief presentation of some basic theories and Section 3 introduces the AODTV method. The experiments and their results are reported in Section 4. Section 5 analyzes the influence of parameters on the experimental results. Finally, Section 6 discusses and concludes this work.
Related work
Classical kernel regression
The two-dimensional data measurement model is as follows [18]:
Where ∇ and ∇2 are the gradient (2 × 1) and Hessian (2 × 2) operators, respectively. β0 = z (x),
Then the parameters
The Gaussian function is commonly used as a kernel function due to its lower computational requirements compared to other kernel functions that satisfy the given conditions. Additionally, Equation (4) can be represented in a matrix form as a weighted least-squares optimization problem:
In 1992, Rudin, Osher, and Fatemi introduced the classical total variation image denoising algorithm [9]. This algorithm aims to restore a clean image from a noisy image by establishing a noise model and utilizing an optimization algorithm solution module. Through continuous iteration, the restored image gradually approaches the ideal denoised image. The total variation model minimizes the following function:
Diffusion along the direction θ is enhanced by the DTV model. When the dominant direction in the image aligns with θ, the DTV model enhances the dominant structure. Conversely, it destroys the structure when the dominant direction does not align with θ. In cases where there are multiple dominant directions in the image, it becomes necessary to spatially vary the parameters θ throughout the image. In [11], spatial variation parameters θ (x, y) based on image edge orientation are proposed. The edge direction can be calculated using the following formula:
In the experimental section, the Guided Image Filtering, the L0 smoothing algorithm, the WLS algorithm, the TV algorithm, the EADTV algorithm and the Truncated TV algorithm are used as a comparison of AODTV method for image denoising. For the AODTV method, there are mainly four groups of parameters: The kernel size ksize1 and ksize2 are the parameters controlling the support region size of the kernel, which will affect the smoothness of the kernel regression and the fitting effect on the data. The weight parameter t1 and t2 balance the kernel regression results and the reconstructed image. The regularization parameters of the EADTV regular term are expressed as λ1 and λ2 in the model. The long axis lengths of the ellipses along the image edge direction and normal direction are denoted by α1 and α2. In the Guided Image Filtering algorithm, the regularization parameter is denoted by ɛ, the local window radius is denoted by r and the subsampling ratio is denoted by s. In the L0 smoothing algorithm, λ is the smoothing parameter controlling the degree of smooth, κ is a parameter that controls the rate [23], a smaller value of κ leads to more iterations and sharper edges. In the WLS algorithm, λ is the balance parameter between the data term and the smoothing term. Increasing λ will produce smoother images. α controls the affinities to a certain degree by nonlinear scaling the gradient. Increasing α will result in sharper preserved edges. In the TV algorithm, λ is the smoothness parameter. In the EADTV algorithm, λ is the regularization parameter, α represents the length of the major axis of the ellipse, and varargin is number of iterations. In the Truncated TV algorithm, ɛ is the threshold parameter in the model, α is the regularization parameter of the truncated TV term and λ is the parameter introduced in the iterative solution of each sub-problem in the model.
AODTV method
In general, the AODTV method has two steps:
Specific progress is as follows: Perform kernel regression again on the first-order and second-order differential information β11, β12, β21, β22, β23 in Equation (7) obtained by kernel regression of the FBP reconstructed image y. Their corresponding regression functions are denoted as z
x
(x
i
), z
y
(x
i
), z
xx
(x
i
), z
xy
(x
i
), z
yy
(x
i
). Finally, the updated differential information is denoted as
To be specific, the edge direction (n1 (x, y) , n2 (x, y)) based on the image
The spatial variation parameters θ1 (x, y) can be calculated as follow:
Since the penalty is only applied along the edge direction of the image at this time, the details are preserved, but the image still has strip artifacts in the normal direction. Therefore, take f* as the new input image and determine the spatial variation parameters θ2 (x, y) based on the normal direction (t1 (x, y) , t2 (x, y)) of image
f** is the final image result obtained by the AODTV method. The solution method for this model can be found in [10].
In this section, both simulation and real data are images with texture details. The choice of experimental parameters is a compromise between removing noise and preserving detail. The parameters were selected manually through a large number of experiments. Mean square error (MSE), peak signal-to-noise ratio (PSNR), mean structural similarity (MSSIM) [26] and Gradient Magnitude Similarity Deviation (GMSD) [27] are utilized for quantitative evaluation of the denoised images.
Mean Square Error (MSE) is the squared difference between the true and predicted values and then summed and averaged. The calculation formula is as follows:
PSNR is the most common and widely used objective image evaluation index, which is based on the error between the corresponding pixels, that is, error-sensitive image quality evaluation. The unit is dB, with higher values indicating less distortion. The calculation formula is as follows:
B is the number of bits per pixel, which is generally taken as 8, (2 B - 1) 2 represents the maximum possible pixel value of the picture.
Mean Structural Similarity (MSSIM), The sliding window was used to divide the image into N blocks, and the mean, variance and covariance of each window were weighted, the weight w
ij
satisfies ∑
i
∑
j
w
ij
= 1. Usually a Gaussian kernel is used, Then the structural similarity SSIM of the corresponding block was calculated, and finally the average value was used as the structural similarity measure of the two images. SSIM measures image similarity from brightness, contrast and structure:
The MSSIM is calculated as:
GMSD provides accuracy by estimating pixel-level gradient similarity. GMSD values are between 0 and 1, with closer to 0 indicating better results. By utilizing the Prewitt filter to calculate the gradient, the Prewitt filters along horizontal (x) and vertical (y) directions are defined as:
By convolving the image with the above operators, the gradient magnitudes of the image in the horizontal and vertical directions can be obtained, denoted as
For the simulation study, we utilized a medical image along with the corresponding region of interest (ROI) depicted as red rectangle. In order to facilitate visual comparison, we zoomed in on the ROI and included some detailed information, as shown in Fig. 1. Poisson noise was added to the projection data of the simulation image, and the reconstructed image was obtained by sampling at intervals of every four and eight angles within the range of 0 to π, respectively. The results obtained from the AODTV method and other contrast methods on the FBP reconstructed images are presented in Figs. 2 and 3.

Simulation data. (a) Original simulation image. (b) Local magnification of the simulation image.

(a) The FBP reconstructed result obtained by sampling simulated image projection data, with Poisson noise added, at intervals of every four angles within the range of 0 to π. (b)-(h) Performance of different denoising methods in this experiment. (b) Guided Image Filtering. (c) L0 smoothing. (d) WLS. (e) TV. (f) EADTV. (g) Truncated TV. (h) AODTV.

(a) The FBP reconstructed result obtained by sampling simulated image projection data, with Poisson noise added, at intervals of every eight angles within the range of 0 to π. (b)-(h) Performance of different denoising methods in this experiment. (b) Guided Image Filtering. (c) L0 smoothing. (d) WLS. (e) TV. (f) EADTV. (g) Truncated TV. (h) AODTV.
For the first simulation, Fig. 2(a) shows the image reconstructed by the FBP algorithm, which is the result of sampling the projection data of the simulated image by 4 angles at intervals within the sampling range of 0 to π after adding Poisson noise. The optimal values for all parameters, as presented in Table 1, are chosen to attain the most effective denoising outcome for each denoising algorithm. The denoising effect of Guided Image Filtering and Truncated TV in Fig. 2(b) and (g) is similar, with some noise remaining in the image and the details not being clearly preserved in the locally enlarged area of Fig. 2 (b). Figure 2 (c) and (d) demonstrate that there is still considerable noise in the ROI, and the texture structure appears unclear after L0 smoothing and WLS denoising. The result of TV in Fig. 2 (e) does not significantly differ from those in Fig. 2 (f). It can be noted that the image in the ROI retains detailed information after EADTV denoising, however, there still exist some residual noise in Fig. 2 (f). The AODTV method exhibits the best preservation of details and denoising effect in Fig. 2 (h). The quantitative results are presented in Table 2, where it is evident that the AODTV method produces a noise-reduced image with higher PSNR and MSSIM, and lower MSE.
The best parameter values of different methods in the first simulation
Evaluation results of different denoising methods for the first simulation
For the second simulation, Fig. 3(a) displays that the FBP reconstruction result is obtained by sampling the simulated image projection data with Poisson noise added, at intervals of every eight angles within the range of 0 to π. Table 3 presents the optimal parameter values for each denoising algorithm. Upon examining the resulting images, it is evident that in the local amplification area of Fig. 3 (b), a significant reduction in noise can be observed, but the details are not very clear after Guided Image Filtering denoising. Figure 3 (c) demonstrates that the L0 smoothing technique preserves image details while still retaining a considerable amount of noise. On the other hand, the denoising effect of the WLS method, as shown in Fig. 3 (d), is poor. However, when we examine the local magnification areas in Fig. 3 (e), (f), and (g), it becomes apparent that the TV, EADTV, and Truncated TV methods yield better denoising results, with relatively smooth image details. Figure 3 (h) illustrates that the AODTV method allows for the preservation of texture details in the image, resulting in relatively clear detailed edges and a more noticeable noise removal effect. Table 4 indicates that the MSE, PSNR, and MSSIM values of the AODTV method surpass those of the other methods. This indicates that the AODTV method possesses the ability to remove noise while preserving image details.
The best parameter values of different denoising methods in the second simulation
Evaluation results of different denoising methods for the second simulation
In the real experiment, the full angle reconstructed image of the trabecular bone projection data and its local magnified area (indicated by the red rectangle) are presented in the Fig. 4. The projection data was sampled at intervals of six and fifteen angles within a sampling range from 0 to π. These data were then processed using the AODTV method and compared with other denoising methods as shown in the Figs. 5 and 6.

Real data. (a) The full angle reconstructed image of the trabecular bone projection data. (b) The local magnified area of (a).

(a) The reconstruction result of trabecular bone projection data sampled every six angles in the sampling range from 0 to π. (b)-(h) Comparison of noise reduction results for the first real experiment. (b) Guided Image Filtering. (c) L0 smoothing. (d) WLS. (e) TV. (f) EADTV. (g) Truncated TV. (h) AODTV.

(a) The reconstruction result of trabecular bone projection data sampled every fifteen angles in the sampling range from 0 to π. (b)-(h) Comparison of noise reduction results for the second experiment. (b) Guided Image Filtering. (c) L0 smoothing. (d) WLS. (e) TV. (f) EADTV. (g) Truncated TV. (h) AODTV.
In Fig. 5(a), the reconstruction of the projection data at intervals of six angles within the sampling range from 0 to π resulted in noticeable strip artifacts. In the partial enlargement of the image in Fig. 5 (c), it can be seen that after processing by the L0 smoothing method, the strip artifacts and details of the image are smoothed, and part of the image information is lost. In Fig. 5 (d) and (g), the image edge after denoising by WLS and Truncated TV are blurred. Although the edge of Fig. 5 (b) is distinct, the artifact removal is not entirely satisfactory. In Fig. 5 (e) and (f), EADTV and TV have good artifact removal effect but the details are not clear. In Fig. 5 (h), the image artifacts after denoising by the AODTV method are effectively removed, while the texture structure of the image is maintained. The optimal parameter values for each algorithm are presented in Table 5. The quantitative results can be found in Table 6. The evaluation index of the AODTV method demonstrates a significantly superior performance compared to other methods.
The best parameter values of different methods in the first real experiment
Evaluation results of different denoising methods in Fig. 5(a)
In the second real experiment, an increase in strip artifacts was observed in the reconstruction results due to the sampling interval of the projection data being more angles, as shown in Fig. 6(a). The denoising algorithms also demonstrated a more intuitive effect. The L0 smoothing in Fig. 6 (c) resulted in a highly blurred texture of the image, with significant loss of details. WLS in Fig. 6 (d) exhibited the poorest ability in denoising and preserving image edges. The strip artifacts of the image were relatively well removed in Fig. 6 (e) and Fig. 6 (f), but the edges appeared very blurred after TV and EADTV denoising. The effects of Guided Image Filtering and Truncated TV were similar in Fig. 6 (b) and (g). Figure 6 (h) demonstrates that the AODTV method effectively balances the preservation of edge details and the removal of strip artifacts, yielding the best results. The optimal parameter values for each algorithm are presented in Fig. 6. Quantitative results in Table 8 clearly indicate that the AODTV method outperforms other denoising algorithms.
The best parameter values of different methods in the second real experiment
Evaluation results of different denoising methods in Fig. 6(a)
In this subsection, the analysis is based on the first real experiment. We present the effect of the kernel size ksize2 at the second kernel regression, α2 in Equation (26), λ1 and λ2 in Equations (23), (26), and weight parameter t1 and t2 in Equation (20). The reason for choosing to analyze these parameters was that it was found through a large number of experiments that they had a large impact on the experimental results, while adjusting the remaining parameters ksize1 and α1 had little impact on the results. The analysis results are shown in Fig. 7. In order to determine the optimal value of a certain parameter, different parameter values are adjusted while keeping the other parameters fixed. Three well-known image quality metrics, PSNR, MSE and MSSIM, are used to analyze the influence of the parameters. It can be seen from Fig. 7 (a) that PSNR value is basically flat and has a downward trend after α2 is taken as 20, while MSE changes from the original downward trend to a gradual increase after α2 is taken as 20, which indicates the optimal value of major axis α2 of the ellipse is 20. In Fig. 7 (b), when ksize2 takes 1, the PSNR reaches the maximum and the MSE reaches the minimum, so the optimal value of the ksize2 is 1. Figure 7 (c) shows that when λ1 is larger than 3, the MSE value decreases and the PSNR value increases gradually. So the optimal value of λ1 is 3. In Fig. 7 (d), it can be observed that when λ2 is set to 0.012, MSE curve remained flat up to a certain distance and then started to increase. On the other hand, when λ2 is set to 0.012, the PSNR curve remained flat up to a certain distance and then gradually decreased. These findings suggest that the optimal value of λ2 is 0.012. It can be seen from the Fig. 7 (e) and 7 (f) that when the weight parameters t1 is equal to 0.3 and t2 is equal to 0.7, PSNR is maximum and MSE is minimum; accordingly, MSSIM also reaches maximum at this time, indicating the optimal values of t1 and t2 are 0.3 and 0.7, respectively. The optimal values of the parameters are shown in Table 5.

The effect of parameters (a) The corresponding MSE and PSNR values for different. (b) The corresponding MSE and PSNR values for different. (c) The corresponding MSE and PSNR values for different. (d) The corresponding MSE and PSNR values for different. (e) The corresponding PSNR and RMSE values for different weight parameter. (f) The corresponding MSSIM values for different weight parameter.
It can be seen from the parameter sensitivity analysis that the disturbance of parameters α2, λ1 and λ2 has a significant impact on the experimental results. However, the adjustment of parameters ksize2 within a certain range leads to little change in the evaluation index. Therefore, for trabecular bone, we give the ranges of these parameters, which can obtain reasonable CT images stably. The values of α2, λ1 and λ2 range from [15–21], [2–4], and [0.011–0.013]. For the weight parameters t1 and t2, we should try to maintain t2 a larger proportion, but not equal to 1.
In this work, we propose a new denoising method that can achieve good denoising effect while preserving the structure and details of the CT images. Although the experimental results are good, there are limitations. The proposed method involves many parameters, and adjusting so many usage parameters is not convenient for clinical use. According to the sensitivity of the parameter, we can take the value of the parameter in the appropriate range, so as to obtain a good denoising effect. When the traditional denoising method removes the noise, the details of CT image are also lost. In the process of denoising, the proposed method can more accurately estimate the edge direction of the image and denoise along the orthogonal direction, which can better retain the details of CT image when removing strip artifacts and noise. AI models offer advantages in performance, making them popular for low-dose and low-sampling CT imaging. Various deep learning-based image denoising methods exist, including convolutional neural network denoising methods like N2N [28], FFDNET [29], residual network denoising methods like FC-AIDE [30], and generative adversary-network denoising methods like GCDN [31]. These deep learning methods can autonomously extract shallow pixel level features and deep semantic level features quickly, demonstrating strong representation learning ability and effective denoising effects. Current research in deep learning and machine learning explores combining diffusion models with regularization models to enhance model performance and generalization [32, 33]. We can also try to combine it with deep learning, which is also a problem we need to study further.
In this innovative study, Kernel regression is introduced into EADTV to obtain a higher quality reference image, and the CT image is denoised along the orthogonal direction. The results demonstrate that the AODTV method significantly outperforms other denoising algorithms. This method focuses on eliminating strip artifacts while retaining the intricate details in the image. It achieves this by carefully balancing the penalties applied in both the edge direction and the normal direction of the reconstructed image. However, determining the precise proportion of penalty in the orthogonal direction requires additional research.
Footnotes
Acknowledgments
This work was supported by the National Natural Science Foundation of China under Grants Nos. 82371960, 82102037, and 82071922 and Tianjin Municipal Education Commission (2020KJ208).
