Abstract
Inspired by the compressed sensing (CS) theory, introducing priori information of sparse image into sparse-view reconstruction algorithm of computed tomography (CT) can improve image quality. In recent years, as a special case of CS, total variation (TV) reconstruction algorithm that uses both image sparsity and prior information of edge direction have attracted much research interest in sparse-view image reconstruction due to its ability to preserve image edges. In this paper, we propose a new adaptive-weighted total variation (NAWTV) algorithm for CT image reconstruction, which is derived by considering local gradient direction continuity and the anisotropic edge property. The anisotropic edge property is used to consolidate the image sparsity, where the associated weights are expressed as a combination of exponential and cosine function. The weights can also be adjusted adaptively according the local image intensity gradient. The NAWTV algorithm is numerically implemented with gradient descent method. The typical Shepp-Logan phantom and FORBILD head phantom are employed to perform image reconstruction simulation. To evaluate performance of NAWTV algorithm, we compared it with TV and AwTV reconstruction algorithms in experiments. Numerical experimental results verified the effectiveness and feasibility of the proposed algorithm. Comparison results also showed that the NAWTV algorithm achieved a satisfactory performance in suppressing artifacts and preserving the edge structure details information of the reconstructed image.
Introduction
X-ray computed tomography (CT) is one of the widely used technologies in the field of medical imaging. However, the extra radiation dose in clinical examination can increase the health risk, which may affect the physiological function of human body and increase the chance of carcinogenesis, especially for children [1]. Therefore, it is very important to control radiation dose in clinical examination. It is well known that both lowering the x-ray intensity and reducing the number of projections when the patient is scanned to get the projection data are effective ways to reduce radiation dose [2–4]. When the intensity of the X-ray is reduced by adjusting the parameters of the equipment, the acquisition projection data will unavoidably increase the unstable noise, which causes the reconstructed image quality to fall [5]. The acquisition of reducing the number of projections is a problem of the incomplete projection data reconstruction of the under-sampled, which is an ill-posed inverse problem in mathematical theory [6].
When the analytic algorithms such as filtered back-projection (FBP) [7] is used for under-sampled sparse projection reconstruction, image quality can be greatly affected by the aliasing artifacts due to insufficient projection information [6, 8]. However, the application of traditional iterative algorithms such as algebraic reconstruction technique (ART) [9] and the maximum likelihood expectation maximization (MLEM) [10] will not be disturbed by aliasing artifacts. The iterative algorithm can reconstruct a clearer image, but the reconstruction image will also show some unnecessary artifacts which can’t meet the clinical needs [11]. In recent years, the development of compressed sensing (CS) theory [12–14] has provided new ideas for solving this problem. The key point of CS is that CT images can be accurately reconstructed by mining its L1 norm or the L1 norm of its sparse transformation from under-sampled projection if the original image is sparse or can find a suitable transform domain to make it sparse [15]. The sparse transform commonly used in the image domain mainly includes discrete gradient transform [12], wavelet transform [16], curvelet transform [17] and shearlet transform [18] et al. In CT reconstruction field, the iterative reconstruction algorithm based on CS employs the total variation (TV, i.e. the L1 norm of image gradient) [19] as regularization item in constructing the optimization problems. It has been widely studied in the reconstruction from under-sampled projection data [20–22].
In 2006, Sidky et al. [23] proposed the TV-POCS (projection onto convex sets) reconstruction algorithm, which shows that this kind of algorithm can reconstruct the CT image in the sparse-view projection data. However, because of the piecewise constant assumption for the image, the edges of reconstructed images tend to be frequently over-smoothed and sometimes produce staircase effects, which leads to the loss of low-contrast information. In order to mitigate this drawback, some scholars have improved this algorithm and many related algorithms have been put forward. These efficient and accurate algorithms have been widely investigated: 1) considering the anisotropic edge property of image by introducing penalty weights [25–27]; 2) increasing more constraint on TV regular terms [28–30]; and 3) changing the order of TV, considering the first order difference operator and applying other order difference operators instead of using any fixed combination [31–33]. These algorithms have achieved a good performance in suppressing artifacts and preserving the edge structure details information of the reconstructed image.
In this paper, considering both local gradient direction continuity information and the anisotropic edge property of an image, we propose a new adaptive-weighted TV reconstruction algorithm (NAWTV). The rest of this paper is organized as follows. In Section 2, the imaging model, TV model and the process of building NAWTV model for CT image reconstruction is presented. In Section 3, the proposed algorithm is applied to solve sparse-view problems with numerical simulation experiments. In Section 4, we conclude the paper and some related issues discussion is presented.
Method
CT imaging model
The CT imaging system is usually described by linear equations. The discrete form of the reconstruction model can be expressed as:
Where f denotes the image to be reconstructed, p is the projection data, W is the system matrix which associates the image with the projection data. The CT reconstruction problem can be expressed as a solution to the unknown image f based on the system matrix W and projection matrix p. The traditional iterative reconstruction algorithms such as ART and MLEM can reconstruct an image with under-sampled or limited projection data, but fail to meet the clinical needs. However, the image quality can be further improved by introducing a priori knowledge to regularized constraints in the reconstruction model [35].
For most natural images are non-sparse, but if we can find a suitable sparse transform domain and transform it, we can take the sparsely transformed signal as sparse. Rapid variations often occur at the edges of some structures in medical images. Therefore, the L1 norm of image gradient is often used in sparse-view image reconstruction. As mentioned in Section 1, the objective function of the TV reconstruction algorithm can be expressed as:
Where s and t represent the coordinates of the current pixel. The constraint term is used to constrain the consistency between the measured projection data and the reconstructed model data. The TV reconstruction algorithm can effectively use the information of image sparsity and edge direction, and can reconstruct the image with high quality under the sparse-view projection data. However, this algorithm can’t fully adjust the local information of the image, which lead to the edges of the reconstructed image tend to be over-smoothed.
To further improve the performance of the TV reconstruction algorithm, we propose a new adaptive-weighted TV reconstruction algorithm (NAWTV) for sparse-view CT image reconstruction. This algorithm enhances the local gradient continuity by increasing the gradient directionality information and consolidates the sparsity of the image by improved weight, while TV reconstruction algorithm only considers the sparsity of gradient in horizontal and vertical directions. Therefore, the sharp edges of CT images can be better preserved and the image reconstruction results are more accurate for NAWTV reconstruction algorithm.
NTV model
Inspired by the wok in [35], here a new TV (NTV) model is presented. The NTV model not only uses the gradient information in horizontal and vertical directions, but also enforces the gradient continuity. In Fig. 1 (a), it is seen that the pixels in the smooth region have similar values to the surrounding pixels, and any direction of the partial gradient can be seen as continuous. And when the partial gradient is continuous in addition to its own direction, the edge of the image can be obtained. Similarly, the continuity of partial gradient in the neighborhood (fs,t, fs-1,t-1, fs-1,t, fs,t-1) can be considered. In Fig. 1 (b), it is seen that the gradient continuity depends on the direction of the image edge, if it exists in the neighborhood. For different directions, the gradient continuity constraints are:
(a) pixel distribution; (b) partial gradient distribution.
Thus, we can enforce the directional continuity of Gx and Gy by ∥Gxxs,t ∥ 1,∥Gyys,t ∥ 1, ∥Gxys,t ∥ 1, ∥Gyxs,t ∥ 1, where Gxxs,t is the derivative of Gxs,t along the horizontal direction and Gyys,t is the derivative of Gys,t along the vertical direction. Actually, ∥Gxxs,t ∥ 1 = ∥ fs,t + fs-1,t-1 - fs,t-1 - fs-1,t ∥ 1 = ∥ Gyys,t ∥ 1. When we calculate the sparsest gradient, ∥Gxxs,t ∥ 1 = ∥ Gyys,t ∥ 1 can be regarded as the redundancy of the entire equation. We define NTV model:
In 2012, Liu et al. [25] proposed an adaptive-weighted total variation (AwTV) reconstruction algorithm. This algorithm improved the preservation of edge details by an adaptive-weighted which brings the local information into the above conventional TV model. We apply the adaptive-weighted technique in AwTV algorithm into the above NTV algorithm, we called it NWTV:
In 2008, Tian et al. [37] compared the performance of several kernel functions in denoising. The idea of these kernel functions is in line with our requirements for weight functions. Several weight functions are shown in Fig. 2, the Euclidean distance is used to represent the neighborhood pixel similarity, and the neighborhood pixel similarity is inversely proportional to the Euclidean distance.
Diagram of the weighted function curves.
From Fig. 2, it can be seen the exponential function (a) and cosine function (a) both get a larger weight when the similarity of neighborhood pixels is relatively high. However, with the decrease of similarity, the exponential function (a) decrease rapidly and cosine function (a) decreases slowly. When the relative parameters are adjusted to change the trend of exponential function, the range of weights will be smaller, such as the exponential function (b); the relative parameters are adjusted to change the trend of the cosine function, so that some small weights of the local image will be lost, such as the cosine function (b). Therefore, the exponential function has under-weighted shortcoming while the cosine function has over-weighted shortcoming.
To balance both under-weighted and over-weighted case, a new exponential cosine kernel function is proposed by combining exponential function and cosine function. From the exponential cosine function (a) and (b) in Fig. 2, it can be seen this new weight function can easily adjust the trend of weight change in the range of 0–1. This new weight is defined as follows:
where the Euclidean distance d (s, t) is related to the direction gradient, h2 is a scale factor which controls the range of weight, the value of h2 can be determined by maximum Euclidean distance to ensure the minimum value of the weight is 0. Replacing the weight function in formula (6) with the weight function defined by formula (7), a new adaptive-weighted TV (NAWTV) imaging model is proposed as follows:
Inspired by the TV implementation strategy, the steepest descent method and FISTA algorithm [38] are employed in NAWTV implementation for solving the optimization problem. In summary, the implementation of the NAWTV method can be described as follows:
(A) Initializing the beginning reconstructed image:
(B) The ART algorithm is used to reconstruct the image and calculate the median image satisfying the projection data consistency, for m = 1, …, N
data
:
(C) Positivity Constraint:
(D) Calculating and updating weights:
(E1) TV gradient descent initialization:
(E2) According equation (8) and (9), TV gradient descent method is applied, for m = 1, …, N
grad
(F) Fasten the convergence with FISTA algorithm:
(G) Return to (B) until the stopping criterion is satisfied.
Numerical results
To investigate the feasibility of proposed NAWTV algorithm for sparse-view CT reconstruction, we implemented it in MATLAB2016b under a PC with 3.60 GHz Intel Core i7-7700 CPU, 8.0 G memory, Windows 8 operating system environment. In experiments investigation, the Shepp-Logan phantom [39] and the low-contrast FORBILD head phantom [40] were used to simulate the fan-beam projection data based on circular orbit, and the size of simulation data is 256*256. In the simulated environmental, we use an equi-view virtual detector, the edge ray just covers the target. And the distance between source and center of rotation is 655.36 mm, the number of detector channels was set to 256. In the following numerical experiments, we use TV, AwTV, NTV, NWTV and NAWTV algorithms to reconstruct the simulation data and compare their experimental results, the maximum iteration number is set 1000.
Evaluation measures
To verify the effectiveness of the proposed method, the performance of the reconstructed image is evaluated by the root mean square error (RMSE) and the universal quality index (UQI). The definitions of RMSE and UQI are as follows:
For Shepp-Logan phantom based experiment, the phantom is uniformly sampled from 0 to 2π, and 24 projections are obtained. The two weight parameters in the implementation of the NAWTV algorithm are set to h1 = 0.21 and h2 = 1, and the gradient’s continuity weight is set to γ = 0.2. The weight parameter in the implementation of AwTV algorithm and NWTV algorithm is set to h = 0.21. The relaxation factor of the ART algorithm is set to λ = 1 and it is repeated once per total cycle. The outer iteration number for all methods are 1000 and the inner iteration number of TV terms are all N
data
= 20. The reconstructed images with TV, AwTV, NTV, NWTV, NAWTV algorithms and performance evaluation measures are shown in Fig. 3 and Table 1 respectively.
Reconstructed results of the Shepp-Logan phantom from 24 noise-free projection data. (a) Original image. (b-f) The reconstructed results of NAWTV, NWTV, NTV, AwTV and TV. Performance evaluation of the reconstructed results by different algorithms from noise-free projection data
From Fig. 3 we can see that using 24 projection data with 1000 times iterations, the reconstructed image using NAWTV and NWTV method almost have no difference with original image in visual effect. While the reconstructed image using NTV and AwTV method contains less artifacts, and the inner distribution near edge is more uniform than the image reconstructed with TV. Combined with the quantitative evaluation in Table 1, we can conclude that the NAWTV algorithm performs best among five of them.
To further investigate whether the performance of proposed algorithm is better, the comparison of the midline profiles of horizontal and vertical between original Shepp-Logan phantom and reconstructed images using five different reconstruction algorithms shown in Fig. 4. It can be seen from the local zoom in figure (a’) and (b’) that the horizontal and vertical profile of the image reconstructed by the NAWTV and NWTV algorithm are the closest to the original profile. In addition, the algorithm is well in protecting the edge property of the original image. At last, the change curves of the UQI and RMSE of this experiment are presented in Fig. 5. From the figure, we can see that the NTV algorithm converges faster than the TV algorithm, while NWTV and NAWTV converge faster than AwTV. The convergence speed difference is not very big between NWTV and NAWTV in Fig. 5. To facilitate observation, a local zoom of this two change curves is also shown in Fig. 5. In conclusion, we can get that the convergence speed of NAWTV algorithm is the fastest among five of them.
The comparison of the midline profiles between reconstructed images using five different reconstruction algorithms and Shepp-Logan phantom. (a) is the comparison of horizontal midline, (b) is the comparison of vertical midline. (a’) and (b’) is the corresponding zoomed in part of (a) and (b). The RMSE and UQI change curves of reconstructed images with different reconstruction algorithms at 24 projection data for Shepp-Logan phantom. The center rectangle region is a local zoomed in results of the change curves of NAWTV and NWTV algorithms in the dashed frame.

In order to further validate the robustness of the proposed algorithm in sparse-view reconstruction, simulation experiments were performed on FORBILD phantoms with more detail. The simulation model is uniformly sampled from 0 to 2π, and 36 projection data are obtained. The two weight parameters in the implementation of the NAWTV algorithm are set to h1 = 0.21 and h2 = 1.8, and the other reconstruction parameters are consistent with Section 3.2. The reconstructed images and performance evaluations are respectively shown in Fig. 6 and Table 2. The comparison of the midline profiles of horizontal and vertical between original FORBILD head phantom and reconstructed images using five different reconstruction algorithms shown in Fig. 7.
Reconstructed results of the FORBILD head phantom from 36 noise-free projection data. (a) Original image. (b-f) The Reconstructed results of NAWTV, NWTV, NTV, AwTV and TV. The comparison of the midline profiles between reconstructed images using five different reconstruction algorithms and FORBILD head phantom. (a) is the comparison of horizontal midline, (b) is the comparison of vertical midline. (a’) and (b’) is the corresponding zoomed in part of (a) and (b). Performance evaluation of the reconstructed results by different algorithms from noise-free projection data

From Fig. 6 we can see that using 36 projection data with 1000 times iterations, the reconstructed image using NAWTV and NWTV method have little difference in visual effect. Although there are some pleated artifacts, the reconstructed image of NAWTV and NWTV are the most similar to the original image in several reconstruction algorithms. In addition, the reconstructed image using NTV and AwTV method contains slightly more artifacts, but the visual effect of the reconstructed image is still better than the image reconstructed with TV. Combined with the quantitative evaluation in Table 2, we can conclude that the NAWTV algorithm is the best method for FORBILD phantom reconstruction.
From the local zoomed in results of the horizontal and vertical profile in Fig. 7 (a’) and (b’), we can see that the profile of the reconstructed image fluctuates more volatile, but the profile of NAWTV and NWTV algorithm are still the closest to the original profile in several reconstructed results, especially in the corner region. The change curves of the UQI and RMSE of the reconstructed image are shown in Fig. 8. Among them, the local zoomed in results of the change curves of NAWTV and NWTV algorithm are also shown in Fig. 8. In the reconstruction of phantom data with more details, the convergence speed of NAWTV algorithm is still the best, and the result is consistent with the simulation result of Shepp-Logan phantom.
The RMSE and UQI change curves of reconstructed images with different reconstruction algorithms at 36 projection data for FORBILD head phantom. The center rectangle region is a local zoomed in results of the change curves of NAWTV and NWTV algorithms in the dashed frame.
To validate the anti-noise performance of proposed algorithm, 24 projection data are uniformly sampled from the Shepp-Logan model, then random noise is added in projection data. Five reconstruction algorithms’ performance on noise projection data are compared in detail. The reconstruction parameters in the implementation of each algorithm are consistent with Section 3.2. The reconstructed images and performance evaluations are respectively shown in Fig. 9 and Table 3. From the reconstructed results and the evaluation measures, the edge of the reconstructed image on noise data is a little blurred comparing with the noise-free case, and the evaluation index is slightly reduced. For this noise projection data reconstruction experiment, the proposed NAWTV algorithm performs the best among five of them.
Reconstructed results of the Shepp-Logan phantom from 24 noise projection data. (a) Original image. (b-f) The reconstructed results of NAWTV, NWTV, NTV, AwTV and TV. Performance evaluation of the reconstructed results by different algorithms from noise projection data
The comparison of the midline profiles of horizontal and vertical between Shepp-Logan phantom and reconstructed images on noise projection data are shown in Fig. 10. It can be seen from the local zoomed in views in Fig. 10 (a’) and (b’), the reconstructed image profile of NTV is closer to the original image than the reconstructed image profile of TV, while the reconstructed image profile of NAWTV and NWTV are closer to the original image than the reconstructed image profile of AwTV. The change curves of the UQI and RMSE of the reconstructed images from noise projection data and local zoom in results of the change curves are shown in Fig. 11. It can be clearly seen that the NTV algorithm converges faster than the TV algorithm, the NWTV algorithm converges faster than the AwTV algorithm, while the NAWTV algorithm converges the fastest among five of them. This experimental result is consistent with the simulation result of the noise-free projection data reconstruction. In all, whether for noise-free or noise projection data reconstruction, the proposed algorithm NAWTV gives a better reconstruction performance and faster convergence than other algorithms.
The comparison of the midline profiles between reconstructed images from projection data with noise on Shepp-Logan phantom. (a) is the comparison of horizontal midline, (b) is the comparison of vertical midline. (a’) and (b’) is the corresponding zoomed in part of (a) and (b). The RMSE and UQI change curves of reconstructed images with different reconstruction algorithms at 24 noise projection data for Shepp-Logan phantom. The center rectangle region is a local zoomed in results of the change curves of NAWTV and NWTV algorithms in the dashed frame.

In this paper, the simulation experiments are implemented with Shepp-Logan phantom and FORBILD head phantom. The reconstructed image and numerical results of TV, AwTV, NTV, NWTV and NAWTV were compared qualitatively and quantitatively. It can be seen in terms of visual effect or performance evaluations, the algorithm of NAWTV has the biggest advantages in reconstructing image quality. The NAWTV reconstruction algorithm has a higher reconstruction accuracy, which can suppress artifacts and noise while preserving more edge structure information.
Since the lack of clinical CT projection data, this paper only carried out on simulation experiments, the experiment on real clinical projection data will be studied in the near future. In addition to the reconstruction algorithm is solved by gradient descent method, which takes a long time. The proposed algorithm can be further improved with state of the art technique, such as GPU based acceleration technique and new solution algorithms. In the further, we will try to explore the limited-view projection reconstruction problem to reduce radiation dose.
In summary, we proposed a new adaptive-weighted TV reconstruction algorithm for sparse-view projection data and demonstrated its promising performance. First, the proposed method introduces local gradient direction continuity to the TV reconstruction algorithm and the NTV model is proposed to improve the local gradient continuity. Second, the adaptive-weighted technique of AwTV algorithm is introduced into the NTV model to enhance the gradient sparsity. Last, the NAWTV model is proposed by improving weight function makes the weight function can balance the under-weighted and over-weighted cases. The NAWTV is used as a regularization term to constrain gradient sparsity and gradient continuity for sparse-view CT image reconstruction. In addition, it also fastens the convergence by incorporating the FISTA step into ours methods.
Footnotes
Acknowledgments
This work is supported by the Natural Science Foundation of Shanghai, China (Grant No. 18ZR1426900) and the National Natural Science Foundation of China (Grant No. 61201067).
