Abstract
Iterative reconstruction algorithms for computed tomography (CT) through total variation (TV) regularization can provide accurate and stable reconstruction results. TV minimization is the L1-norm of gradient-magnitude images and can be regarded as a convex relaxation method to replace the L0 norm. In this study, a fast and efficient algorithm, which is named a weighted difference of L1 and L2 (L1 - αL2) on the gradient minimization, was proposed and investigated. The new algorithm provides a better description of sparsity for the optimization-based algorithms than TV minimization algorithms. The alternating direction method is an efficient method to solve the proposed model, which is utilized in this study. Both simulations and real CT projections were tested to verify the performances of the proposed algorithm. In the simulation experiments, the reconstructions from the proposed method provided better image quality than TV minimization algorithms with only 7 views in 180 degrees, which is also computationally faster. Meanwhile, the new algorithm enabled to achieve the final solution with less iteration numbers.
Keywords
Introduction
The circular scanning configuration is widely used in many computed tomography (CT) applications [1]. The high radiation dose can increase the health risk, which has become a major clinical concern. Two classes of strategies to reduce radiation dose exist: 1) lower the measured X-ray tube current or voltage; and 2) lower the required number of projection views. The first strategy usually leads to noisy projection data at each view with lower signal to noise ratio (SNR), which causes a low reconstruction image quality compared to the data from higher current-measured or voltage-measured scan. The second strategy focuses on the reconstruction from incomplete-view data. To the analytic reconstruction algorithm, the reconstructions with noticeable artifacts will be obtained from the incomplete-view projection data because it does not satisfy the Tuy–Smith condition [2, 3]. On the other hand, the traditional iterative reconstruction algorithms, such as simultaneous algebraic reconstruction technique(SART) [4], provide noisy reconstructions with noticeable artifacts from the incomplete-view projection data as well.
Iterative reconstruction algorithm based on compressive sensing theory [5, 6] can reconstruct images accurately utilizing the sparsity of signals [7, 8]. The sparse of images is generally not widely applicable in industrial and security scanning applications, but it is often the case that images have sparse gradient-magnitude images (GMI) [8]. Method based on the total variation (TV) minimization, which is the L1-norm of GMI, has been widely studied in the reconstruction from few-view and limited-angle data [7–13]. These methods can be roughly categorized into different classes of optimization-based algorithms, including the adaptive-steepest-descent-projection-onto-convex-sets (ASD-POCS) framework [8], the Chambolle–Pock (CP) framework [14], the split-Bregman (SB) iteration [15], the alternating direction minimization (ADM) method [16, 17], etc.
It should be noted that TV is not the most direct method to describe the GMI’s sparsity. For a digital signal
In this study, we propose a weighted difference of L1 and L2 on the gradient minimization for circular CT. It is closer to the L0 norm of gradient than TV minimization model. The alternating direction method is utilized to iteratively reconstruct a CT image from sparse-view projection measurements by splitting the augmented Lagrangian function of the optimization into two sub-problems. One sub-problem, containing the image constraints term, can be solved by the shrinkage operator after slacking the L2 of image gradient. The other sub-problem is solved by a local linearization and proximity technique that avoids pseudo-inverse computation of big matrices, which is a quadratic function. The algorithm is tested by simulation and real CT data and shows outstanding capabilities in sampling reduction. The proposed method can provide better image quality when α increases from 0 to 1. Meanwhile, the convergence speed is also faster with the increase of α. The best reconstruction can be provided for the proposed method, with α = 1 than that of α = 0, which equals TV minimization.
The remainder of this paper is organized as follows. Section 2 presents the proposed reconstruction model and the design of the proposed algorithm. In Section 3, numerical simulations and real data experiments are used to validate the performance and capability of the proposed algorithm. In Section 4, a brief discussion is given of the results obtained from Section 3, and our conclusions are given in the end.
Methods
Proposed reconstruction model
The imaging model can be approximated using the following discrete linear system:
Several viable alternatives are available because minimizing L0 norm is NP-hard [18]. Greedy methods (matching pursuit [27], orthogonal matching pursuits (OMP) [28], and regularized OMP (ROMP) [29]) work well if the dimension N is not too large. Another strategy is relaxing the L0 norm by a continuous sparsity promoting penalty functions. Convex relaxation uniquely selects the L1 norm, which is the TV minimization:
Here, we select a novel non-convex relaxation approach, called weighted difference of L1 and L2:

Contours of three sparsity metrics. The level curves of approach the x and y axes as the values decrease, hence promoting sparsity. Panel (a) shows the contour of L1, panel (b) shows the contour of L1-(1/2)L2, and panel (c) shows the contour of L1-L2.
The model can also be written as
To solve the proposed reconstruction model (5) efficiently, the alternating minimization method (ADM) algorithm is utilized in this paper. For constrained optimization, an influential class of method class seeks for the minimizer or maximizer by approaching the original constrained problem by a sequence of unconstrained sub-problems. We apply the augmented Lagrange (AL) function, which could be regarded as a combination of the Lagrangian and quadratic penalty functions, to convert formula (5) to an unconstrained form:
where β1 and β2 are the penalty coefficients, and
Fixing the variable
Let
Setting its gradient to 0 results in
On the periodic definition condition, the matrix form of ∇
Fixing the variable
Unfortunately,
In conclusion, the weighted difference of L1 and L2 on the gradient minimization with ADM is proposed, and the pseudocode of the proposed algorithm is as follows:
While under the influences of noise, the observation equation is no longer
The implication of parameters in (20) is the same to that in (6). Compare the above function to that in (6), it can be easily found that the solvations of
Excepting updating variables
However,
Again, thus, a new working algorithm for allowing for the noise can be generated as follow
It is important to identify an effective stopping criterion for the iteration sequence (7) for algorithm 1. And the Karush-Kuhn-Tucker (KKT) conditions [30] are satisfied by optimal solutions of constrained optimization problems. Here, we take (
where ∂
where
And the detailed derivation of (24) is shown in Appendix 1. Thus, the residuals for the feasibility conditions can be viewed as
Under ideal conditions, the mathematical convergence conditions of Algorithm 1 should be
For algorithm 2, the KKT conditions has a little difference because of the addition of noise
While it has no effect on the residuals for
For the proposed method, the selections of parameters have huge impact on the convergence. Parameter β1 and β2 are used to balance the quadratic term and penalty function. As a matter of experience, they are better to be set more than 0 and less than 104. Between them, they are also used to balance the weighting of influence by two constraint terms. Generally, they are in the same order of magnitudes. Parameter τ is a positive number, and it is usually set in [1, 2]. In this paper, it is set 1 for all of the experiments. For noisy data, parameter δ in algorithm 2 should be considered. If the noise
Experimental results
The algorithm was performed with the use of both numerical simulation and real data. All experiments were implemented with MATLAB programming language on an HP workstation with Intel® Xeon® CPU E5-2620 v3 @2.40 GHz×2 and 64 GB RAM on a 64-bit Windows 7 system.
Simulation experiments
In this experiment, a fan beam geometry is set to simulate the data acquisition of CT imaging. The typical Shepp–Logan phantom with a discretization of 256×256 pixels is utilized as the standard image. The distance from the X-ray source to the iso-center is 164.86 mm and the distance from the X-ray source to the center of the line detector is 1052.00 mm. The number of detector bins is 512 with a size of 0.074 mm for each.
In the first experiment, seven projections are evenly acquired within 180° along the circular trajectory. The parameter α is discussed first. We set α from 0 to 1 with interval 0.1. The model equals TV minimization when α = 0, and it satisfies the model of L1- L2 in [25] when α = 1. Here we set β1 = 66 and β2 = 62 for all of the tests in experiment 1. Through the stopping criterion strategy in 2.3 chapter, we consider that the reconstruction result can be the solution when

Curves of value

Original phantom and seven-view reconstructions by the proposed algorithm with α = 0, 0.5, 1. Panel (a) shows the original phantom, panel (b) shows results at 1200 iterations, panel (c) shows results at 1600 iterations, and panel (d) shows results at 2000 iterations. Its grayscale window is [0.195, 0.205].

The comparisons of RMSE curves for the proposed algorithm with different α values for seven views of projections based on the original Shepp–Logan phantom.
Figure 3 shows some results in special parameter α that it can provide better reconstructions using less iterations for the larger α. With the iteration increasing, the final results can have a high image quality for all the α in [0, 1]. For a more detailed result, the curves of RMSE for each value of α in Fig. 4 indicate that reconstructions by the proposed method have faster error reduction speed with increasing α. Table 1 shows that the value of RMSE is smaller when the value of α increases at each iteration number. Although some fluctuations exist in the back iterations, the final results improve with increasing α. In other words, the final reconstruction is the best one when α = 1 with the smallest value of RMSE.
The numerical comparisons at different iterations of each α for the proposed algorithm with seven views
In further studies, the parameter α is discussed with two special values, 0 and 1, based on the results in the first experiment. Other values of α from 0 to 1 cause reconstruction performance between that of α = 0 and 1. In the second experiment, we use the low-contrast Shepp–Logan phantom generated by MATLAB code phantom(’Shepp-Logan’,256), and the system parameters is invariant to that in the first experiment. We also use

Curves of value

Original phantom and seven-view reconstructions with different α values at an iteration number of 3500 for low-contrast Shepp–Logan phantom. Panel (a) shows the original phantom, panel (b) shows the result with α = 0, and its RMSE is 9.09E–6, and panel (c) shows the result with α = 1, and its RMSE is 7.02E–6. Its grayscale window is [0.0195, 0.0205].

The comparisons of RMSE curves for the proposed algorithm with different α values for seven views of projections for low-contrast Shepp–Logan phantom.
Reconstructions with noisy projections are conducted and demonstrated in this subsection. The Poisson noise is added into the simulated projections. Although the noise level in the experiment is not very high compared with the original projections, reconstruction with very few views is still very difficult. In this experiment, the reconstructions under seven views with α = 0 and 1 are carried out for the proposed algorithms. Here we set β1 = 128 and β2 = 100 for all of the tests in this experiment. In this experiment, we use algorithm 2 to test the performance of the proposed method with α = 0 and 1. In this test, we consider that the reconstruction result can be the solution when

Curves of value

Original phantom and seven-view reconstructions with different α values at an iteration number of 2100 for original Shepp–Logan phantom adding Poisson random noise. Panel (a) shows the original phantom, panel (b) shows the result with α = 0, and its RMSE is 2.4E–3, and panel (c) shows the result with α = 1, and its RMSE is 1.08E–4. Its grayscale window is [0.195, 0.205].

Original phantom and seven-view reconstructions with different α values at an iteration number of 3000 for original Shepp–Logan phantom adding Poisson random noise. Panel (a) shows the original phantom, panel (b) shows the result with α = 0, and its RMSE is 2.54E–4, and panel (c) shows the result with α = 1, and its RMSE is 1.38E–4. Its grayscale window is [0.195, 0.205].

The comparisons of RMSE curves for the proposed algorithm with different α values for seven views of projections for the original Shepp–Logan phantom.
In this Section, real data verifications are conducted to test the proposed Algorithm 2. Projections are acquired from an adult male rabbit in vivo on a CT imaging system. The distance from the source to the rotation center is 502.8 mm, and the distance from the source to the center of the detector is 1434.7 mm. The flat panel detector has 1024 bins with a pixel size of 0.381 mm. The size of the reconstructed image is 800×800 with 0.1335 mm×0.1335 mm for each voxel. A total of 360 projections are acquired evenly within the range of 360°.
We first perform the reconstruction with the real CT projections using the standard FBP algorithm. Reconstruction with full views and sparse views (60 projections evenly chosen with angular step of 3° within 180°) by FBP are conducted and the typical results are shown in the two left images in Fig. 13. In this test, we set β1 = 10 and β2 = 10 for all of the tests in this experiment. Algorithm 2 is used to test the performance of the proposed method with α = 0 and 1 as well. For iterative algorithms under real CT projections, the iteration should be stopped at an appropriate number, thereby avoiding the influence of noise. Thus, we consider that the reconstruction result can be the solution when

Curves of value

Reconstructions for the rabbit by FBP and the proposed algorithm at iteration number of 130. Panel (a) shows the result of FBP with full views, panel (b) shows the result of FBP with 60 views evenly chosen within 180°, panel (c) shows the result of the proposed algorithm at α = 0 with 60 views evenly chosen within 180°, and panel (d) shows the result of the proposed algorithm at α = 1 with 60 views evenly chosen within 180°. Its grayscale window is [0.008, 0.14].
Images reconstructed from sparse views by FBP suffer severely from the streak artifacts and several tiny structures are degraded, which prove that the analytical reconstruction algorithm is not applied to sparse view projections. No exact image for the real CT exists, even when using the FBP under full views. The result by the full view FBP can be a reference to the result by iterative algorithm. The reconstructions by the proposed algorithm, as shown in the two right images in Fig. 13, are close to the result by full view FBP. For detailed results, the ROI of these four reconstructions are shown in Fig. 14. The reconstruction by the proposed method with α = 1 is closer to the full view FBP than the reconstruction by the proposed method with α = 0 for the tiny structures of bones.

ROI of reconstructions for the rabbit by FBP and the proposed algorithm at iteration number of 130. Panel (a) shows the ROI result of FBP with full views, panel (b) shows the ROI result of FBP with 60 views evenly chosen within 180°, panel (c) shows the ROI result of the proposed algorithm at α = 0 with 60 views evenly chosen within 180°, and panel (d) shows the ROI result of the proposed algorithm at α = 1 with 60 views evenly chosen within 180°. Its grayscale window is [0.008, 0.14].
TV-based methods have been widely used in CT image reconstruction under sparse-view projection measurements. While it is not the best way to describe the sparsity, but the L0 norm. This paper proposed an efficient reconstruction algorithm named a weighted difference of L1 and L2 on the gradient minimization for circular CT. It is closer to the L0 norm than the TV minimization method, that is the better description of image sparsity. This new method is derived by ADM and can achieve promising results using less iteration numbers under very few views of projections compared with conventional TV minimization method. Considering the noise data, we improve the reconstruction model, and propose a new reconstruction algorithm. Moreover, we provide the stopping criterion through the KKT conditions to guarantee that the reconstructed image can be considered as the final solution. The proposed method can be regarded as the L1 - αL2 norm of gradient, and the TV minimization method is the L1 norm of gradient. Though the proposed method appends only one term compared with TV minimization method, they are different in essence. TV minimization can be regarded as a convex relaxation of L0 norm of gradient, whereas the proposed method is a non-convex relaxation of L0 norm of gradient. The proposed method has a better performance in reconstruction under very few views of projections because it is has better description of image sparsity than the TV minimization method.
In the simulation experiments, the stopping iteration number and RMSE curves show that the proposed method can provide better reconstruction results than traditional TV minimization algorithms whether noise-free data or noise data, and requiring less iteration number with increasing α from 0 to 1. The best performance of the proposed method is obtained at α = 1. In the real data reconstruction, we test the proposed method by sparse-view projection data from the skull of adult male rabbit on a CT imaging system. The reconstructions indicate that the proposed method can preserve many fine details that many tissues of structural information are visible, which can be as good as the full-scan FDK reconstruction. While, some of them are lost in the TV reconstruction.
Footnotes
Appendix 1
The optimality condition of (12) and (17) for Algorithm 1.
For the k iteration of (7), we can get following expressions from (12) and (17)
Then, combined with (7c) and (7d), they are converted into
So we can get following expressions
where
Acknowledgments
This work is supported by the National Natural Science Foundation of China (Nos. 61372172, 61601518). We also want to thank the anonymous referees for giving us useful comments and suggestions to improve this paper.
