Abstract
BCKGROUND:
Accurate statistical model of the measured projection data is essential for computed tomography (CT) image reconstruction. The transmission data can be described by a compound Poisson distribution upon an electronic noise background. However, such a statistical distribution is numerically intractable for image reconstruction.
OBJECTIVE:
Although the sinogram data is easily manipulated, it lacks a statistical description for image reconstruction. To address this problem, we present an alpha-divergence constrained total generalized variation (AD-TGV) method for sparse-view x-ray CT image reconstruction.
METHODS:
The AD-TGV method is formulated as an optimization problem, which balances the alpha-divergence (AD) fidelity and total generalized variation (TGV) regularization in one framework. The alpha-divergence is used to measure the discrepancy between the measured and estimated projection data. The TGV regularization can effectively eliminate the staircase and patchy artifacts which is often observed in total variation (TV) regularization. A modified proximal forward-backward splitting algorithm was proposed to minimize the associated objective function.
RESULTS:
Qualitative and quantitative evaluations were carried out on both phantom and patient data. Compared with the original TV-based method, the evaluations clearly demonstrate that the AD-TGV method achieves higher accuracy and lower noise, while preserving structural details.
CONCLUSIONS:
The experimental results show that the presented AD-TGV method can achieve more gains over the AD-TV method in preserving structural details and suppressing image noise and undesired patchy artifacts. The authors can draw the conclusion that the presented AD-TGV method is potential for radiation dose reduction by lowering the milliampere-seconds (mAs) and/or reducing the number of projection views.
Introduction
X-ray CT has been widely used in clinical diagnosis, image-guided interventions, and radiotherapy because of high-resolution anatomic imaging. However, the extra radiation dose in CT examination is harmful which may induce genetic, cancerous and other diseases [1, 2]. Therefore, minimizing radiation dose is one of the major topic in CT fields.
Over the years, two strategies have been widely used to reduce radiation dose: the first one is to lower the milliampere-seconds (mAs) in data acquisition protocols [3–10] and the second one is to decrease the number of projections per rotation around the object [11–18]. The first strategy will result in noisy sinogram data and the reconstructed image quality will be degraded [5, 10]. The latter strategy will cause serious streaking artifacts in the reconstructed images because of insufficient angular sampling [14–16].
To achieve high-quality images, various advanced CT image reconstruction methods have been reported [8, 10–18]. Statistical iterative reconstruction (SIR) methods have shown great potential to reduce noise and artifacts as compared with filtered back-projection (FBP) algorithm [5, 10]. SIR method is composed of data-fidelity and regularization terms. The fidelity term models the statistics of measured projection data and the regularization term reflects the prior information of the reconstructed image. The projection data can be modelled accurately using the compound poisson noise of polyenergetic x-ray spectrum plus Gaussian electronic noise [10]. However, such a complicated statistical distribution is numerically intractable for image reconstruction [19]. The projection data after linearity calibration on the transmission data can be easily manipulated, but lacks a statistical description for image reconstruction [20]. To address this problem, we present an alpha-divergence constrained total generalized variation (AD-TGV) method for CT image reconstruction. The novelty of the AD-TGV method is twofold. First, alpha-divergence fidelity and TGV regularization are incorporated into one framework for CT image reconstruction. Alpha-divergence is adapted to measure the discrepancy between projection data and corresponding estimator; the higher order derivative of the images is incorporated into TGV regularization could yield visually pleasant images with more continuous boundaries and less patchy artifacts. Second, a modified alternating optimization algorithm is proposed to minimize the objective function of the AD-TGV method with robust convergence.
The rest of this paper is organized as follows. The AD-TGV model for CT image reconstruction is presented in section 2. The experimental setup and evaluation metrics are also presented in this section. The results are reported in section 3. Finally, the discussion and conclusion are presented in section 4 and 5, respectively.
Materials and methods
Alpha-divergence
The alpha-divergence is a generalization of KL divergence and χ2-divergence which is used to measure the difference between probability distributions [7, 22]. For any tow distributions r (x) and t (x), the alpha-divergence is defined as follows:
TGV was first proposed in [23] for image denoising, which is capable to measure image characteristics up to a certain order of differentiation. The second-order TGV of a given image f is defined as follows:
The CT measurement can be formulated as a discrete linear system under a monochromatic beam:
To find an approximation of Equation (3), alpha-divergence was used to measure discrepancy between y and Hf:
The objective image is the minimum of Dα (y, Hf) under the constraints f ≥ 0 (f
j
≥ 0 for all j). As opposed to minimizing the Equation (4) directly, we propose the following AD-TGV minimization for CT image reconstruction
Since the objective function in the reconstruction problem (5) is convex, it can be solved by iteratively performing the following two steps:
Problem (P1) is a gradient descent method with a step-size 1/μ of for the minimization problem . The introduction of μ is to ensure the numerical stability of the gradient descent method. The efficiency of the algorithm can be improved by substituting this one step gradient descent with a surrogate function (SF) algorithm in [21, 22], the overall convergence can be enhanced. The detailed implementation of this SF algorithm is performed as follows:
Prime-dual algorithm described in [25] was employed for solving the problem (P2), which can be rewritten as the following convex-concave saddle-point problem:
The proximal map prox1 (f) is defined as
In summary, the implementation of the AD-TGV method can be described as follows:
In this study, an initial estimate of f0 was set to be the preliminary image reconstructed by the FBP method with ramp filter. The initial estimation of were set to be zeros. In line 2, ρ and τ are step-size, which can be determined according to the work in [25]. The stop criterion is .
Selection of the value of α
Determining an optimum α to describe the statistical distribution of projection data is very important for AD-TGV image reconstruction. It is known that if α = 2, then the α-divergence measure has the following form:
The penalty parameter λ balances the fidelity and regularization terms in Equation (9). The penalty parameter determines the smoothness of the reconstructed image. Practically, the penalty parameter is empirically determined through a visual inspection for eye-appealing result. In this work, the penalty parameter λ was determined case by case on the basis of the noise level of reconstructed image and projection.
Experimental data
XCAT phantom
A slice of digital XCAT phantom with a tumor lesion (Fig. 1(a)) was used in this study [29]. The imaging geometry is as follows: (1) the distance from the detector arrays to the source is 1040 mm; (2) the distance from the rotation center to the source is 570 mm; (3) the number of an arc detector bin is 672; and (4) the space of each bin is 1.407 mm. The size of the reconstructed images is512×512.

Illustration of a slice of digital XCAT phantom (a), an anthropomorphic torso phantom (b), and the gold standard (c) for the anthropomorphic torso phantom reconstructed by FBP algorithm with the averaged sinogram data of 150 times repeated scans at a protocol of 100 mAs and 120 kVp.
Similar to our previous studies [8, 15], the noisy transmission data I was simulated as follows:
An anthropomorphic torso phantom (Fig. 1(b)) was scanned by a Siemens SOMATOM Sensation 16 CT scanner at 40 mAs, 120 kVp. The sparse-view projection was evenly extracted from the acquired sinogram data to ensure the same noise level for each projection view.
Patient data
The patient data were obtained with a patient consent for a chest CT study. The experimental data were acquired by a Siemens SOMATOM Sensation 16 CT scanner. The scanning parameters were as follows: the time per gantry rotation is 0.5 s, the tube current is 200 mA, and the tube voltage is 120 kVp.
Performance evaluation
Image reconstruction accuracy
The relative root mean square error (rRMSE), which calculates the accuracy of the resulting image, was utilized to measure the quality of the reconstructed image. The rRMSE is defined as follows:
To explore the performance of the proposed method, the universal quality index (UQI) [30] was used to conduct an ROI based analysis by evaluating the similarity between the reconstructed image and gold standard. UQI ranges from 0 to 1, and its value closer to 1 suggests a better similarity between the reconstructed and gold standard images.
Comparison method
To validate and evaluate the performance of AD-TGV method, the AD-TV method described in our previous work [7] was adopted for comparison. The cost function of AD-TV method is written as follows:
XCAT phantom study
Determination of the value of α
The influence of the parameter α on the reconstruction accuracy should be considered carefully because the performance of the present AD-TGV method heavily depends on the α setting. In this study, the AD-TGV method was validated quantitatively from different values of α with the range of α ∈ [1, 2]. The rRMSE versus the value of α for the AD-TGV method from different sparse-views (30-, 40-, and 60-view projections) data were shown in Fig. 2. We can observe that rRMSE increases with the value of alpha increases. Hence, we just focus on the case of α = 1, which exhibits the lowest rRMSE.

rRMSE versus the value of alpha for the AD-TGV method from different projections: (a) 30-view projection; (b) 40-view projection; (c) 60-view projection.
The rRMSE versus the iteration steps for the present AD-TGV method from different projection views (30-, 40- and 60-view) are displayed in Fig. 3. It can be observed that the proposed AD-TGV method can converge to a steady solution after enough iteration. In addition, the convergence speed is accelerated as the projection views increased with the same penalty parameter. The results show that the present AD-TGV method can yield a steadily convergence solution in terms of the rRMSE measurement.

rRMSE versus iteration steps for the AD-TGV method from different projections: (a) 30-view projection; (b) 40-view projection; and (c) 60-view projection.
The results reconstructed by different methods from the 30-, 40- and 60-view projections are shown in Fig. 4. Serious streak artifacts can be observed in the images reconstructed by the FBP method. The steak artifacts have been mostly suppressed in the images reconstructed by AD-TV method, but undesired artifacts can still be observed. The images reconstructed by AD-TGV method achieve more gains over the AD-TV method in terms of successful artifacts suppression and edges detail preservation.

The images reconstructed by the FBP with ramp filter, AD-TV and AD-TGV methods from 30-, 40- and 60-vews projections, respectively.
To quantitatively evaluate the present AD-TGV method, the UQI were studied in this subsection. The ROI as indicated by a red rectangular window in Fig. 1(a) was selected to calculate the UQI values. The corresponding UQI values are shown in Fig. 5. We can observed that the UQI values of AD-TGV results are higher than the AD-TV results.

UQI values of images reconstructed by FBP, AD-TV, and AD-TGV methods from 30, 40, 60, 116, 232, 290, 387, 580, and 1160-projection views.
Visual evaluation
The results reconstructed by different methods from the sinogram data acquired at 40 mAs with different projection views are shown in Fig. 6. We can observe that AD-TGV method achieves more gains than AD-TV method in terms of successful patchy artifacts suppression and structure detail preservation. Figure 7 depicts the associated vertical profiles of the reconstructed images across the 245th column, from the 233th row to the 283th row. The results suggest that the AD-TGV method achieved better matching results than the AD-TV method.

Images reconstructed by the FBP, AD-TV, and AD-TGV methods from 58-, 116-, 232-, 290- view projection at 40 mAs.

Profiles (245th column) of the images reconstructed by the AD-TV and AD-TGV methods from different projections at 40 mAs: (a) 58-view projection; (b) 116-view projection; (c) 232-view projection; (d) 290-view projection.
To further show the gains of the present AD-TGV method, the zoomed ROIs (as indicated by the squares in Fig. 1(c) are shown in Fig. 8. Serious artifacts can be found in the FBP results for all the cases, and AD-TGV method can achieve noticeable gains compared with AD-TV method in terms of staircase and patchy artifacts suppression.

The zoom-in views of images in Fig. 7.
The gold standard image (Fig. 1(c)) was reconstructed by the FBP method with ramp filter from the averaged full view sinogram data of 150 times repeated scans at the protocol of 100 mAs and 120 kVp. The curve of UQI values versus the number of projection views is shown in Fig. 9. The UQI values of AD-TGV results are higher than that of the AD-TV results. Thus, the AD-TGV method achieves better matching results than the AD-TV method.

UQI value versus the numbers of projection views.
The chest CT images reconstructed by different methods are displayed in Fig. 10. The image reconstructed by the FBP method with ramp filter from full view projection is shown in Fig. 10(a). The image reconstructed by the FBP method with ramp filter from 387 projection views is shown in Fig. 10 (b). The image reconstructed by the AD-TV method from 387 projection views is shown in Fig. 10 (c). The image reconstructed by the AD-TGV method from 387 projection views is shown in Fig. 10 (d). To visualize the difference between the AD-TV and AD-TGV methods, the zoomed ROIs of the images (Fig. 10) are shown in Fig. 11. The results show that the AD-TGV method achieves better noise reduction, artifacts suppression, and structure preservation than the AD-TV method.

Clinical chest CT image reconstruction results: (a) image reconstructed by the FBP method with ramp filter from full view projection at 100 mAs, 120 kVp; (b) image reconstructed by the FBP method with ramp filter from 387-view projection at 100 mAs, 120 kVp; (c) image reconstructed by the AD-TV method from 387-view projection at 100 mAs, 120 kVp; and (d) image reconstructed by AD-TGV method from 387-view projection at 100 mAs, 120 kVp.

The zoomed ROI in Fig. 10. (a) image reconstructed by the FBP method with ramp filter from the full view projection at 100 mAs, 120 kVp; (b) image reconstructed by the FBP method with ramp filter from 387-view projection at 100 mAs, 120 kVp; (c) image reconstructed by the AD-TV method from 387-view projection at 100 mAs, 120 kVp; and (d) image reconstructed by the AD-TGV method from 387-view projection at 100 mAs, 120 kVp.
In this study, we proposed an alpha-divergence constrained total generalized variation minimization for sparse-view CT image reconstruction. The alpha-divergence is a generalization of KL divergence and χ2- divergence [30] which is usually used to measure the difference between two probability distributions. Some special cases in Dα (y, Hf) listed below have been frequently applied to iteration reconstruction:
Ignoring the constants in Equation (16) it coincides with opposite log-likelihood function. Equation (17) and Equation (18) are the corresponding WLS models in [26, 27]. An accurate statistical distribution of the sinogram data is very important for high-quality CT image reconstruction. By the determined adequate α, the corresponding alpha-divergence can accurately describe the statistical distribution ofsinogram data.
TV regularization has been widely used in CT image reconstruction with noticeable gains [11, 17]. However, TV regularization often leads to noticeable staircasing effect and patchy artifacts because of the piecewise constant assumption [15, 32]. In this work, the TGV regularization was used to eliminate the staircasing effect and patchy artifacts. The performance of the presented AD-TGV method has been extensively validated and evaluated in terms of the noise reduction and structure detail preservation. As shown in the study of digital XCAT phantom, the performance of the AD-TGV method is impressive by quantitative measure and visual inspection. Furthermore, qualitative and quantitative studies were conducted using using physical phantom and patient data in realistic CT imaging. The results show that the AD-TGV method outperforms the AD-TV method in terms of visual inspection and UQI evaluation.
For the implementation of the AD-TGV method, several parameters should be optimally selected. This is an open problem for the iterative reconstruction methods. In this study, we just focus on the value of α = 1. Furthermore, the rRMSE versus the vale of α in the range of [10-6, 2] for AD-TGV method with 60-view projection (XCAT phantom) was shown in Fig. 12. We note that the optimal value of α is around 0.15, and rRMSE can reach its minimum at α = 1 in the range of [1, 2]. Determining the optimal value of α in a greater interval is an interesting topic for future research. The parameters ρ and τ can be selected according to the scheme introduced in [25]. Similar to most SIR methods, the computational time of the proposed AD-TGV could be a challenge for its practical use because of multiple re-projection and back-projection operations. Nevertheless, with the development of fast computers and dedicated hardware [33], iterative reconstruction algorithms including the present AD-TGV method may be used for clinical CT reconstruction in the near future.

rRMSE versus the value of alpha for the AD-TGV method with 60-view projection (XCAT phantom).
In this work, we presented an iterative reconstruction method for sparse-view X-ray CT using alpha-divergence constrained TGV. The alpha-divergence is used to measure the discrepancy between the measured projection and estimation. The TGV regularization is proposed to eliminate the staircase and patchy artifacts which is often observed in TV regularization. Experimental results show thatAD-TGV method is more beneficial than AD-TV method in terms of noise reduction, artifact suppression and structure preservation. In other words, the presented AD-TGV method can potentially reduce the radiation dose by lowering the mAs and reducing the number of projection views.
Footnotes
Acknowledgments
This work was partially supported by the National Natural Science Foundation of China (No.81371544, No.61571214, No.11661007 and No.11661008), the National Science and Technology Major Project of the Ministry of Science and Technology of China (No. 2014BAI17B02), Science and Technology Program of Guangzhou (No.201510010039), Natural Science Foundation of Jiangxi Province(No.20161BAB212055), Science and Technology Program of Jiangxi Education Committee(GJJ150994), and NIH/NCI(# CA143111 and # 082402).
