Abstract
BACKGROUND AND OBJECTIVE:
Since the stair artifacts may affect non-destructive testing (NDT) and diagnosis in the later stage, an applicable model is desperately needed, which can deal with the stair artifacts and preserve the edges. However, the classical total variation (TV) algorithm only considers the sparsity of the gradient transformed image. The objective of this study is to introduce and test a new method based on group sparsity to address the low signal-to-noise ratio (SNR) problem.
METHODS:
This study proposes a weighted total variation with overlapping group sparsity model. This model combines the Gaussian kernel and overlapping group sparsity into TV model denoted as GOGS-TV, which considers the structure sparsity of the image to be reconstructed to deal with the stair artifacts. On one hand, TV is the accepted commercial algorithm, and it can work well in many situations. On the other hand, the Gaussian kernel can associate the points around each pixel. Quantitative assessments are implemented to verify this merit.
RESULTS:
Numerical simulations are performed to validate the presented method, compared with the classical simultaneous algebraic reconstruction technique (SART) and the state-of-the-art TV algorithm. It confirms the significantly improved SNR of the reconstruction images both in suppressing the noise and preserving the edges using new GOGS-TV model.
CONCLUSIONS:
The proposed GOGS-TV model demonstrates its advantages to reduce stair artifacts especially in low SNR reconstruction because this new model considers both the sparsity of the gradient image and the structured sparsity. Meanwhile, the Gaussian kernel is utilized as a weighted factor that can be adapted to the global distribution.
Keywords
Introduction
Computed tomography (CT) is referred to as an important version of the problem which is that of obtaining the attenuated coefficients within the object from multiple X-ray projection data [1]. Owing to the emerging of CT, it has been extensively utilized in medicine and industry [2, 3]. For instance, for new medical application, the dedicated CT systems have been developed in breast imaging [4]. Under the circumstance, radiation exposure to the chest is a key consideration [5], since the chest issues are sensitive to the radiation. In view of the X-ray dose, lowering X-ray tube current intensity is an effective way to lower the exposure dose [6], but it is unfortunate that the obtained projection data will be of low Signal-to-Noise ratio (SNR). Moreover, in industry, reducing exposure time for the fast scanning speed may yet lead to the low SNR projection data. Reconstructing high-quality image from projection data with low SNR is a challenging and ill-posed problem [7].
Several efforts have been made to improve the quality and preserve edges of the image reconstructed from noisy projection data. The corresponding iterative algorithms has been reported. For instance, the classical simultaneous algebraic reconstruction technique (SART) as a major refinement of the algebraic reconstruction technique (ART) [8], has the more potential to acquire decent image quality using projections with low-level noise, compared to the commercial analytical algorithm such as filtered back-projection [9]. However, SART will not work well in condition of the projection data with the high-level noise as shown in Fig. 1 (b).

(a) is the reference image. (b) represents the image reconstructed from 512 projection views with added Gaussian noise whose average value is zero and standard deviation is 2% maximum projection in current view by using SART.
It is noteworthy that the regularization is an effective technique for addressing severely ill-posed problem using a certain prior information of the image to be reconstructed [10]. Total variation (TV) regularization as a representative is state-of-the-art in handling low-dose and few-view CT [11, 12], and the other sparsity regularization such as wavelet frames [13], Haar transform [14], S-transform [15] and so on. Currently, the noise suppression-guided image filtering reconstruction algorithm was proposed to deal with the low SNR problem [16].
TV based regularization is already commercialized in reconstructing image from noisy and incomplete projection data [11, 17]. But the assumption of TV regularization is that the image to be reconstructed consists of the areas, which are piece-wise constant [18]. TV performs well for the projection data with relatively low-level noise. Whereas, when the SNR of the obtained projection data is low, TV-based methods sometimes yield the undesirable stair artifacts and results in stair artifact [18]. Figure 2 (b) exhibits the performance of TV regularization for disposing noisy projection data. It can be observed that although TV algorithm has ability to deal with the high-level noise, it leads to the stair artifacts and distorts some details.

(a) is the reference image. (b) represents the image reconstructed from 512 projection views with added Gaussian noise whose average value is zero and standard deviation is 2% maximum projection in current view by using TV algorithm.
Most of the regularization techniques usually put emphasis on the global characteristics of the transformed image, and its constraint according to the image is posed on the whole. Latterly, the regional structure information of the reconstructed image is getting more and more attention. The way of group sparsity (GS) captures this key point. Soon afterwards, a great deal of the researches on GS has been reported in concern with denoising and preserving the edges. The approach that introduces group sparsity into TV was proposed for signal denoising [19]. TV with overlapping group sparsity (OGS) was presented for image restoration to alleviate the stair artifacts [20]. Subsequently, TV with OGS was proposed for deblurring Poisson noisy images, and it has ability to avoid stair effect and preserve edges in the restored images [21]. Recently, a novel image reconstruction for electrical impedance tomography was proposed, which utilizes enhanced adaptive group sparsity with TV constraint [22]. A new approach based on enhanced group sparsity and non-convex regularization was presented to perform compressed sensing MRI (magnetic resonance imaging) [23]. More recently, the concept of group sparsity is widely used in many applications, and they all worked very well.
In this paper, motivated by the above works, a novel model is proposed that introduces the Gaussian kernel and overlapping group sparsity into TV model for CT image reconstruction from the projection data with low SNR. On one hand, this model extends total variation that aims to include group sparsity characteristics of the derivatives of the reconstructed image. It supposes that the derivatives of the reconstructed image are not only sparse, but also exhibit a simple form of structured sparsity. On the other hand, this model utilizes the Gaussian kernel to differentiate the importance of each point in every group. The Gaussian kernel is a kind of smooth functions, which is easy to deal with and implement numerically. The closer to the center of the group, the greater the weight value is. Using this kernel, not only the case of a single point is considered, but also the local characteristics are considered. Numerical simulated experiments showed that GOGS-TV has more ability to alleviate the stair artifacts and preserve the edges of the reconstructed image, compared with SART and TV.
The rest of the paper is organized as follows. In section 2, it will briefly review the fundamental model of image reconstruction, and then propose the novel model for low SNR CT. Consequently, the corresponding algorithm is given to address the presented model. In section 3, it demonstrates several numerical simulated experiments for image reconstruction and verifies the effectiveness of our algorithm, as compared to the classical FBP, SART and the state-of-the-art TV algorithm. Finally, conclusions are made in section 4.
Model
Generally, from a mathematical point of view, the image reconstruction problem can be formulated as:
where x denotes the discrete attenuation coefficients of the object to be reconstructed; A is the system matrix; e represents the noise; b is the measurement of the calibrated and log-transformed projection data. Generally, the problem (1) can obtain a solution by solving the following least square problem:
It can perform well by iterative algorithms, such as ART and SART. SART can suppress the noise when the projection data are complete and contains low-level noise, whereas it will lead to obvious artifacts in reconstructed images when projection data are incomplete or contains the high-level noise. In this situation, the solution generated from SART will not converge to the correct solution.
The regularization methods offer an effective way to surmount the instability of the solution of an ill-posed problem. Nowadays, TV regularization [10] as a state-of-the-art representative can be expressed as:
where ∥Tx ∥ 1 denotes the total variation of the image x, xi,j is the intensity of the image x at the position (i,j), λ is a parameter balancing the fidelity term and regularization term,T denotes the gradient transform and (Tx) i,j = (xi+1,j - xi,j+1 - xi,j). The model (3) puts emphasis on the global characteristics of the transformed image, and its constraint according to the image is posed on the whole. TV has ability to deal with several reconstruction problems. However, if the projection data contain the large noise and the object to be reconstructed is relatively complex, TV may lead to stair artifacts and obscure the edges.
For the sake of addressing low SNR reconstruction, this paper proposes a novel model (4) that aims to include group sparsity characteristics of the gradient of the reconstructed image. The presented model combines the Gaussian kernel and overlapping group sparsity into TV model (3), which can be noted as GOGS-TV. The presented model is expressed as follows:
where the symbols of the presented model (4) can be explained as follows: I denotes the group size and Ω denotes the whole coordinate positions of the image x;
In this section, it will briefly introduce the methods to address the models (2) and (3), respectively, then dilate on the algorithm to deal with the proposed model (4). To address the model (2), SART [8] is utilized. In this situation, let the image x be arranged into one vector. The system matrix A belongs to the real space R
M
×N, i.e. A is M×N matrix and ai,j denotes the ith row and jth column element of the matrix A. Likewise, the projection data b belongs to R
M
and the image x belongs to R
N
. The (n + 1)th iteration is expressed as follows:
where ϖ denotes a relaxation parameter in (0,2),
For convenience, we let x(n+1) = SART (x n ) instead of the equation (5).
For the sake of solving the model (3), SART is first utilized to deal with the fidelity term ∥Ax - b ∥ 2 ⩽ ɛ, and that is
Finally, the gradient descent method is used to obtain the reconstructed result. The (n + 1)th iterative result can be formulated as:
Before addressing the model (4), let’s analyze the various situations of (4). Its three main forms can be represented as follows: If If If there exists k1 and k2 such that G
k
1
∩ G
k
2
≠ Φ and ωi,j (p
k
) = e-d2(p
k
,pi,j)/-h2, then it denotes the weighted and overlapping group sparsity modality.
In this study, it mainly deals with the third case. Just like TV algorithm, SART is used to address the fidelity term, i.e.
To summarize above the procedures of our method, let N iter be the maximum iteration number and NGOGS - TV be the maximum iteration number of GOGS-TV minimization step, and the pseudo code of the GOGS-TV method is exhibited in the following:
1) Compute
2) Non-negative constraint:
In this section, simulated experiments for low SNR reconstruction are implemented to validate and evaluate the effectiveness of GOGS-TV approach. Then the results using the proposed approach are compared with the classical SART and state-of-the-art TV algorithm which has been widely used for several reconstruction problems. All the experiments are implemented on 2.40 GHz intel(R) Core (TM) i5-4210U CPU processor with 4G memory.
Quantitative assessments
In the experiments, it conducts visual inspection for reconstructed images using the proposed approach, FBP [1], SART [8] and TV [11], respectively. To exhibit the details and edges in the reconstructed images, the results magnify certain selected regions of the reconstructed images. Additionally, quantitative assessments are utilized to characterize the reconstruction quality by considering the root mean squared error (RMSE) [25], peak signal to noise ratio (PSNR) [26] and structural similarity index (SSIM) [27] as follows.
Where x denotes the reconstructed image; x r is the reference image; M×N is the total number of the image pixels; μ x and μ x r are the mean values of x and x r ; σ x and σ x r are the standard deviations of x and x r ; Cov{x,x r } denotes the covariance between x and x r ; c1 = (0.01R)2 and c2 = (0.03R)2, R is the dynamic range of pixel values. The value of SSIM is closer to 1, then the similarity between the reference image and the reconstructed image is higher.
Simulated scanning parameters and parameter selections for the methods
A chest phantom [27] is used to test the proposed approach. The simulated geometrical scanning parameters for low SNR CT are listed in Table 1. this paper adds the Gaussian noise to the projection data such that it can verify the effectiveness of the proposed approach, where the average value of the Gaussian noise is zero, and its standard deviation is 1% and 2% maximum projection in current view (denoted as 1% max and 2% max). The experiments take the complete range 0°∼360° into consideration, and the iterations are all set to 50, i.s. N iter = 50. The relative error ε0 is set to 10–6.
Geometrical scanning parameters for simulated CT imaging system
Geometrical scanning parameters for simulated CT imaging system
In the simulated experiments, the parameters are chosen by trial and error for better image quality. For different degrees of low SNR projection data and the methods, the corresponding parameters are selected differently. However, the parameter selections are in connection with the noise level of the projection data. The specific choice in this paper is shown below.
For SART [8], the relaxation parameter ϖ is both set to 0.05 for the two simulated situations. In general, ϖ is set to 1, but for the low SNR situation, it will lead to high noise in the reconstructed results. For TV algorithm [11], there are three main parameters to be confirmed which are the relaxation parameter ϖ, the regularization parameter λ and the iteration number of TV minimization step (N TV ). For both simulated situations, N TV = 20 and ϖ = 0.1. Separately, λ is set to 0.1 for the 1% max, and λ is set to 0.2 for the 1% max.
For GOGS-TV algorithm, five parameters are needed to be confirmed, which are the size of the group I, the smoothing parameter h, the relaxation parameter ϖ, the regularization parameter λ and the iteration number of GOGS-TV minimization step (NGOGS - TV). For both simulated experiments, there are four cases: I = 3×3, h = 0.4; I = 3×3, h = 0.5; I = 5×5, h = 0.4; I = 5×5, h = 0.5. In these situations, ϖ = 0.1 and NGOGS - TV = 20 for all situations, meanwhile, λ = 0.05 for the 1% max and λ = 0.09 for the 2% max.
Figure 3 (a) shows the reference image. The 1st and 2nd red rectangles of Fig. 3 (a) are to be locally zoom-in regions such that it can further confirm the effectiveness of the proposed approach GOGS-TV. The rest of Fig. 3 are the results reconstructed using FBP, SART, TV and GOGS-TV from projection data with added 1% max Gaussian noise. It can be observed that although Fig. 3 (b) and (c) has a good contrast, but it contains a lot of noise. Thus, sometimes it may affect some small details. Figure 3 (d) shows the result using TV algorithm, which inhibits the noise very well, but simultaneously it does not preserve the edges and details very well.

(a) is the reference image and its red rectangles stand for the locally zoom-in positions. (b), (c) and (d) are the results reconstructed using FBP, SART and TV from the 1% max projection data, respectively. (e1), (e2), (e3) and (e4) represent the results using GOGS-TV with different parameters from the 1% max projection data. (e1): I = 3×3, h = 0.4; (e2): I = 3×3, h = 0.5; (e3): I = 5×5, h = 0.4; (e4): I = 5×5, h = 0.5. Grayscale window is [0,255].
From Fig. 3 (e1) to Fig. 3 (e4), they are reconstructed by GOGS-TV with the different parameters. In vision, they all have ability to suppress the noise and preserve the edges except for some tiny places. To better demonstrate the details, Fig. 4 and Fig. 5 exhibit the local zoom-in images of Fig. 3. It can be found that the results using GOGS-TV have better ability to suppress the noise while preserving the edges, compared with the classical SART and the state-of-the-art TV algorithm. For better distinguishing the reconstruction effects, the quantitative assessments are implemented as shown in Table 2. From Table 2, it is seen that for the 1% max situation GOGS-TV (I = 5×5, h = 0.4) can obtain the best assessments.

It shows the locally zoom-in images corresponding to the 1st red remark of Fig. 3 (a). Grayscale window is [0,255].

It shows the locally zoom-in images corresponding to the 2nd red remark of Fig. 3 (a). Grayscale window is [0,255].
Quantitative assessments for the 1% max situation
In order to better verify the effectiveness of GOGS-TV for addressing low SNR, this paper added higher level noise to the projection data. Figure 6 (a) shows the reference image. The 1st and 2nd red rectangles of Fig. 6 (a) are to be locally zoom-in regions which can further confirm the superiority of GOGS-TV. The rest of Fig. 6 are the results reconstructed using FBP, SART, TV and GOGS-TV from projection data with added 2% max Gaussian noise. From Fig. 6, it can be observed that Fig. 6 (a) and (b) represents the result using SART and FBP respectively, which contains so much noise that its edges and details are distorted. In comparison, TV algorithm can obtain a better result as shown in Fig. 6 (d). However, due to the projection data with the high-level noise, TV algorithm brings about the stair artifacts and makes the edges fuzzy.
Compared to SART and TV algorithm, GOGS-TV has more advantages to deal with the low SNR projection data. From Fig. 6 (e1) to (e4), it can be observed that they have the better ability to preserve the edges and suppress the noise, especially Fig. 6. (e4). Table 3 shows that the result using GOGS-TV (I = 5×5, h = 0.5) can obtain the best quantitative assessments.

(a) is the reference image and its red rectangles stand for the locally zoom-in positions. (b), (c) and (d) are the results reconstructed using FBP, SART and TV from the 2% max projection data, respectively. (e1), (e2), (e3) and (e4) represent the results using GOGS-TV with different parameters from the 2% max projection data. (e1): I = 3×3, h = 0.4; (e2): I = 3×3, h = 0.5; (e3): I = 5×5, h = 0.4; (e4): I = 5×5, h = 0.5. Grayscale window is [0,255].
Quantitative assessments for the 2% max situation
In order to verify the effectiveness of the presented approach for the few-view and low-dose CT reconstruction, this paper also utilizes the Chest phantom. The geometrical scanning parameters for simulated CT imaging system is as shown in Table 1. There are three experiments with 20, 30, 40 projection views to be done. The projection data is added by 1% max Gaussian noise. The total number of iterations N itet is set to 500.
In these experiments, the parameters are chosen by trial and error for better image quality. For different projection views, the corresponding parameters are selected differently. For SART, the relaxation parameter ϖ is both set to 0.5 for the three simulated situations. For TV algorithm, there are three main parameters to be confirmed which are the relaxation parameter ϖ, the regularization parameter λ and the iteration number of TV minimization step (N TV ). For both simulated situations, N TV = 20 and ϖ = 0.5. λ is set to 0.8, 1 and 1.1 for 20, 30, 40 projection views separately.
For GOGS-TV algorithm, I = 3×3, h = 0.4; I = 3×3, h = 0.5; I = 5×5, h = 0.4; I = 5×5, h = 0.5;. In these situations, ϖ = 0.5 and NGOGS - TV = 20. Meanwhile, λ is set to 0.78, 0.96 and 1.08 for 20, 30 and 40 projection views, respectively. The results can be shown in Fig. 9, Fig. 10 and Fig. 11. It can be shown that the results reconstructed by FBP and SART contain a lot of noise and the details and edges are distorted; meanwhile, TV can work better than FBP and SART, but they have stair artifacts and some details has been distorted. The reason is that firstly the Chest phantom has a few details and the grayscale of each field is similar; secondly, the obtained projection data contains a high level of noise. Then, when dealing with the noise, the details are distorted. On the contrary, when preserving the details, the noise will affect the reconstructed results. Compared with previous algorithms, GOGS-TV has the ability of address the low-dose with few-view projection data in terms of quantitative indicators as shown in Table 4. But it’s worth noting that the reconstructed results gradually become worse as the few-view number decreasing. The numerical assessments show that the results by GOGS-TV with I = 3×3, h = 0.4; can get the best RMSE, PSNR and SSIM.

It shows the locally zoom-in images corresponding to the 1st red remark of Fig. 6 (a), respectively. Grayscale window is [0,255].

It shows the locally zoom-in images corresponding to the 2nd red remark of Fig. 6 (a), respectively. Grayscale window is [0,255].

It shows the results reconstructed from 40 projection views. (a) is the reference image. (b), (c) and (d) are the results reconstructed using FBP, SART and TV from 40 projection views with added the 1% max Gaussian noise, respectively. (e1), (e2), (e3) and (e4) represent the results using GOGS-TV with different parameters from the 1% max projection data. (e1): I = 3×3, h = 0.4; (e2): I = 3×3, h = 0.5; (e3): I = 5×5, h = 0.4; (e4): I = 5×5, h = 0.5. Grayscale window is [0,255].

It shows the results reconstructed from 30 projection views. (a) is the reference image. (b), (c) and (d) are the results reconstructed using FBP, SART and TV from 30 projection views with added the 1% max Gaussian noise, respectively. (e1), (e2), (e3) and (e4) represent the results using GOGS-TV with different parameters from the 1% max projection data. (e1): I = 3×3, h = 0.4; (e2): I = 3×3, h = 0.5; (e3): I = 5×5, h = 0.4; (e4): I = 5×5, h = 0.5. Grayscale window is [0,255].
Quantitative assessments for few-view and low-dose Chest phantom
To further confirm the effectiveness and accuracy of the presented approach for low-dose CT reconstruction. Some details of the Walnut are difficult to reconstruct, which includes a dense, layered shell. The Walnut is acquired from a CT system with tube current 200μA and tube voltage 80 kv. Its projection data and geometrical parameters can be obtained at reference [28]. The scanning range [0,181°] is investigated, and the corresponding projection views are 604. The size of reconstruction image is 656×656.
Similarly, the parameter selections are by trial and error. For SART, the relaxation parameter ϖ is both set to 0.2 for the real Walnut. For TV algorithm, there are three main parameters to be confirmed which are the relaxation parameter ϖ = 1, the regularization parameter λ = 0.2 and the iteration number of TV minimization step (N TV = 20). For GOGS-TV algorithm, five parameters are needed to be confirmed, which are the size of the group I, the smoothing parameter h, the relaxation parameter ϖ, the regularization parameter λ and the iteration number of GOGS-TV minimization step (NGOGS - TV). For the Walnut projection data, there are four cases: I = 3×3, h = 0.4; I = 3×3, h = 0.5; I = 5×5, h = 0.4; I = 5×5, h = 0.5;. In these situations, ϖ = 0.2, λ = 0.0001 and NGOGS - TV = 20.
From the reconstructed results in Fig. 12, FBP can reconstruct the relatively good details but include the noise. It shows that SART, TV and GOGS-TV can obtain the good results. From some small details (such as the center of the Walnut), GOGS-TV can address that better.

It shows the results reconstructed from 20 projection views. (a) is the reference image. (b), (c) and (d) are the results reconstructed using FBP, SART and TV from 20 projection views with added the 1% max Gaussian noise, respectively. (e1), (e2), (e3) and (e4) represent the results using GOGS-TV with different parameters from the 1% max projection data. (e1): I = 3×3, h = 0.4; (e2): I = 3×3, h = 0.5; (e3): I = 5×5, h = 0.4; (e4): I = 5×5, h = 0.5. Grayscale window is [0,255].

(a) is the result reconstructed by FBP. (b) and (c) are the results reconstructed using SART and TV from the real Walnut projection data, respectively. (d1), (d2), (d3) and (d4) represent the results using GOGS-TV with different parameters. (d1): I = 3×3, h = 0.4; (d2): I = 3×3, h = 0.5; (d3): I = 5×5, h = 0.4; (d4): I = 5×5, h = 0.5. Grayscale window is [0,1].
In recent decades, regularization based iterative image reconstruction algorithms have been widely developed and demonstrated their capable of addressing many different imaging problems. TV as one of the most powerful tools has been extensively utilized in low SNR situation. However, TV yields the undesirable stair artifacts when the projection data contains the high-level noise, since it puts emphasis on the global characteristics of the gradient image and it constrains the whole image.
In order to capture the shortcomings of the TV algorithm, the idea of group sparsity is utilized in this paper. This not only considers the sparsity of the gradient image, but also include the structured sparsity. Meanwhile, the Gaussian kernel is utilized to be the weighted factor, which can be adapted to the global distribution. The simulated experiments verify the superiority of the proposed approach GOGS-TV, compared to the classical FBP, SART and state-of-the-art TV algorithm. Furthermore, quantitative assessments similarly confirm this point. As adding the high-level noise, GOGS-TV still owns the merits of suppressing the noise and preserving the edges. However, for the different noise levels, the parameters of GOGS-TV need to be redetermined. In the future, the adaptivity of the parameters of the approach will be considered.
Footnotes
Acknowledgments
This work is supported by the Science and Technology Research Program of Chongqing Municipal Education Commission (Grant No. KJQN2019013), the Scientific Research Foundation of Chongqing University of Arts and Sciences (Grant No. R2019FSC17), the Natural Science Foundation of Chongqing Municipal Science and Technology Commission (Grant numbers: cstc2019jcyj-msxmX0012, cstc2020jcyj-msxm2352), and the Open Project of Key Laboratory No.CSSXKFKTQ202004, School of Mathematical Sciences, Chongqing Normal University.
