Abstract
Background
Total generalized variation (TGV) based CT iterative reconstruction algorithm has the ability to effectively suppress the staircase effects caused by the piecewise constant assumption of total variation regularization. By unrolling the model-based iterative reconstruction to networks, the deep unrolling approach can further improve image quality within a finite number of iterations by data-driven training. However, most deep unrolling approaches focus on unrolling the data fidelity term into deep neural networks, which limit the performance of the deep unrolling approach.
Objective
To address this issue, we unrolled both the data fidelity term and the TGV term to construct a novel low-dose CT reconstruction network, called TGV based deep unrolling approach (TGV-DU).
Methods
The Chambolle-Pock algorithm was employed to solve the TGV based CT iterative reconstruction problem to obtain a single-loop CT iterative reconstruction algorithm, which is easy to be unrolled to neural networks. In the proposed algorithm, the parameterized mapping that updates primal variables and dual variables across successive iterations was implemented by convolutional neural networks and was dynamically learned from big data.
Results
To validate the effectiveness of our proposed algorithm, we perform the experiment on the “Low-Does CT Image and Projection Data” dataset. The results show that the proposed TGV-DU outperforms other state-of-the-art methods quantitatively and qualitatively.
Conclusions
Experiments show that our proposed algorithm can effectively alleviate the piecewise smoothness while preserve more structural details.
Introduction
Computed tomography (CT) is a non-invasive diagnostic technique widely used in medical diagnostics. However, its use of ionizing radiation has raised significant concerns due to the associated carcinogenic risk, particularly from radiation exposure. 1 In view of these considerations, X-ray exposure doses should be minimized during CT scanning to ensure safety. This may lead to poor image quality during reconstruction, loss of tissue structural details, and the presence of excessive noise and artifacts. 2 Ultimately, these factors may trigger misdiagnosis. Thus, reducing the CT radiation dose while ensuring the acquisition of high-quality images that meet clinical diagnostic requirements has always been an important research focus and challenge in the field of CT. Numerous CT reconstruction algorithms are proposed to enhance image quality for low-dose computed tomography (LDCT). 3
Traditional CT reconstruction algorithms are categorized into analytical and model-based iterative reconstruction (IR) methods. Filtered back-projection (FBP) is a classical analytical reconstruction algorithm, but as the decreasing of dose and angle, the reconstructed image shows a lot of noise and artifacts. Model-based IR has the potential to reduce the radiation dose while maintaining the quality of the image. 4 In model-based IR algorithms, the objective function consists of a data fidelity term and a regularization term. The data fidelity term is utilized to quantify the consistency between the reconstructed image and the actual measured projection data, to ensure that the reconstructed image accurately reflects the real projection data. The regularization term is employed to incorporate a priori knowledge and impose constraints on the smoothness and structural information of the reconstructed image, thereby yielding better reconstructed image quality. Total variation (TV) regularization, a widely adopted technique in CT reconstruction, effectively preserves edge structures by imposing gradient sparsity constraints while suppressing noise. Due to the piecewise constant assumption there are usually staircase effects in the reconstructed image. To improve this drawback, the second-order total generalized variation (TGV) was adopted as the regularization term of the objective function. However, model-based IR methods typically necessitate laborious hyperparameter turning and high computational demands due to their iterative optimization frameworks. 5
With the rise of artificial intelligence (AI), deep learning techniques are widely applied in the fields of computer vision, image segmentation and medical image reconstruction. Over the past few years, many researchers have been dedicated to the development of deep learning-based LDCT methods. These methods can be roughly categorized into four classes. The first class of methods recovers the corrupted projection data by filtering the projection data in the projection domain through neural networks.6,7,8 The second class of methods employs various post-processing networks to directly enhance the reconstructed images for removing noise and artifacts.9,10,11,12 The third class utilizes two neural networks on dual domain to learn the sinogram and CT image information respectively.13,14,15,16 The fourth class of methods unrolls the specific model-based IR methods into networks which effectively enhances the performance of CT reconstruction networks. Chen et al. unrolled the steepest gradient descent method to construct a sparse-data CT reconstruction network. 17 In this algorithm, the regularization term of the objective function was directly replaced by the learning experts. To further improve the feature extraction capability of this unrolling network, Xia et al. adopted a graph convolution to extract the features of the low-dimensional manifold for LDCT. 18 Adler and Öktem proposed a learned primal-dual algorithm for tomographic reconstruction, generalizing the Chambolle-Pock (CP) algorithm by replacing the proximal operators with parametrized operators where the parameters are learned from training data. 19 Zhang et al. unrolled ISTA into deep network and developed an effective strategy to solve the proximal mapping using nonlinear transforms. 20 Xiang et al. employed fast iterative shrinkage/thresholding algorithm (FISTA) to solve the inverse problem and unrolled the obtained iterative steps into a deep neural network. In this algorithm, it alternately minimized the differentiable part using the gradient information and the non-differentiable part using a proximal operator represented by a learned network. 21 Wang et al. unrolled the alternating direction method of multipliers (ADMM) to construct the architecture of the unrolling network. 22 In this network, the regularization term and the regularization parameters were adaptively obtained from the training data with the architecture of U-Net. He et al. similarly unrolled the ADMM-based iterative reconstruction into a CT reconstruction network. 23 This method took both the projection and image domains into account and utilized residual neural networks to directly replace the regularization term in the projection and image domains respectively. Pan et al. employed the block-coordinate descent method to unroll the objective function based in residual measurements. 40 This unrolling network integrated neural network prior as a regularizer which was undertaken by the convolution-assisted transformer subnetworks. Komolafe et al. utilized the gradient descent method to solve the Field of Expert based objective function. 41 The regularization term is implemented through a cascaded convolutional and deconvolutional network similar to a classic CNN structure. All of these methods only acknowledged the significance of the data consistency in the unrolling networks, but ignored to explore the prior information which plays a crucial role in the iterative algorithm. Without unfolding the regularization term in the objective equation and directly replacing it with a neural network, deep unrolling networks suffer from a limited performance.
To address the aforementioned limitations of deep unrolling approaches, we constructed the CT reconstruction network by unrolling both the data fidelity term and regularization term. Zhang et al. proposed a deep unrolling algorithm by enabling the direct unrolling of TV prior terms to convolutional neural networks (CNNs).
24
To further eliminate the undesired staircase effect caused by TV based models, we utilize the second-order TGV as the regularization term to formulate the objective function for the CT reconstruction problem. The CP algorithm was employed to solve the TGV based CT reconstruction problem to obtain a single-loop CT iterative reconstruction algorithm, which is easy to be unrolled to neural networks. Leveraging the remarkable learning capacity of CNNs, this framework achieves high-fidelity reconstructions by adaptively optimizing parameter spaces during the process of solving inverse problem. The main contributions of our work are as follows.
In view of the staircase effect caused by TV regularization, we adopted the second-order TGV as regularization term of the objective function by incorporating higher-order derivatives. Due to the strong convergence, we employed the CP algorithm to solve above objective function to obtain a single-loop algorithm which is easy to be unrolled to networks. Different from the other data-driven deep learning-based algorithm, our proposed backbone network architecture, combining model-driven and data-driven approaches, is built by unrolling the iterative algorithm into deep neural networks which retains interpretability of physical models and optimizes the process of reconstruction by outstanding non-linear feature extraction capability of CNNs. Distinguished from the other deep unrolling approach, TGV-DU took both the data fidelity term and prior information into account when unrolled a model-based IR algorithm into networks rather than replaced the prior term with a CNN directly.
Method
TGV-based iterative algorithm
TV-based reconstruction model has been widely adopted for its edge-preserving properties. However, the assumption of segmentation constants for TV minimization usually leads to significant staircase effect in the reconstructed images.
25
In order to eliminate the staircase effect, high-order partial differential equation have been applied to image reconstruction.
26
Bredies et al. proposed the concept of the TGV which provides a balance between the first and second derivative of a function. The second-order TGV is defined as
27
:
The objective function of the iterative model based on second-order TGV can be written as
28
:
The above minimization problem can be seen as an optimization problem solved by CP algorithm due to its convergence guarantee. The CP algorithm is used to solve a general form of the primal minimization problems in the following.
In the TGV-based minimization problem,
The next step is to derive the dual problem of the primal problem by computing the convex conjugate function of F and G, defined in (11). Based on the TGV prior, the convex function F and G and their convex conjugation which play a vital role to update the values of variables in the CP algorithm are defined as follows.
Subsequently, the above minimization problem can be a special instance of CP algorithm
29
and the primal and dual problems are connected in the following saddle point optimization:
The key to solving (13) lies in determining the proximal mapping of the above variables to update the variables. The definition of the proximal mapping is as follows:
In TGV-based problem, we computed the proximal mapping of the primal and dual variables to obtain the iterative reconstruction algorithm.
We further calculated the above formula and listed the specific steps in Algorithm 1.
It can be seen that there are numerous parameters in the above iterative algorithm which necessitate much labors and high computations to select a best value. Inspired by the outstanding learning ability of networks, this single-loop iterative algorithm is easy to be unrolled into a network where train these parameters from big data automatically.
LDCT reconstruction network based on second-order TGV unrolling
We constructed the architecture of TGV-DU based on the iterative algorithm in Algorithm 1. The overall structure of our proposed TGV-DU network is illustrated in Figure 1. In this network, each block of the network corresponds to one iteration of the Algorithm 1 which is illustrated in Figure 1(a). Each block is composed of multiple subnetworks which keep the mapping relation between variables but generalize derived operations to have learnable parameters. The subnetworks are demonstrated in Figure 1(b). Inspired by the powerful representation
30
and the universal approximation property
31
of CNN, we adopted a combination of three linear convolution operators with bias terms separated by batch normalization (BN) layers and nonlinear activation layers which construct the architecture of subnetworks with purpose to learn parameters. The pixel size and number of channels of each convolution are shown in Figure 1(c). The update steps between Line 4 and Line 10 are generalized to

Illustration of our proposed TGV-DU. (a) the overall framework of N iterations (b) the detailed structure of the n-th iteration block (c) the structure of CNN in (b).

Fixed convolutional kernel, (a), (b): the convolution kernel of operation
In conclusion, seven kinds of CNNs were proposed to implement the proximal operators of the vectors
Our proposed TGV-DU is an unrolling approach with 10 iterations which incorporates both the data consistency and prior information of an image by combining model-driven and data-driven approaches. Each iteration block is composed of seven subnetworks which has the same structure. The structure of CNNs is employed to enhance the feature extraction capabilities of the network. The sinogram data g is inputted into the network. There are three outputs
Datasets
Mayo dataset
In this paper, we mainly utilized the public dataset “Low-Does CT Image and Projection Data”
34
to verify the effectiveness of our proposed TGV-DU network. The chest CT scan images of 14 patients were selected as the training set and the chest CT scan images of 1 patient were selected as the test set. Since the CT scan images of this dataset were obtained by spiral scanning, we performed the projection of the obtained CT images with fan-beam physical geometry to obtain the CT projection data under fan-beam geometry. In this case, the distance from the ray source to the center of rotation is set to 595 mm, the distance from the ray source to the center of the detector is 1068 mm. The detector is a line-array detector having 768 detectors with a spacing of 1 mm and the ray source is scanned around the patient doing 360 equally spaced scans at an angle with a spacing of 1°, obtaining the size of the sinogram data of 768*360. The size of the reconstructed image of 512*512 with the size of each pixel size is set to 0.5859*0.5859. Since it is difficult to obtain both normal-dose and low-dose scanning images and sinogram data, we simulated the low-dose sinogram data from chosen CT images by fan-beam scanning. Adding Poisson noise and electronic noise to the noise-free sinogram data obtained from the above fan-beam scanning can better simulate the clinical environment and generate the low-dose sinogram data under different X-ray incident intensity with the formula
18
:
Piglet dataset
To further provide the robustness and practical viability of the proposed unrolling network, we executed the supplementary experiments on a publicly accessible Piglet Dataset. 42 This dataset comprises authentic porcine CT scans acquired with a GE Discovery CT750 HD scanner, setting the tube voltage to 100kVp and a slice thickness to 0.625 mm. Normal dose reference images were obtained at a tube current of 300 mAs. Corresponding LDCT images were generated by reducing the tube current to 50%, 25%, 10%, 5% of the normal dose. For model training, image pairs consisting solely of LDCT images obtained at 150mAs (50% dose) and corresponding normal dose CT images at 300mAs were utilized. The training set comprised 525 randomly selected image pairs. The trained model was subsequently tested on test sets encompassing all four doses (50%, 25%, 10%, 5%), with each test set containing 40 image pairs.
Implementation details
We performed the training and testing of the deep network within the PyTorch framework,
35
using the Adam optimization
36
algorithm for gradient descent optimization of the loss function. The initial learning rate of this network was set to
Results
Results on mayo dataset
Comparison of results at 15% dose
Two representative slices from the testing dataset are selected for visual comparisons of different DL-based methods in Figure 3 at 15% dose. It can be observed that the result of FBPConv is affected by streak-like artifacts which degrade the quality of images. In the results of LPD, there are a certain degree of improvement in image quality compared with the FBPConv, but there are still deficiencies in the accurate restoration of the subtle structures which are vital to diagnosis. Although the results of FISTA show certain noise suppression and texture preservation capabilities, there seem to be some additional structures which are nonexistent in the reference images. The DRCR algorithm has improved the effect of the artifact removal and detail restoration. But there are some oversmoothed effects in the results of DRCR. While the iterative reconstruction-based PWLS-SRDTGV method preserves significant structural details, it exhibits less effective denoising performance compared to deep learning approaches. Conversely, MBIR achieves near-optimal noise suppression for low-dose conditions but yields inaccurate tissue details within ROI regions—particularly at locations indicated by red arrows. As can be seen in Figure 3, the results show that our proposed TGV-DU not only alleviates the effect of noise and artifacts but also keeps richer subtle details. To further evaluate the effectiveness of our proposed TGV-DU, Table 1 illustrates the PSNR, SSIM and RMSE metrics of different methods from the first slice. The bold values in the tables represent the best qualitative indicator values. From Table 1, it is not difficult to see that our proposed method achieves the best performance in metrics which is consistent with the visual inspection in Figure 3. Different from other images, the medical images pay great attention to details which may contain medical diagnostic information. To further demonstrate the detail-preserving capability of our proposed TGV-DU, the magnified regions of interest (ROIs) marked by red rectangles in the first row are shown in the second row. As indicated by red arrows, it can be observed that TGV-DU can better reconstruct the subtle details compared with other methods where are slightly blurred. To quantify the detail preservation capability of different methods in ROI, a horizontal profile, located at the 120th row and 20th to 50th columns in the second row of Figure 3, is plotted in Figure 4. As demonstrated by the results, TGV-DU exhibits superior consistency relative to the reference image compared to other approaches.

Results of different algorithms at 15% dose. The ROIs, marked with red rectangular boxes in the first and third rows of NDCT, are shown as magnified views below each corresponding image (the second and forth row respectively). The display window is [-160,240]HU.

Intensity distribution of different algorithms at 15% dose.
Comparison of metrics for different algorithms at 15% dose in Figure 3.
Comparison of results at 10% dose
Figure 5 demonstrates two representative images for qualitative comparison in the first and third rows at 10% dose. As the number of incident photons of X-rays decreases, the results of FBPConv become even worse. Although some improvement has been achieved for noise suppression, the details in the results of LPD, FISTA and DRCR are blurred to varying degrees. Compared to deep learning methods, the iterative PWLS-SRDTGV reconstruction preserves structural detail effectively but achieves weaker denoising effect. Conversely, MBIR provides near-optimal noise suppression under low-dose conditions yet compromises the fidelity of tissue details within ROIs. To further verify the detail preservation capability of our proposed TGV-DU, the ROIs labeled by the red rectangles in the first and third rows of Figure 5 are magnified in the second and forth rows of Figure 5. As indicated by red arrows in zoomed regions, it can be observed that the results of FBPConv, LPD and FISTA cannot suppress the noise and artifacts in certain degree. DRCR has achieved some improvement in noise suppression, but it seems to erode the details indicated by red arrows due to the oversmoothed effects. Our proposed TGV-DU seems to achieve a better balance between the artifact removal and detail preservation in visual effect. To further illustrate the effectiveness of the proposed algorithm, Table 2 lists the quantitative results of metrics on the first slice demonstrated in Figure 5. TGV-DU scores the highest PSNR, SSIM and lowest RMSE compared with other methods which confirm the superiority of the proposed approach. To further inspect the detail preservation ability of different methods, a horizontal profile, located at the 126th row and 55th to 100th columns in the second row of Figure 5, is plotted in Figure 6. Among all approaches, TGV-DU is the most closely aligned with the reference curve.

Results of different algorithms at 10% dose. The ROIs, marked with red rectangular boxes in the first and third rows of NDCT, are shown as magnified views below each corresponding image (the second and forth row respectively). The display window is [-160,240]HU.

Intensity distribution of different algorithms at 10% dose.
Comparison of metrics for different algorithms at 10% dose in Figure 5.
Comparison of results at 5% dose
As the number of incident photons decreases further, the results of different approaches at 5% dose are demonstrated in Figure 7. Due to the server noise and artifacts, the results of FBPConv, LPD, FISTA and DRCR become even worse. The quality of the image is drastically reduced which is difficult to obtain useful information. When incident photons drop to ultra-low dose level (5% of standard), PWLS iterative reconstruction suffers from severely elevated noise that critically degrades image quality. While MBIR effectively suppresses this noise, it eliminates essential structural details within ROIs, particularly at the regions marked by blue arrows. The results of TGV-DU achieve better performance in terms of noise suppression and detail preservation. To further demonstrate the effectiveness of TGV-DU, the zoomed regions are displayed in the second and forth rows of Figure 7 marked by blue rectangles in the first and third rows of Figure 7. Two selected structures of vessel are indicated by blue arrows in zoomed region. It can be observed that these structures almost disappear in the results of FBPConv, LPD, FISTA and DRCR. But they are well reconstructed by TGV-DU and have the closest structures of the reference images. Table 3 demonstrates the quantitative analysis with respect to three chosen metrics of the first slice. It is obvious that TGV-DU achieves the highest results of both PSNR and SSIM, lowest result of RMSE, which illustrates the effectiveness of our proposed approach. To further verify the detail preservation capability of different approaches, a horizontal profile, located at 29th row and 20th to 60th columns in the second row of Figure 7, is plotted in Figure 8. It is obvious that TGV-DU exhibits the smallest deviation from the reference curve, outperforming the other approaches.

Results of different algorithms at 5% dose. The ROIs, marked with red rectangular boxes in the first and third rows of NDCT, are shown as magnified views below each corresponding image (the second and forth row respectively). The display window is [-160,240]HU.

Intensity distribution of different algorithms at 5% dose.
Comparison of metrics for different algorithms at 5% dose in Figure 7.
Results on piglet dataset
To demonstrate the robustness of the proposed unrolling network, we conducted additional experiments on the piglet dataset. The network was trained exclusively at the 50% dose level, which was then utilized to reconstruct images from test data acquired at 50%, 25%, 10% and 5% dose levels. Comparative reconstruction results by different methods are presented in Figure 9. FBPConv and FISTA exhibit server noise and artifacts which significantly degrade the image quality. While LPD demonstrates effective noise suppression, it suffers from excessive smoothing leading to structural blurring and consequent anatomical distortion. DRCR achieves improved retention of fine structural details compared to LPD, yet perceptible residual noise and artifacts persist in its outputs. The proposed method addresses these limitations by effectively suppressing noise and artifacts while simultaneously preserving critical structural details to a greater extent than the benchmark approaches. This qualitative superiority is consistently demonstrated in representative regions of interest (ROIs), which clearly illustrate the enhanced capability of the proposed algorithm in maintaining diagnostic image integrity. Corroborating the visual assessment, quantitative analysis confirms the superiority of the proposed approach. It achieves significantly higher PSNR values coupled with substantially lower RMSE values compared to other methods. This quantitative advantage is shown in the upper left corner of Figure 9.

Result of different methods on the piglet dataset. The magnified ROI is displayed in the upper right corner. The metrics of the whole slice is illustrated in the upper left corner.
Effect of different network parameters
In this section, we conducted two experiments to analyze the values of the parameters involved in the training process on the Mayo dataset at 15% dose, including the size of the convolution kernel and the number of iterations. The convolutional kernel sizes were set to 3, 5 and 7. The number of iterations was set to6, 8 and 10. The results are shown in Table 4 and 5. As indicated in Table 4, the performance metrics exhibit a gradual decline with increasing convolutional kernel size. This degradation suggests a plausible explanation: the available training data may be insufficient to effectively optimize the substantially increased number of trainable parameters associated with larger kernels. Conversely, Table 5 demonstrates a consistent improvement in performance metrics as the number of iterations increases. This positive correlation indicates that additional iterations enhance the model's representational capacity and optimization efficacy within the constraints of the architecture. Based on these empirical observations, the network configuration employing a convolutional kernel size of 3 × 3 coupled with 10 iterations was selected as the optimal configuration for the proposed unrolling network.
The result comparisons of different convolutional kernel size.
The result comparisons of different iterations.
Reconstruction results with different loss functions
To demonstrate the effectiveness of the proposed loss function in this paper, we train the network with different loss functions at 15% dose, as shown in Table 6. One representative slice from testing dataset is chosen for visual comparison in Figure 10 (a)–(e). It can be seen that although the network training with the incomplete loss function achieves good reconstruction results, the proposed loss function shows more remarkable success in edge preservation and detail reconstruction. To further visualize the texture preservation capability, the difference images are shown in Figure 10 (f)–(i). It can be seen that the difference image of (d) and the label image has the least gray level difference region, and the recovery is better at the edges and details, which fully demonstrates that the reconstructed image of the normal case has a very high degree of similarity to the target image, and among all the loss function settings, the case has the best performance in terms of the accuracy and fidelity of image reconstruction. performance is optimal. Table 7 illustrates the values of PSNR, SSIM, and RMSE metrics with different loss function settings, which shows that the adopted loss function has higher PSNR and SSIM and lower RMSE.

Reconstructed and difference images with different loss functions. The first row's display window is [-160,240]HU. The second row's display window is [-30,30] HU after applying a thresholding of 40HU.
The definition of different loss function.
Comparison results of metrics with different loss functions.
Ablation studies
To validate the impact of different neural network modules on TGV-DU algorithm performance within the iterative unrolling framework, we conducted ablation studies using three distinct reconstruction networks (Net1, Net2, Net3). Table 8 lists the neural module configurations for each network, where ✗ indicates not to replace the variable optimization steps in the iterative reconstruction algorithm with a neural network module and ✓ denotes to replace the optimization steps with a neural network module. Performance was evaluated under 15% dose on the Mayo dataset. It can be observed that the visual quality of the reconstruction results continuously improves as the number of neural network modules increases from Figure 11. The values of PSNR and SSIM metrics of different networks are shown in Table 8. Consistent with visual observations, TGV-DU with full neural module replacement achieved optimal quantitative metrics. These ablation experiments confirm the efficacy of substituting optimization steps in iterative reconstruction algorithms with neural modules.
Ablation studies: quantitative comparison of different networks on the Mayo dataset at 15% dose.

Ablation studies: visual comparison of different networks on Mayo dataset at 15% dose. The display window is [-160,240]HU.
Discussion
In this work, we proposed a novel LDCT network by unrolling the second-order TGV based IR algorithm, dubbed TGV-DU. The second-order TGV-based IR algorithm has the potential to alleviate the staircase effect caused by TV regularization. The CP algorithm was employed to solve the objective function to obtain a single-loop iterative algorithm. Inspired by the outstanding non-linear feature extraction capability of CNNs, this single-loop iterative algorithm can be unrolled into network easily which effectively improve the quality of imaging. However, most unrolling approaches have potential limitations. The regularization term, as a critical component of objective function, exerts a decisive influence on noise suppression and detail preservation of the reconstructed images in IR algorithm. While, some unrolling approaches only unroll the data fidelity term but replace the regularization term with a neural network directly which heavily limits the performance of the network. Incorporating the prior information into the unrolling network can further improve the performance of CT reconstructions as demonstrated in Figure 3, 5 and 7.
Our previously proposed network PD-Net unrolled both the data fidelity term and TV term into network. To further alleviate the staircase effects caused by TV, we adopted the second-order TGV-based model and implemented the proximal mapping of both the data fidelity term and regularization term into learnable convolutional neural networks. One of the core advantages of TGV-DU is that TGV regularization term provides a way of balancing between the first and second derivative, significantly suppresses noise at low doses, and excels in detail preservation. Table 9 illustrates the quantitative results compared with PD-Net at 15% dose. It can be observed that TGV-DU achieves the better score in terms of PSNR and RMSE, PD-Net scores the higher SSIM metric. The proposed TGV-DU method exhibits opportunities for further refinement and enhancement.
Comparison with PD-Net.
Although we achieved better results on low-dose simulation datasets, validation on real LDCT datasets in a clinical setting remains a focus for future research. In particular, clinical data with more projection
amplitude and complexity will pose a higher challenge to the generalization ability of TGV -DU, so further clinical validation experiments are important for the feasibility of this method.
Conclusion
To make full use of the interpretability of physical models and the outstanding non-linear feature extraction capability of CNNs, we combine model-driven and data-driven approaches to constructed our proposed network architecture which unroll the TGV-based iterative algorithm into deep neural networks for LDCT reconstruction. The proposed network effectively suppresses low-dose noise while preserving critical edge details. Its three-layer CNN architecture hierarchically captures distinct feature representations, which are progressively refined and synthesized into the final high-fidelity output. The experimental results show that TGV-DU not only performs well in terms of visual effects, but also surpasses multiple the-state-of-art algorithms in terms of quantitative evaluation metrics, demonstrating a superior reconstruction performance. It is also a feasible strategy to set different regularization terms for various task scenarios in our future research.
Footnotes
Acknowledgements
The authors would like to thank the institution, the Mayo Clinic, for providing the data (Low-Dose CT Image and Projection Data) used in this study. This work was supported in part by the National Natural Science Foundation of China under Grant 62401517, in part by the Natural Science Foundation of Shanxi Province under Grant 202303021211148, 202203021222038, and 201901D211246, in part by the Research Project Supported by Shanxi Scholarship Council of China under Grant 2021-111 and in part by the Innovation Project for Graduate Student of Shanxi Province under Grant 2024KY582.
Funding
The authors received no financial support for the research, authorship, and/or publication of this article.
Declaration of Conflicting Interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
