Abstract
Keywords
Introduction
Computed tomography (CT) is an essential modality in medical imaging and its use has increased significantly in recent years [1–4]. Concern over stochastic radiation risk arising from the increased use of CT has led to several initiatives and developments targeting dose reduction [5–8]. One recent development has been the commercialization of iterative reconstruction algorithms. Iterative algorithms are being explored for dose reduction in CT while maintaining image quality. They complement already existing dose reduction techniques such as tube-current modulation and automatic kV selection [2, 9].
Generally speaking, iterative algorithms use models of data acquisition and noise statistics to improve the tradeoff between image quality and dose. Filtered back projection (FBP), on the other hand, assumes idealized data acquisition and does not account for data statistics, leading to amplification of image noise [10–13]. Iterative algorithms require significantly more computational resources compared to FBP [4]. With the availability of fast parallel processing such as that performed by graphic processor units, CT iterative algorithms have become a clinical reality.
General Electric (GE, Milwaukee, WI, USA) introduced the first FDA-approved clinical iterative method in 2008, called adaptive statistical iterative reconstruction (ASIR™). ASIR™ utilized a physical model of the scanner’s geometry. The final image could be a blend of iterative and FBP reconstructions, as determined by the user [14]. The amount of blend with FBP, determined the ‘strength’ of ASIR™ used in the reconstruction, usually referred to as percentages. GE later introduced a full model-based iterative method, VEO™, which employed a comprehensive physical model of the scanner’s geometry, system optics and data statistics. It used 3D voxel volumes of the scanned object, and complex mathematical computations to account for scatter, beam hardening and metal attenuation artifacts [10, 14–16]. VEO™ required significantly longer processing times compared to FBP and ASIR™.
Siemens Healthcare (Erlangen, Germany) introduced its first generation iterative algorithm, iterative reconstruction in image space, (IRIS™) in 2009. IRIS™ operated in the image domain. A second generation algorithm was released in 2011, named SAFIRE™ (sinogram affirmed iterative reconstruction). SAFIRE™ operated in the data and image domains, using a dynamic raw-data-based noise model for image noise reduction [17–19]. The level of noise removal and image noise texture was determined by the ‘strength’ of SAFIRE™, with up to five user-selected strength levels [19].
Toshiba Medical Systems (Tochigi, Japan) introduced adaptive iterative dose reduction 3D (AIDR3D™) in 2012. AIDR3D™ was also designed to work both in the data and image domain. Noise reduction in the data domain involved the combination of scanner and statistical models to characterize and reduce noise. Iterations in the image domain resulted in further image quality enhancements leading to the final image [20–23]. The level of noise reduction was adjustable by four preset strengths namely: ‘weak’, ‘mild’, ‘standard’, and ‘strong’.
Philips Healthcare (Eindhoven, Netherlands) employs an algorithm commercially referred to as iterative dose reconstruction 4th generation (iDose4™). iDose4™ performs iterative processing both in the data and image domains. It uses a maximum-likelihood de-noising algorithm to remove noise in the data domain. A combination of physical and dynamic multi-frequency models are then used to reduce image noise while preserving high frequencies [10, 25]. The strength of the iterative process is adjustable by selecting one of seven levels.
Several studies have investigated the performance of iterative algorithms using objective quantitative parameters or subjective qualitative assessments [10, 26–33]. A useful review of different studies and a description of several algorithms can be found in Willemink et al. [4].
Hara et al. [26] reported the possibility of dose reduction in the range of 32%–65% with 40% ASIR™ strength, following a preliminary study in adult abdomen/pelvic examinations. Chang et al. [15] reported up to 46% dose reduction using model-based VEO™ with respect to a regular dose FBP, with superior depiction of lesion conspicuity in the liver. More recently, Mieville et al. [10] reported dose reduction possibilities of 67%–86% using quantitative physical metrics and subjective assessments of lesions in phantom images reconstructed with ASIR™ and VEO™.
For SAFIRE™, Takx et al. [34] reported improved diagnostic accuracy in coronary CT angiography of adults using 50% of radiation dose. Ebersberger et al. [29] reported improved noise and signal-to-noise ratio (SNR), with equivalent image quality assessments using SAFIRE™ at half-dose with respect to FBP at full dose. Baumueller et al. [17] reported improved image quality and diagnostic accuracy in non-enhanced low dose lung CT. Using quantitative physical metrics, Ghetti et al. [30] reported up to 60% noise reduction with SAFIRE™, without significant resolution improvements regardless of the choice of reconstruction kernel.
Yamada et al. [21] reported a 64% dose reduction using AIDR3D™, maintaining diagnostic acceptability in a low-dose chest CT examination. Improved SNR and decreased noise was obtained relative to FBP and an earlier version of the algorithm. Chen et al. [22] suggested that a 50% dose reduction produces comparable image quality with the standard FBP, using AIDR3D™ in a retrospective clinical coronary CTA study. Gervaise et al. [32] reported a mean noise reduction of 40% using phantoms, with improved SNR when compared to FBP. However, no spatial resolution improvement was observed.
For iDose4™, Li et al. [24] reported improved or equivalent image quality metrics in contrast-enhanced chest CT of 80 patients after 71% dose reduction at low-tube voltage (80kVp) with respect to standard FBP (120 kVp). Mieville et al. [10] reported improved contrast-to-noise ratio (CNR) and lower noise in phantom scans, but no spatial resolution change was observed.
The purpose of this study was to measure objective image quality metrics on four clinical CT scanners, and to investigate the performance of their respective iterative algorithms. We investigated the following algorithms: ASIR™ and VEO™ (GE Healthcare, WI, USA); SAFIRE™ (Siemens Healthcare, Erlangen, Germany); AIDR3D™ (Toshiba Medical Systems, Tochigi, Japan) and iDose4™ (Philips Healthcare, Eindhoven, Netherlands). We focus on routine abdomen/thorax protocols, which are the most clinically common protocols in our jurisdiction [35].
Methods and materials
Data acquisition and image reconstruction
The iterative algorithms were available on the following multi-detector CT (MDCT) scanners: GE DiscoveryCT750HD, Siemens Somatom Definition AS+, Toshiba Aquilion64, and Philips Ingenuity iCT256, which were installed at different institutions in our jurisdiction. All four MDCT scanners used in this study could reconstruct images using their respective conventional filtered-back projection (FBP) and the iterative algorithm available on each scanner.
In all acquisitions obtained for this study, the helical pitch and tube current-time product (mAs) were manually changed to obtain the volume CT dose index (CTDIvol) in the range 0.75 – 18.7 mGy. We used the routine abdomen/thorax protocols of each scanner. The acquisition protocols and exposure parameters are presented in Table 1. Note that for VEO™ reconstructions, acquisitions on the scanner was carried out using 0.625 mm thin slices, before reformatting to obtain 5 mm thick slices.
Phantom and image quality analysis
Algorithm performance was evaluated by scanning the Catphan500 phantom (Phantom Laboratory, Salem, NY USA). The Catphan500 phantom consists of four modules: CTP401, CTP528, CTP515, and CTP486 which are used for testing CT number accuracy, high-contrast spatial resolution, low-contrast detectability, and noise/CT image uniformity, respectively.
Phantom images were used to measure standard deviation (noise), contrast-to-noise ratio (CNR), the modulation transfer function (MTF) and the noise power spectrum (NPS). We reconstructed images of the phantom using FBP and several strengths of the iterative algorithms available on each scanner.
We measured the standard deviation (SD) in the centre ROI’s (605 mm2) in three uniform slices of the CTP486 uniformity module, as a way of assessing image noise. We calculated the average SD obtained from the slices and the standard deviation of the average. Conventionally, image noise was Poisson distributed [36]. In order to examine whether the choice of algorithm affected this relationship, we performed a non-linear least squares fit of the noise-dose relationship using the following formula [36]:
The CNR was measured in the low-contrast module (CTP518). We chose two spherical ROI’s (65 mm2) inside the module (Fig. 1). The ROI inscribed inside the 1% contrast structure represented the signal, while the other ROI represented the background. The CNR was evaluated according to the following equation [10]:
We averaged the CNR measurements in four different slices of the CTP518 module, with the results reported as mean±standard deviation of mean.
For the planar MTF and NPS analysis, we adopted the methods described in Friedman et al. [37], modified for the CTP486 uniformity module, for all except the GE scanner. The MTF analysis is based on computing an over-sampled edge profile from a radial edge. The edge profile was differentiated to obtain the corresponding line spread function (LSF), to which a Hanning window was applied followed by a fast Fourier transform (FFT). The MTF is then calculated as the modulus of the FFT of the LSF. For the GE scanner, the MTF was computed from the point spread function (PSF) obtained directly from the image of a high contrast tungsten carbide bead point (0.28 mm diameter) located in the CTP528 module of the Catphan phantom [38]. We resorted to this approach because the GE system artificially set the air pixels to a constant value, preventing the real edge profile from extending far enough outside the phantom.
The NPS was determined by scanning the uniformity module of the phantom twice. Three image slices from each scan were subtracted voxel-by-voxel to produce ‘detrended’ images. From the latter, thirty two ROIs (1282 voxels) selected along a 15 cm radius from the center of the image were generated, and a 3D FFT of each ROI was calculated. From the FFT results, the 3D NPS was calculated, and a radially-averaged axial NPS profile was generated [37].
We performed image analysis using the Java powered ImageJ public domain software (ImageJ version v1.46r: National Institute of Health, USA) and Gnumeric spreadsheet (version 1.10.16: GNOME Project). The noise-dose relationship of Equation (2) was determined by performing non-linear least-squares fitting using the curve fit tool box in MALATB (version R2013a: The Mathworks, Inc., Natick, MA USA). The NPS and MTF were calculated using also using the said version of MATLAB.
Image noise
Figure 2 shows the mean of the SD as a function of CTDIvol. As expected, noise decreases with dose, regardless of algorithm. Iterative algorithms yield lower noise than FBP when all other parameters are kept unchanged. Further, dose reduction increases for all scanners with increasing iterative algorithm strength. For scans with ASIR and SAFIRE, noise reduction (relative to FBP) did not change with radiation dose or pitch factor. For example, at a CTDIvol value of 1.5 mGy and pitch factor of 0.516, noise reductions of 11%, 21%, 30%, 39%, and 47% were obtained with ASIR20, ASIR40, ASIR60, ASIR80 and ASIR100 respectively. At almost 10 times the dose (CTDIvol of 14.97 mGy) and the same pitch factor, image noise decreases by 11%, 21%, 31%, 41%, and 49%, respectively. Using pitch factor 0.984 at 1.5 mGy, the noise reductions were 11%, 21%, 31%, 39% and 47%.
Noise reduction due to VEO™ and AIDR3D varied depending on the dose and pitch factor. For example, the standard deviation was lower in the VEO-reconstructed images by 43% at 1.5 mGy and 0.516 pitch. At 14.97 mGy, the noise reduction was just 12%. With 1.5 mGy and pitch factor of 0.984, the noise was lower by 51% compared to FBP. At 1.3 mGy and 0.9 pitch factors, AIDR3D-mild reduced the noise by 49%, while at 7.5 mGy the relative noise reduction was 34%. For iDose, there were small differences (<6%) in the extent of dose reduction as a function of dose or pitch.
Table 2 lists the fit coefficients for equation (2). The exponent was close to 0.5 in most cases, suggesting a quasi-Poisson noise-dose relationship. The exceptions were VEO™ and the Toshiba reconstructions, in which even FBP did not follow the Poisson relationship.
Contrast-to-noise ratio
Figure 3 shows the CNR as a function of CTDIvol for the different algorithms on all four scanners. Generally, all iterative algorithms increase CNR relative to FBP and the CNR increase is higher with increasing iterative reconstruction strength. Improvements in CNR with model-based algorithms (VEO and AIDR3D) exhibited dependence on dose, as was observed for image noise. Generally, model-based algorithms resulted in more CNR improvement when the dose was very low.
Modulation Transfer Function (MTF)
The MTF results are shown in Fig. 4. For all systems except the GE DiscoveryCT750HD scanner, their appeared to be little variation in the MTF with choice of algorithm, iterative strength, dose or helical pitch factor, over the dose range considered. Comparable results were obtained at other doses but are not shown here.
For GE DiscoveryCT750HD scanner, the spatial resolution at 50% MTF was 0.37 mm-1 for FBP and all ASIR™ strengths, while for VEO™, the resolution at 50% MTF was 0.48 mm-1. The limiting spatial resolution (at 10% MTF) was ∼ 0.67 mm-1 for FBP and ASIR™, and 0.87 mm-1 using VEO™. This represents a 30% increase in limiting resolution compared to FBP and ASIR™.
Noise Power Spectrum (NPS)
Figure 5 shows the1-D NPS curves at a dose value of ∼1.5 mGy. Generally, increasing strengths of iterative reconstruction reduce the amplitude of the NPS and shift its peak towards lower frequencies. In addition, higher helical pitch increases the NPS amplitude. For the GE scanner (Fig. 5a), the lowest peak spatial frequency of ∼ 0.1 mm-1 is obtained with ASIR100 and VEO™. For the Siemens Somatom Definition AS+ scanner, (Fig. 5b), one observes that the NPS amplitude does not depend on helical pitch. This is consistent with the use of “effective mAs” to compensate for the effect of the helical pitch on noise. This is not the case with the Philips scanner (Fig. 5d), where the NPS shows weak dependence on pitch despite use of the “effective mAs”. The same trend is observed at a higher dose (Fig. 6) for all reconstructions except in VEO™ on the GE system. The NPS obtained at high dose from the VEO™ model-based reconstruction maintains the lower peak frequency, but its magnitude is higher than high levels of ASIR.
Discussion
Image-domain standard deviation measurements and NPS calculations demonstrated that image noise decreases with the use of iterative algorithms or with increasing iterative reconstruction strength, at equivalent dose values. The shape of the NPS curve produced from FBP images is comparable to data available in literature, as the NPS increases at lower frequencies, and decreases at higher frequencies consistent with ramp and apodization filters [37, 39]. Moreover, the noise spectrum shifted towards lower frequencies with increasing iterative strength, suggesting that the image noise texture appeared coarser. With the two systems (Siemens and Philips) that employ “effective mAs” to compensate for the effect of helical pitch, the magnitude of the NPS did not vary or varied slightly with pitch. With the other two systems (GE and Toshiba), the magnitude of the NPS is higher for higher pitch, as expected. The effect of pitch, however, diminishes with increasing strength of iterative processing.
We also presented coefficients for fitting the standard deviation as a function of dose for the various algorithms, using the formula SD = a · CTDI-x where x = ½ if the Poisson model applies. GE’s VEO™ and all of Toshiba’s reconstructions did not conform to the Poisson model. Model-based algorithms employ regularization terms to enforce smoothness in the reconstructed images. Regularization terms control the tradeoff between noise and resolution. The extent of regularization depends on the local differences in pixel values [40]. Our results are similar to other reports which demonstrated this difference in noise-dose power law in model-based algorithms with an exponent less than ½ [41]. The smaller exponent suggests that image noise in model-based algorithms has a weaker dependence on dose. In our results, for example, a 50% reduction in dose will increase VEO™ image noise by 26%, compared to the Poisson relationship which predicts a noise increase of 40%. This helps explain the superior noise reduction observed for model based algorithms at very low doses.
The CNR improves with increasing strength of iterative reconstruction on all scanners. The model-based iterative reconstruction algorithm from GE, VEO™, has a different performance pattern. It offers significant improvements in CNR and noise performance at low doses. The improvements however diminish as dose increases. For example, given the scan parameters employed in our study, the CNR obtained with VEO™ becomes equal to the CNR obtained with ASIR100 at 1 mGy for PF = 0.516 and 2 mGy for PF = 0.984. A similar pattern can be observed from another study [10], although the dose value at which equivalent performance occurs was higher (∼3.5 mGy). The pitch factor in that study was 1.375. This suggests that VEO™ is more beneficial at low doses and high pitch factors. The NPS results obtained for VEO™ in our study further demonstrate this noise behavior.
Generally speaking, spatial resolution did not depend on choice of algorithm (provided the same kernels were used) or the strength of iterative reconstruction, over the range of doses of this study. In the case of GE reconstructions, VEO™ increases spatial resolution by 30% over FBP and ASIR™. This is the result of employing an image acquisition model which incorporates and compensates for sources of image blur. In this manner, VEO™ has an additional benefit over the noise-reduction capabilities of the other algorithms. Promotional claims of spatial resolution improvements due to the use of iterative algorithms should be interpreted with care. They often refer to scenarios where sharper or high resolution reconstruction kernels are used without a noise penalty due to the superior noise suppression of iterative methods.
Table 3 compares our results with those published in other reports [30]. Differences in the scanning conditions between our study and other reports are also listed in Table 3. Given these differences, the comparison of results should proceed with caution and is qualitative at best. Mieville et al. [10] reported up to a 70% relative noise reduction for pediatric scans using VEO™ with respect to FBP, at a lower dose of 0.5 mGy. The larger noise reduction is consistent with the observation of more aggressive noise reduction with VEO™ at lower dose. Although the results Mieville et al. were presented in the context of pediatric scanning, they actually used a medium body x-ray filter. They also reported MTF improvements of 6% and 37% using ASIR100 and VEO™ respectively, compared to FBP. It should be noted however that their measurements were obtained with thinner reconstruction slice thickness (1.25 mm).
For SAFIRE™, our results are comparable to those of Ghetti et al. [30], who reported up to 35% and 59% noise reduction with SAFIRE-L3 and SAFIRE-L5 respectively. They also reported no change in the MTF between FBP and SAFIRE™. For Toshiba’s AIDR3D™, Gervaise et al. [32], reported less noise reduction for the earlier version of the Toshiba iterative algorithm (AIDR™) using phantom scans. Joemai et al. [42] reported up to 65% noise reduction using AIDR3D over a similar dose range as used in our study. Both studies [32, 42] reported no spatial resolution improvements between FBP and AIDR™ and AIDR3D™.
For iDose4™, our results are in agreement with the CT vendor’s claim of a noise reduction of 11% to 45% for iDose4-L1 to L6 [25]. Our MTF measurements were consistent with those reported by Mieville et al. [10], showing no difference between the MTF obtained with FBP and iDose4™. The 50% MTF we obtained (0.29 mm-1) is slightly lower than that of Mieville et al. [10] (0.33 mm-1).
This study is limited in that it restricts the analysis to abdomen/thorax protocols at 120 kVp using the standard reconstruction kernel. Previous surveys of clinical protocols showed that these were the most common clinical parameters, in our jurisdiction and others [35, 43]. Also, only two pitch factors for each scanner were considered. It is conceivable that the performance of the different algorithms will vary with pitch and reconstruction kernel. We already alluded to the fact that the performance benefits of VEO™ are enhanced when a high pitch is used, resulting in the reduction of the dose per slice. Different kernels will affect image noise and the visibility of fine details differently [37, 39] and may alter the relative performance obtained from using iterative algorithms. In addition, some studies have demonstrated that spatial resolution in the context of iterative algorithms can be dependent on subject contrast. In model based iterative algorithms, image noise depends on the regularization parameters which in turn are affected by local pixel gradients in the image. While our results demonstrate the relative performance of iterative algorithms, care must be exercised in generalizing these results to other clinical protocols or scanning conditions.
Another limitation of this study is in that it uses image quality metrics based on linear system theory and assumptions of linearity and stationarity. Iterative algorithms are nonlinear [44] and the geometric models are shift variant [40]. There have been suggestion to use local point-spread functions [44] or task-based MTF, evaluated under different noise and contrast conditions [45]. However, linear system-based methods are well established and several studies have used them [10, 46] and recent work has shown that the NPS of a model-based image reconstruction maintains radial symmetry [44], which justifies our use of ROI’s along a fixed radius from a small number of images. There has also been recent work on using phantoms with textured backgrounds for CT performance assessment, which was not considered in this work [47].
This study focused on objective image quality metrics obtained from images of a quality control phantom. This is useful in quantifying the performance of different algorithms, comparing them and verifying vendor claims. To facilitate comparisons between algorithms and vendor platforms, in Fig. 7 we show the noise reduction achieved by each algorithm over FBP at two fixed doses, for the lower pitch factors used in our study. We also show the dose at which each algorithm achieves image standard deviations of 5 and 10 in Fig. 8. The results, should, however be interpreted with care as they apply to phantom scans using specific scan parameters. Clinical adoption in particular, requires assessment using patients or realistic anthropomorphic phantoms. Further, clinical assessment looks at a broader range of issues related to image quality than QC phantom assessment. Recent work in our group, for example, showed that a model based algorithm yielded superior results to statistical or FBP reconstruction, when examined using a porcine model and a clinically relevant rubric [48]. The results of this work are a useful starting point for more clinically relevant assessments. They provide results from several vendors and can be useful in guiding the implementation of protocols across different platforms.
Conclusion
The results presented in this study characterized the performance of modern CT scanners equipped with commercially available iterative algorithms using objective image quality metrics. Several of the algorithms investigated (ASIR™, SAFIRE™ and iDose4™) are effective noise reduction algorithms, not affecting spatial resolution or the Poisson noise-dose relationship significantly. Both the Toshiba AIDR3D™ and GE VEO™ algorithms alter the Poisson relationship, with the change more pronounced in the fully model-based VEO™.
VEO™ offers improvement in spatial resolution over the standard FBP reconstruction. It appears to be most effective when used in low dose scenarios. As dose increases, ASIR™ results in CNR values equivalent or superior to those of VEO™. VEO™ requires long processing times and does not produce reconstructed images in real time.
Several studies [4, 49–51] have explored the clinical applications of iterative algorithms. While our study focused on an objective assessment using phantom images, it is useful in that it reports on the performance of algorithms from 4 different vendors. Further, the data for our study were obtained from the different vendors under similar acquisition conditions, which facilitate inter-system comparisons.
Footnotes
Acknowledgments
The authors wish to thank the diagnostic imaging departments at St. Boniface General Hospital (Winnipeg, MB, Canada), Boundary Trails Health Centre (Winkler, MB, Canada), Bethesda Hospital (Steinbach, MB, Canada) and the Grace General Hospital (Winnipeg, MB, Canada) for providing access to their CT scanners.
