Abstract
Background:
Accurate and fast dose calculation is crucial in modern radiation therapy. Four dose calculation algorithms (AAA, AXB, CCC, and MC) are available in Varian Eclipse and RaySearch Laboratories RayStation Treatment Planning Systems (TPSs).
Objectives:
This study aims to evaluate and compare dosimetric accuracy of the four dose calculation algorithms applying to homogeneous and heterogeneous media, VMAT plans (based on AAPM TG-119 test cases), and the surface and buildup regions.
Methods:
The four algorithms are assessed in homogeneous (IAEA-TECDOCE 1540) and heterogeneous (IAEA-TECDOC 1583) media. Dosimetric evaluation accuracy for VMAT plans is then analyzed, along with the evaluation of the accuracy of algorithms applying to the surface and buildup regions.
Results:
Tests conducted in homogeneous media revealed that all algorithms exhibit dose deviations within 5% for various conditions, with pass rates exceeding 95% based on recommended tolerances. Additionally, the tests conducted in heterogeneous media demonstrate high pass rates for all algorithms, with a 100% pass rate observed for 6 MV and mostly 100% pass rate for 15 MV, except for CCC, which achieves a pass rate of 94%. The results of gamma index pass rate (GIPR) for dose calculation algorithms in IMRT fields show that GIPR (3% /3 mm) for all four algorithms in all evaluated tests based on TG119, are greater than 97%. The results of the algorithm testing for the accuracy of superficial dose reveal variations in dose differences, ranging from –11.9% to 7.03% for 15 MV and –9.5% to 3.3% for 6 MV, respectively. It is noteworthy that the AXB and MC algorithms demonstrate relatively lower discrepancies compared to the other algorithms.
Conclusions:
This study shows that generally, two dose calculation algorithms (AXB and MC) that calculate dose in medium have better accuracy than other two dose calculation algorithms (CCC and AAA) that calculate dose to water.
Keywords
Introduction
The accurate and fast calculation of the dose distribution in patient plans is one of the most crucial processes in modern radiation therapy [1]. Dose delivery uncertainties of around 5% can influence the therapeutic window through which treatment is delivered, leading to reduced Tumor Control Probability (TCP) and/or increased Normal Tissue Complication Probabilities (NTCP). Therefore, based on recommendations from ICRU report 24, the overall dose uncertainty should be kept less than 5% [2]. The availability of model based, and commercial Monte Carlo (MC) dose calculation algorithms have the potential to reduce the dose uncertainty resulting from the treatment planning process [3]. Dose calculation algorithms can be categorized in terms of the medium of radiation transport and dose calculation. The physical properties of the medium for radiation transport and dose calculation are handled differently in different treatment planning systems [4]. There are three separate quantities used to specify the dose:
As with all dose calculation algorithms, before their introduction into clinical practice, careful validation of dose calculations against measurement is required [3]. IAEA and AAPM suggest treatment simulation tests with acceptability criteria for beam models in the treatment planning systems (TPS) to assess the agreement with measured data. These publications allocate different tolerances to specific zones within the data, such as high dose, low dose, and high dose gradient sections found in the buildup or penumbra regions of beam data. They also address specific tests and tolerance levels in heterogeneous media, as well as tests for evaluating the accuracy of intensity modulated radiation therapy algorithms in fields [6–9].
Despite the existing literature on reviewing and comparing commercial dose calculation algorithms in different TPSs for various evaluation tests [10–12], there is still a lack of comprehensive studies that encompass a wide range of tests to facilitate direct comparisons and reveal the strengths and weaknesses of each algorithm. Furthermore, limited research explores different algorithms utilizing diverse types of dose calculations and using the same measured beam data for beam configuration, which is essential for unbiased and independent comparisons of their performance.
This study evaluates four commonly used dose calculation algorithms in clinical practice. The evaluation is conducted using the same measured beam data in one setting which includes all percentage depth-doses (PDDs), profiles and output factors, across various beam configurations within the Varian Eclipse and RaySearch Laboratories RayStation TPSs. By employing consistent measured beam data, the study aims to eliminate the performance dependency of the algorithms on the quality of imported measured beam data. Moreover, special attention is given to evaluating the accuracy of the Monte Carlo dose calculation algorithm within the RayStation TPS, an area that has received limited attention in existing literature. The findings of this study will provide valuable and comprehensive insights and serve as a reference for assessing the accuracy of dose calculation algorithms. Ultimately, this study aims to enhance our understanding of the strengths and weaknesses associated with various dose calculation algorithms, enabling meaningful comparisons between them.
In this study after modelling all algorithms based on vendor guidelines for each TPS, comprehensive comparison tests for all four algorithms were carried out. These tests included the algorithm’s accuracy:
Materials and methods
Dose calculation algorithms
Four dose calculation algorithms, which are commercially available in two different TPS, were included in this study. Eclipse (version 16.1) - Varian Medical Systems, Palo Alto, USA AAA: Anisotropic Analytical Algorithm AXB: Acuros XB RayStation (version 10B) - RaySearch Laboratories AB, Stockholm, Sweden CCC: Collapsed Cone Convolution MC: Monte Carlo
Both Eclipse and RayStation were chosen for the study based on their extensive utilization in clinical practice and their robust treatment planning capabilities. A key factor in their selection was the availability of two types of dose calculation algorithms: Dw,w and Dm,m. This enabled a comprehensive evaluation of their performance and accuracy, providing valuable insights into the strengths and limitations of each algorithm.
2.1.a. Eclipse ’ ‘TPS’s dose calculation algorithms
2.1.b. RayStation ’ ‘TPS’s dose calculation algorithms
A short summary of the evaluated dose calculation algorithms in the study has been provided in Table 1.
Summary of the Evaluated Four Dose Calculation Algorithms in the Study. (AXB: Acuros XB, AAA: An nisotropic analytic algorithm, CCC: Collapsed Cone Convolution and MC: Monte Carlo)
Summary of the Evaluated Four Dose Calculation Algorithms in the Study. (AXB: Acuros XB, AAA: An nisotropic analytic algorithm, CCC: Collapsed Cone Convolution and MC: Monte Carlo)
This study used the same measured data including PDDs, profiles and output factors for modelling and commissioning of all four mentioned algorithms. A dose voxel grid calculation size of 0.1 cm3 was used for dose calculation for all algorithms in all the measurement tests in this study. An uncertainty of 0.5% was selected for dose calculation by MC dose calculation algorithm in all measurement tests in this study. In both TPS, a 50 cm long cubic phantom made of virtual water was used for all point doses, percentage depth dose (PDD) and beam profile calculation in homogenous medium by all algorithms. All measurements in the homogenous medium in this study were carried out using a PTW BEAMSCAN (PTW, Freiburg, Germany) water tank. Varian IX linear accelerator with the Millennium MLC120 (Varian, Palo Alto) with nominal 6 MV and 15 MV flattened beams used for all tests of this study.
2.2.a. Algorithms accuracy comparison in homogenous media based on IAEA-TECDOC 1540
The IAEA has developed TECDOC-1540 [8], which provides specific acceptance tests for dose calculation algorithms of TPS. The proposed tests are divided into three categories: the type tests, the site tests, and the optional tests. A summary of all dosimetric tests specified in TECDOC-1540 is given in Table 2. All tests 1–12 are type tests, tests 1–9 are site tests and tests 10–12 are optional. In the present study, all tests without the presence of inhomogeneity have been performed (all the tests were performed except test 8).
Summary of all external beam calculation tests proposed in the IAEA-TECDOC-1540 and Sample criteria of acceptability for external beam TPS calculations Summary of all external beam calculation tests proposed in the IAEA-TECDOC-1540 and Sample criteria of acceptability for external beam TPS calculations
Summary of all external beam calculation tests proposed in the IAEA-TECDOC-1540 and Sample criteria of acceptability for external beam TPS calculations Summary of all external beam calculation tests proposed in the IAEA-TECDOC-1540 and Sample criteria of acceptability for external beam TPS calculations
For all tests summarized in Table 2, the measurements are given in Excel spreadsheets. Three different approaches [8] have been used in these sheets to evaluate TPS calculations and the measurements: Relative Difference (RDmeas): Dose difference between calculated dose by TPS (D
cal
) and measured dose in the same point (D
meas
) normalized to the measured dose (D
meas
).
Relative Normalized Difference for open central axis fields (RNDopen . cax): Dose difference between the dose calculated by the TPS in an open central axis field (Dcal . open) and the measured dose in the same point (Dmeas . open) normalized to the measured dose of the central axis of an open field in the same depth (Dmeas,open . cax).
Relative Normalized Difference for blocked central axis fields (RNDblocked . cax): Dose difference between the dose calculated by the TPS in a blocked central axis field (Dcal . blocked) and the measured dose at the same point (Dmeas . blocked) normalized to the measured dose in an open field for the same depth (Dmeas,open).
where, Dmeas represents the dose value measured by the PTW semi-flex ionization chamber 31002 (PTW, Freiburg, Germany) using the PTW BEAMSCAN (PTW, Freiburg, Germany) water tank.
The recommended equations for the comparison of measured and calculated data and the acceptability criteria are also given in Table 2.
PDD and beam profile measurements were acquired with a PTW pinpoint chamber 31022 (PTW, Freiburg, Germany) for 6 MV and 15 MV photon beam energies. The dosimetric data was acquired for fields sizes: 1×1 cm2, 2×2 cm2, 3×3 cm2, 4×4 cm2, 10×10 cm2 and 20×20 cm2, which were all defined by jaws. Also, beam data was measured for two MLC defined fields of 1×1 cm2 and 2×2 cm2. The accuracy of jaw and leaf calibration was verified through a short profile scan proceeding. All measurements were taken at 100 cm source to surface distance (SSD) with a 1 mm measurement step size in the scan direction, except for the 40×40 cm2 diagonal profiles that were made at an SSD of 90 cm and had a step size of 5 mm across the central 20 cm of the field. All beam profiles were measured at 10 cm depth. The PDD scanning was performed starting from 300 mm depth to 0 mm depth in the upward direction throughout the phantom to avoid water turbulence. As a reference detector for PDD and beam profile measurements for field sizes 2×2 cm2 and 4×4 cm2, the PTW T-REF plane-parallel transmission chamber (PTW, Freiburg, Germany) was used. For larger field sizes, the PTW semi-flex ionization chamber 31002 (PTW, Freiburg, Germany) was used as the reference detector.
2.2.c. Algorithm accuracy comparison in heterogeneous mediums based on IAEA-TECDOC 1583
The procedures for IAEA-TECDOC-1583 tests [9] are based on the CIRS thorax phantom (CIRS Model 002LFC). The CIRS thorax phantom (Fig. 1.a) is elliptical (30 cm long×30 cm wide×20 cm thick) and represents an average human torso in proportion, density, and three-dimensional structures. The phantom has soft tissue, lung, and bone sections with holes to hold interchangeable rod inserts. This phantom is the phantom equipped with a set of five calibrated electron density reference plugs, as shown in Table 3. The measurements in this study were performed for both 6 MV and 15 MV beams by placing a 31013 Farmer 0.6cc ionization chamber (PTW, Freiburg, Germany) into the different holes in the phantom.

a. Thorax Phantom (CIRS Model 002LFC). b. Labelling of holes and the recommended arrangement of the certified electron density reference plugs for the CT scan. Plug 1-water equivalent, plug 2- muscle substitute, plug 3 –syringe filled with water, plug 4 - adipose substitute, plug 5 - water equivalent, plug 6 - lung substitute, plug 7 - should be empty to represent air, plugs 8 & 9 - lung substitutes, plug 10- bone substitute.
Characteristics of calibrated electron density reference plugs
The holes in the phantom are labelled to identify the locations of the points of measurement within the phantom, thus enabling the comparison of the TPS calculations and the measured values. The recommended labelling of the holes and the recommended arrangement of the calibrated electron density reference plugs during CT scan are given in Fig. 1.b.
All the tests in this part of the study were performed based on the dosimetric tests proposed in Table A (1–12) of TECDOC-1583 [9]. To evaluate the measured and TPS calculated values, the following equation was used to determine the Relative Difference (RDref):
The purpose of this part of the study was to compare the dose calculation algorithms used for volumetric modulated arc therapy (VMAT) plans for the AAPM TG-119 test cases [6]. Four structure sets and computed tomography (CT) images were downloaded from the AAPM website (www.aapm.org) and imported into the Eclipse and RayStation TPS. These were the structures of Multi-Target (a), Mock Prostate (b), Mock Head-and-Neck (c), and C-shape (d) and are shown in Fig. 2.

AAPM TG-119 structure sets [reference12]. Multi-Target (a), Mock Prostate (b), Mock Head-and-Neck (c), and C-shape (d).
The VMAT dose distributions of each optimized fluence map for these cases (each case uses two complete arcs) were calculated using the AAA, AXB, CCC and MC algorithms for the 6 MV photon beam generated by a Varian IX linear accelerator. The 15 MV photon beam is generally not preferred for VMAT due to its lower modulation and potential neutron contamination [21], which is not observed with the 6 MV beam. Therefore, we solely considered the 6 MV beam in this test of our study and did not include the evaluation of the 15 MV beam. All parameters, dose prescriptions, and planning objectives followed the TG-119 guidelines. The dose calculation and dose optimization grid size were set at 0.25cm3 subsequently, the calculated dose distributions for each algorithm was converted and recalculated onto the ArcCheck phantom, (Model 1220, Sun Nuclear, Melbourne, FL). The ArcCheck is a cylindrical PMMA phantom with a three-dimensional array of 1386 diode detectors, arranged in a spiral pattern with 10 mm sensor spacing. In the final step, each dose distribution was recalculated for the ArcCheck in each TPS and compared with the ArcChack measured dose distribution using the gamma (γ) evaluation algorithm.
Roos Chamber 34001 parallel-plate ionization chamber (PTW, Freiburg, Germany) was used for surface and buildup region dose measurements for 6 MV and 15 MV photon beams. The shift of the effective point of measurement of the Roos chamber is Δz = (0.4±0.1) mm downstream from the front surface of its sensitive air volume, also fixed plane separation of this chamber is 2 mm. A set of water equivalent RW3 slab phantoms (SP34, PTW Freiburg, Freiburg, Germany) with physical density of 1.045 gcm-3 was used for open field irradiation. The effective measurement point of the chamber was defined by using water equivalent depth (WED) values in the buildup region. Surface and buildup region dose measurements were carried out with a Roos chamber for 10×10 field size at 0, 1, 2, 3, 4, 5, 10 and 15 mm phantom depths for 6 MV photon beam and same depths plus 30 mm phantom depth for 15 MV Photon beam. Irradiations were made using a Varian IX linear accelerator with 6 MV photon energy in a fixed SSD of 100 cm by delivering 100MU. To obtain the percentage depth doses (PDDs), the measured doses were then normalized to the dose at 15 mm phantom depth for 6 MV photon beam and 30 mm phantom depth for 15 MV photon beam where were accepted the dose maximum depths. The overdoses obtained from Roos chamber were corrected by applying Gerbi’s method to PDDs [22]. The measured dose values at the phantom depths were used for interpolation calculations to obtain the doses at 0.07 mm WED (0.067 mm phantom depth).
CT images of the same set of phantoms that used for Roos chamber measurements was acquired by using Siemense Big Bore CT (Siemens Healthineers, Forchheim, Germany) with a 2 mm of slice thickness. These images were then sent to the TPSs for surface and buildup region dose calculations of 10×10 field size by AAA, AXB, CCC and MC dose calculation algorithms for the 6 MV and 15MVphoton beams.
Results
Algorithms accuracy comparison in homogenous media base IAEA-TECDOC 1540
For the 6 MV and 15 MV beam models, dose deviations under reference conditions (Field size:10×10, SSD:100, Depth:10) were 0.4% and 0.3% for AAA, 0.5% and 0.4% for AXB, 0.1% and 0.5% for the CCC, 0.4% and 0.3 for MC algorithms respectively. All dose differences in reference condition were less than or equal to 0.5%.
For the 6 MV and 15 MV beam models, mean dose difference in the high-dose regions at different depths and off-axis positions, with one parameter changed from reference conditions were 0.8±1% and 0.6±1.2% for AAA, 0.9±1.2% and 0.7±1% for AXB, 0.5±1.1% and 1±1% for the CCC, 0.8±1.1% and 0.8±1% for MC algorithms. All dose deviations in these positions were less than or equal to 2%.
For the 6 MV and 15 MV beam models, mean dose deviations in fields with multiple parameter changes (e.g., an off-axis measurement in the presence of a wedge etc. . . .) were 1±1.6% and 1.6±3.2% for AAA, 1±1.6% and 1.7±1.7% for AXB, 0.7±1.2% and 1.6±3.2% for the CCC, 0.9±1.5% and 1.4±2.7% for MC algorithms respectively. All dose deviations in these positions were less than or equal to 5%.
For the 6 MV and 15 MV beam models, total percentage pass rate base tolerances recommended by TEC-DOC 1540 presented in Table 2, were 97.8% and 95.2% for AAA, 98.8% and 96% for AXB, 97.1% and 95.1% for CCC, 98.9% and 97% for MC algorithms respectively. All dose calculation algorithms had more than 95% pass rates. Figure 3 and Fig. 4 show the mean dose differences of 6 MV and 15 MV for all tests, as shown in Table 2.

Mean dose difference of 6 MV for AXB: Acuros XB, AAA: An anisotropic analytic algorithm, CCC: Collapsed Cone convolution and MC: Monte Carlo.

Mean dose difference of 15 MV for AXB: Acuros XB, AAA: An anisotropic analytic algorithm, CCC: Collapsed Cone convolution and MC: Monte Carlo.
The measured PDDs and beam profile values for 6 MV and 15 MV photon energies were compared with the TPS calculated 6 MV and 15 MV beam profiles (Fig. 5) and PDD (Fig. 6) values for each dose calculation algorithms (AAA, AXB, CCC and MC) using global gamma [21]. Base recommendation of Medical Physics Practice Guideline 5.a (MPPG 5.a) [24], the unite in gamma error measure when the distance to agreement (DTA) of less than or equal to 1 mm and a dose difference (DD) of less than or equal to 2% were used. The gamma index pass rate (GIPR), i.e., the percentage of points with γ(2% /1 mm) less than one, was recorded for global dose region. The fraction of points passing the gamma criteria is evaluated over reference points with doses higher than 10% of maximum dose. Table 4 shows GIPR result for AAA, AXB, CCC and MC.

Comparison of measured vs calculated profiles for fields sizes: 1×1 cm2, 2×2 cm2, 3×3 cm2, 4×4 cm2, 10×10 cm2, 20×20 cm2 and 40×40 cm2 diagonal which were all defined by jaws at 10 cm depth, 100 cm SSD. a) 6 MV beam profiles b) 15 MV beam profiles. (AXB: Acuros XB, AAA: An anisotropic analytic algorithm, CCC: Collapsed Cone convolution and MC: Monte Carlo).

Comparison of measured vs calculated PDDs for fields sizes: 1×1 cm2, 2×2 cm2, 3×3 cm2, 4×4 cm2, 10×10 cm2 and 20×20 cm2 which were all defined by jaws at 10 cm depth, 100 cm SSD. a) 6 MV beam profiles b) 15 MV beam profiles. (AXB: Acuros XB, AAA: An anisotropic analytic algorithm, CCC: Collapsed Cone convolution and MC: Monte Carlo).
Global index pass rate result (GIPR) of 6Mv and 15 MV for AXB: Acuros XB, AAA: An anisotropic analytic algorithm, CCC: Collapsed Cone Convolution and MC: Monte Carlo
The γ evaluation of the calculated PDDs and Profiles by dose calculation algorithms and measured PDDs and beam profiles indicate very similar results with no significant differences being seen between evaluated algorithms. The average overall gamma pass rate was greater than 98% for all algorithms and for both energies.
For the 6 MV and 15 MV PDD models, average gamma pass rates were 99.3±1.01% and 98.8±1.2% for AAA, 99.1±1.0 and 99.0±0.8% for AXB, 98.6±1.1% and 99.0±0.9% for CCC, 98.8±1.4% and 98.7±0.7% for MC algorithms respectively. For the 6 MV and 15 MV beam profile models, gamma pass rates were 99.3±4.0 and 99.3±3.4% for AAA, 99.2±4.1% and 99.2±4.3% for AXB, 99.02±0.48% and 98.87±0.47% for CCC, 99.05±0.6% and 98.89±0.58% for MC algorithms respectively.
The differences in the results between measurements, calculations and agreement criteria related to each test for AAA, AXB, CCC and MC algorithms at 6 MV and 15 MV energies are shown in Fig. 7. For the 6 MV all algorithms had a 100% pass rate, also for 15 MV just CCC had 94% pass rate, and other algorithms had 100 pass rates. In water equivalent material behind lung for 6 MV (case4, point5), MC (0.0%), AXB and CCC (both with –0.5%) and AAA (1.2%) had the best accuracy respectively. For 15 MV at the same point MC, AXB and CCC all had –0.7% accuracy and AAA had 1.3% accuracy.

Percentage dose difference of 6Mv and 15MV for AXB: Acuros XB, AAA: An anisotropic analytic algorithm, CCC: Collapsed Cone convolution and MC: Monte Carlo in CIRS heterogeneous phantom. The green region within the graph space indicates the acceptable tolerance level for each test performed and red region indicates that measurements are greater than the acceptable percentage difference.
In central axis of the beam for lung material (case4, point 6) for 6 MV, AXB (0.0%), AAA (–0.5%), MC (–0.7%) and CCC (1%) had the best accuracy respectively. For 15 MV at the same point AAA (0.1%), CCC (0.5%), and AXB and MC (both with 0.6%) had the best accuracy respectively. For the beam off axis in lung material (case 5, point 7) CCC (0.6%), AXB (–0.7%), AAA (–1.4) and MC (1.9%) had the best accuracy respectively. For 15 MV in the same point AXB (0.7%), MC (1.3%), AAA (–1.4) and CCC (1.7%) had the best accuracy respectively.
In the central axis of the beam for bone material (case1, point 10) AAA (–1.2%), AXB (–1.7%), CCC (–2.0 %) and MC (–2.3) had the best accuracy respectively. For 15 MV in the same point AAA, CCC and MC (all with –2.4%) and AXB (with –3.0%) had the best accuracy respectively. For the beam off axis in bone material (case 6, point 10) AXB (–0.2%), AAA (–0.7), MC (0.8%) and CCC (0.9%) had the best accuracy respectively. For 15 MV at the same point MC (0.0%), CCC (0.1%) and AAA and AXB (both with –0.3%) had the best accuracy respectively.
GIPRs value for each dose calculation algorithm for five different plans based on TG119 [6] was compared with the dose distributions measured by the ArcCheck during treatment delivery is summarized in Table 5. Average GIPR (3% /3 mm) were 98.14 for AAA, 98.64 for AXB, 97.42 for CCC and 98.3 for MC. In addition, Average GIPRs (2% /2 mm) were 94.5±2.5% for AAA, 96.14±1.04% for AXB, 92.82±1.68% for CCC and 94.7±1.7% for MC.
Volumetric modulated arc therapy (VMAT) gamma index passing rate (GIPR) with of 2% /2 mm and 2% /2 mm criteria for AXB: Acuros XB, AAA: An anisotropic analytic algorithm, CCC: Collapsed Cone Convolution and MC: Monte Carlo
Volumetric modulated arc therapy (VMAT) gamma index passing rate (GIPR) with of 2% /2 mm and 2% /2 mm criteria for AXB: Acuros XB, AAA: An anisotropic analytic algorithm, CCC: Collapsed Cone Convolution and MC: Monte Carlo
The measured percentage depth doses at the surface and buildup regions for 6 MV and 15 MV photon beams using the Roos parallel-plate ionization chamber are given in Table 6. Also calculated dose by AAA, AXB, CCC and MC dose calculation algorithms at 6 MV and 15 MV photon beams for same measured depths summarized in Table 6. The mean average dose difference for first 5 mm depth between calculated and measured dose of 6 MV photon beam were –9.5%, 3.3%, –8.4% and 1.9% for AAA, AXB, CCC and MC dose calculation algorithms respectively. Mean average dose difference for first 5 mm depth between calculated and measured dose of 15 MV photon beam were –11.9%, 7.03%, –10.6% and 5.2% for AAA, AXB, CCC and MC dose calculation algorithms respectively.
Percentage depth doses (PDDs) obtained with a Ross chamber and calculated dose by AXB: Acuros XB, AAA: A nisotropic analytic algorithm, CCC: Collapsed Cone Convolution and MC: Monte Carlo for same depth in a water equivalent RW3 phantom using 6 MV photon beams for 10×10 field size at SSD = 100 cm (WED: Water Equivalent Depth)
Percentage depth doses (PDDs) obtained with a Ross chamber and calculated dose by AXB: Acuros XB, AAA: A nisotropic analytic algorithm, CCC: Collapsed Cone Convolution and MC: Monte Carlo for same depth in a water equivalent RW3 phantom using 6 MV photon beams for 10×10 field size at SSD = 100 cm (WED: Water Equivalent Depth)
To obtain the dose at 0.07 mm water depth, which is representing surface dose [23], an interpolation calculation was made by comparing the PDD data to a four-order polynomial fit of Roos chamber measurements in the buildup region (Table 6). Mean average dose difference for 0.07 mm water depth between calculated and measured dose of 6 MV photon beam were –19.5.1%, 8.3%, –14.8% and 4.8% for AAA, AXB, CCC and MC dose calculation algorithms respectively. Mean average dose difference for 0.07 mm water depth between calculated and measured dose of 15 MV photon beam were –19.01%, 11.3%, –15.2% and 7% for AAA, AXB, CCC and MC dose calculation algorithms respectively.
Table 7 provides a comparison of four dose calculation algorithms based on their performance scores in passing the tests of this study. Each algorithm was assigned a score ranging from 4 to 1, with 4 being the best score and 1 representing the lowest score. The scores were determined based on the algorithms’ performance in the specific tests conducted.
Comparison of Dose Calculation Algorithms Based on Test’s Scores. Each algorithm was assigned a score ranging from 4 to 1, with 4 being the best score and 1 representing the lowest score. Since test 3.4 was only performed for the 6 MV beam, no values were available for the 15 MV beam, and “N/A” was indicated in the table. (AXB: Acuros XB, AAA: An nisotropic analytic algorithm, CCC: Collapsed Cone Convolution and MC: Monte Carlo)
Comparison of Dose Calculation Algorithms Based on Test’s Scores. Each algorithm was assigned a score ranging from 4 to 1, with 4 being the best score and 1 representing the lowest score. Since test 3.4 was only performed for the 6 MV beam, no values were available for the 15 MV beam, and “N/A” was indicated in the table. (AXB: Acuros XB, AAA: An nisotropic analytic algorithm, CCC: Collapsed Cone Convolution and MC: Monte Carlo)
Table 7 have provided a clear illustration of the performance of each algorithm across multiple tests. The total score serves as a comprehensive measure of their overall performance. It is important to highlight that for 6 MV photons, the AXB and MC algorithms demonstrated the highest total scores of 16 and 15, respectively. Following closely behind, AAA and CCC achieved scores of 10 and 9, respectively. Similarly, for 15 MV photons, the evaluation yielded total scores of 13 for both MC and AXB, while AAA and CCC obtained scores of 7. These results underscore the differing performance levels of the algorithms and underscore the significant superior performance of AXB and MC, both of which are Dm,m type dose calculation algorithms, in comparison to AAA and CCC, which are Dw . w type dose calculation algorithms (p-value is lower than 0.05).
The dose calculation algorithm commissioning process for each TPS has three steps: 1) data entry, 2) calculation of the beam model, and 3) verification of the beam model [24–26]. Before the data is entered into the TPS for beam modelling, it is re-evaluated for potential processing errors (e.g., smoothing, mirroring etc.). For CCC and MC beam modelling in the RayStation TPS, after entering measured data and selecting the machine data from the machine template library, the next step is beaming modelling and user does this part. In the beam modelling, the Medical Physicist (MP) follows the vendor instructions and adjusts the source sizes, off-axis softening and beam profile corrections by using measured data: depth dose curves, in-plane and cross-plane profiles and output factors [18]. In the Eclipse TPS, the data required to calculate the beam model comes from a combination of data entered manually by the user (e.g., beam geometry, output factors, depth dose curves, beam profiles, flattening filter material) and data retrieved from a machine data library (step.1 of the dose calculation algorithm commissioning in Eclipse and RayStation is exactly same) [18, 26]. After data entry, in step 2, the Eclipse TPS independently from the user (opposite of Ray station, where the user carries out the beam modelling) tries to fit the beam model parameters to match the input measured beam profiles and depth dose curves [26]. In both TPSs and all algorithms, the TPS allows the user to evaluate the agreement between the calculated and measured curves for all input beam profiles and depth-dose curves [18, 26].
The final step of dose calculation algorithm commissioning is beaming model verification and the results of this step can be used for comparison dose calculation algorithms accuracy. Based on the recommendations of MPPG 5.a [24] basic validation testing in homogeneous media should be completed prior to testing algorithms in heterogeneous media and for IMRT fields. With the performance evaluation of each algorithm in homogenous media, the MP can be sure of how accurate the beam modelling is.
In this study, two types of evaluations of homogenous media were carried out. In the first type of evaluation, point dose measurements were comprised and validated for dose calculation algorithms based on IAEA-TECDOC 1540 [8]. The results of this part of the study showed good agreement between measured and calculated doses by all the algorithms in homogeneous media for; symmetric open, asymmetric open, wedged fields, open fields with different SSD and different radiation beam angles. For the 6 MV and 15 MV beam models in all algorithms, the total percentage pass rate was more than 95% with results not showing any significant differences between algorithms. This was based on using tolerances recommended by TEC-DOC 1540 [8]. Looking at the percentage pass rate, the scoring of dose calculation algorithms for both 6 MV and 15 MV can be taken as follows (from best to worst): MC(Dm,m), AXB(Dm,m), AAA(Dw,w), CCC(Dw,w). In this test, both algorithms that calculated Dm,m showed better results in comparison to algorithms those calculating Dw,w.
In the second type of evaluation in homogenous media, PDDs and beam profiles were comprised and validated for the four dose calculation algorithms (AAA, AXB, MC and CCC). Results of this part of the study showed average gamma pass rates of greater than 98% (gamma criteria 2% /1 mm) for all algorithms when comparing measured and computed PDDs and profiles. Based on gamma pass rates (gamma criteria 2% /1 mm) for PDDs, the scoring of dose calculation algorithms for both 6 MV and 15 MV can be taken as follows (from best to worst): AAA (Eclipse TPS), AXB (Eclipse TPS), CCC (Ray Station TPS), MC (Ray Station TPS). The scoring for beam profiles could be as follows (from best to worst): AAA (Eclipse TPS), AXB (Eclipse TPS), MC (Ray Station TPS), CCC (Ray Station TPS). The worst gamma pass rate for PDDs and profiles in all algorithms was seen in the penumbra region of the fields for profiles and in the buildup regions for PDDs in all investigated field sizes. This observation is in line with work carried out previously by other authors [27–31]. In addition, the PDD-s and profiles result of this part of the study does not show any significant differences between 1×1 cm2 and 2×2 cm2 field size created by MLC when compared to the same field sizes created by jaws alone for all evaluated algorithms. The results of this part of the study also showed Eclipse dose calculation algorithms where beam modeling is almost independent from the user gave better results when compared to RayStation dose calculation algorithms, where the modeling was carried out by the user. The results of evaluating the performance of all algorithms (AAA, AXB, CCC and MC) in homogenous media (e.g. water) showed that the generated beam models for 6 MV and 15 MV were excellent as per recommended reference guidelines [8, 24].
The performance of algorithms in heterogeneous media were compared according to TECDOC1583 [9]. Results of this part of the study showed that all algorithms had 100% pass rates for both 6 MV and 15 MV energies except CCC for test 4 (point 10). Test 4 (point10) evaluates an algorithms performance in a four-field box in bone type material. The CCC of the RayStation TPS under-estimated the dose relative to measurements by 3.9%. For the same test, deviation between measurement and calculated dose for the other algorithms were –1.5% for AAA and –2.7% for AXB and MC. Upon observing the mean percentage dose difference between measured and calculated doses in lung heterogeneities for 6 MV, the scoring is as follows (from best to worst): AXB (0.87%), CCC (1.07%), MC (1.3%) and AAA (1.42%). The scoring for 15 MV in lung is as follows (from best to worst): AXB (0.72%), MC (1.15%), AAA (1.05%) and CCC (1.8%). The deviations presented here were in line with the findings of other studies.31–37 Upon observing the mean percentage dose difference between measured and calculated dose in bone heterogeneities for 6 MV, the scoring is as follows (from best to worst): MC (1.23%), AXB (1.36%) AAA (1.36%) and CCC (1.4%). The scoring for 15 MV in bone is as follows (from best to worst): MC (0.6%), AXB (1.25%), CCC (1.1%), and AAA (1.8%). The deviations presented here were also in line with the findings of other conducted studies [33–39]. Dose deviations reported in this study were consistent with other studies where algorithms were compared to measurements in heterogeneous media: mean deviations were reported smaller than 3% for AAA [32, 35], 1.1% for AXB [33, 37], 2.1% for MC [33, 38] and 2.5% for CCC [33–39].
The results of GIPR for dose calculation algorithms in IMRT fields showed that GIPR (3% /3 mm) for all the algorithms (AAA, AXB, CCC and MC) in all evaluated tests based on TG119,6 were greater than 97% (no significant difference seen). By changing the GIPR from 3% /3 mm to 2% /2 mm a better evaluation of the dose calculation algorithm performance can be obtained as some problems may be hidden in GIPR (3% /3 mm). Looking at the average gamma pass rates (2% /2 mm) for VMAT fields based on TG119, the scoring of dose calculation algorithms can be taken as follows (from best to worst): AXB (96.14% ±1.04%), MC (94.7±1.7%), AAA (94.5% ±2.5%), CCC (92.82% ±1.68%). This is in agreement with Zhengwen Shen et al (2022), who showed the gamma pass rate values for intensity modulated fields obtained with AAA and AXB were 95.6±1.9% and 96.2±1.7% respectively, with 3% /2 mm criteria [40].
The Eclipse dose calculation algorithms (AXB and AAA) had better results in comparison to the RayStation dose calculation algorithms (MC and CCC). This can be due to less uncertainty in penumbra positions (for profiles) and build up regions (for PDD) for AXB and AAA compared to MC and CCC. Although uncertainty in penumbra position and build up are generally small, when moving from simple fields to IMRT and VMAT, such uncertainty is combined with in-field uncertainty and uncertainty in the penumbra and buildup shape; this combination either can averaged out or can become magnified depending on the plan. In the case of magnification, this uncertainty can contribute a significant portion of the in-field dose. Although MLC beam model fine-tuning for IMRT/VMAT can be effective on this part of the study, it is outside the scope of this study (MLC modeling of all algorithms in this study performed using recommendations of vendor).
Due to the skin toxicity in radiotherapy, accuracy of the buildup region dose calculation, especially within the initial 5 mm depth, has a great importance. Thus, the surface and buildup region calculation accuracy of AAA, AXB, CCC and MC algorithm were checked by comparing the calculated and measured dose values. Result of comparison showed for both photon beams energy (6 MV and 15 MV), Dw,w type algorithms underestimated surface dose and Dm,m type dose calculation algorithms overestimated build up region dose. The scoring of dose calculation algorithms for both 6 MV and 15 MV can be taken as follows (from best to worst): MC(Dm,m), AXB (Dm,m), CCC (Dw,w) and AAA(Dw,w) .
Also in this study, PDD values were investigated at the depth of 0.07 mm, which was suggested for surface dose measurements by ICRU Report [23]. The scoring of dose calculation algorithms for both 6 MV and 15 MV in surface depth recommended by ICRU can be taken as follows (from best to worst): MC(Dm,m), AXB (Dm,m), CCC (Dw,w) and AAA(Dw,w) .
Ying Cao et al. [41] evaluate the superficial dose calculation accuracy of four commonly used algorithms in commercially available TPS by film measurements. They reported the superiority of MC over AXB and superiority of AXB over AAA which is in line with the result of our study.
Conclusion
The modeling and dosimetric accuracy of AAA and AXB of Eclipse (version 16.1) TPS and MC & CCC of RayStation (version 10.B) TPS were investigated for 6 MV and 15 MV flattened beams for a Varian CLINIC-IX linear accelerator. Testing in homogeneous media and in heterogeneous geometries has demonstrated a high level of agreement between measurements and calculations for both TPSs. The result of this study showed that generally, dose calculation algorithms that calculate dose in medium (AXB and MC) have better accuracy than dose calculation algorithms that calculate dose to water (CCC and AAA) in heterogeneous media. The same general results for algorithms were also seen when comparing GIPRs for intensity modulated fields. Also results of our study showed that Dm,m dose calculation algorithms (MC and AXB) performed better calculation accuracy of the superficial dose in compare AAA and CCC which are dose calculation algorithms that calculate dose in water. The AXB dose calculation algorithm in the Eclipse TPS and the MC dose calculation algorithm in the RayStation TPS are optional dose calculation algorithms that customers can purchase from the vender. As such, if that option is viable for customers, it is better to purchase those algorithms.
