Abstract
Background
Early prediction of response to concurrent chemoradiotherapy (cCRT) could aid to further optimize treatment regimens for locally advanced cervical cancer (LACC) in the future.
Purpose
To explore whether quantitative parameters from baseline (pre-therapy) magnetic resonance imaging (MRI) and FDG-PET/computed tomography (CT) have potential as predictors of early response to cCRT.
Material and Methods
Forty-six patients with LACC undergoing cCRT after staging with FDG-PET/CT and MRI were retrospectively analyzed. Primary tumor volumes were delineated on FDG-PET/CT, T2-weighted (T2W)-MRI and diffusion-weighted MRI (DWI) to extract the following quantitative parameters: T2W volume; T2W signalmean; DWI volume; ADCmean; ADCSD; MTV42%; and SUVmax. Outcome was the early treatment response, defined as the residual tumor volume on MRI 3–4 weeks after start of external beam radiotherapy with chemotherapy (before the start of brachytherapy): patients with a residual tumor volume <10 cm3 were classified as early responders. Imaging parameters were analyzed together with FIGO stage to assess their performance to predict early response, using multivariable logistic regression analysis with bi-directional variable selection. Leave-one-out cross-validation with bootstrapping was used to simulate performance in a new, independent dataset.
Results
T2W volume (OR 0.94, P = 0.003) and SUVmax (OR 1.15, P = 0.18) were identified as independent predictors in multivariable analysis, rendering a model with an AUC of 0.82 in the original dataset, and AUC of 0.68 (95% CI 0.41–0.81) from cross-validation.
Conclusion
Although the predictive performance achieved in this small exploratory dataset was limited, these preliminary data suggest that parameters from baseline MRI and FDG-PET/CT (in particular pre-therapy tumor volume) may contribute to prediction of early response to cCRT in cervical cancer.
Keywords
Introduction
In cervical cancer, disease stage is typically determined at diagnosis by a combination of clinical examination and pelvic magnetic resonance imaging (MRI), complemented with whole-body 18F-fluoro-deoxyglucose (FDG) positron emission tomography/computed tomography (PET/CT) for evaluation of lymph nodes and distant metastases in more advanced cases. Current treatment standard for locally advanced cervical cancer (LACC; International Federation of Gynecology and Obstetrics [FIGO] stage ≥IB2 or node-positive) is definitive concurrent chemoradiotherapy (cCRT), consisting of weekly cisplatin in combination with external beam radiotherapy (EBRT), followed by brachytherapy (1).
To date, the radiotherapy schedule is generally identical for all patients, irrespective of tumor stage or other prognostic characteristics. In the vast majority, this regime results in a complete local response (2), but there is considerable variation in the course of response between individual patients. In some patients, a significant tumor residue remains after the first weeks of EBRT (before brachytherapy), while others are “early responders” that already show a (near-)complete volume reduction at this stage (3). In the future, these “early responders” might benefit from an early start of the subsequent brachytherapy, aiming to reduce overall treatment time, or from dose de-escalation to limit treatment toxicity while maintaining a good oncological outcome (4). To facilitate such personalized treatments, tools to predict early response will be required.
One approach could be to extract such predictive information from imaging. Studies in various cancer types have shown promising results for imaging biomarkers derived from MRI and FDG-PET/CT as predictors for response and prognosis (5–8). For cervical cancer, there have been reports that markers from pre-treatment MRI or FDG-PET/CT have potential to predict “late” outcomes, including disease-free and overall survival (9–16) or final treatment response after completion of cCRT (17–26); however, so far, no studies have focused specically on prediction of early treatment response. Furthermore, of the existing studies, only a few combined FDG-PET/CT and MRI within one patient cohort (9,12,18,27).
The aim of the present study was to determine whether quantitative imaging markers from baseline staging MRI and FDG-PET/CT have potential as predictors of early response to cCRT in LACC, and whether combining parameters from both modalities has complementary value.
Material and Methods
This retrospective study was approved by the institutional ethical review board, informed consent was waived. The study workflow is illustrated in Fig. 1.

Study workflow. aMRI acquired during cCRT for clinical brachytherapy planning. For the purpose of this study, this MRI was used for volumetric response measurement. cCRT, concurrent chemoradiotherapy; MRI, magnetic resonance imaging.
Patients and treatment
Patients who underwent FDG-PET/CT and MRI in our institution for pre-treatment staging of primary cervical cancer (January 2011 to March 2018) were identified. From this group, 46 patients met the inclusion criteria: (i) LACC (FIGO stage ≥IB2 or node-positive (28)); (ii) treatment by cCRT (with curative intent) and at least four of six weekly cycles of cisplatin completed; (iii) pre-treatment MRI at 3.0 T including T2-weighted (T2W) and diffusion-weighted imaging (DWI); and (iv) available MRI at 3–4 weeks after EBRT initiation, to determine “early response.”
Routine cCRT consisted of pelvic EBRT of 46 Gy (2 Gy/fraction, five fractions/week). The radiation field was extended to the level of the renal veins in cases of suspicious para-aortic nodes. A sequential boost (14 Gy, 2 Gy/fraction, five fractions/week) was administered to suspected sites of pelvic lymph-node involvement. EBRT was followed by brachytherapy to a total dose equivalent of 90 Gy on the high-risk clinical target volume (cervix and tumor), in 3–4 fractions. Radiotherapy was accompanied by weekly cisplatin (40 mg/m2 body surface area) for six weeks, starting on day 1 of EBRT.
MRI
Pre-therapy pelvic MRI was performed at 3.0 T (Intera Achieva [+/− dStream] or Ingenia system, Philips Healthcare) with an external surface coil. The protocol included anatomical fast spin echo (FSE) T2W sequences in three orthogonal planes and an axial single-shot echo planar imaging (ssEPI) DWI sequence, with three b-values (b 0 up to b 750–1000). Axial T2W and DWI sequences were angled in identical planes (perpendicular to the cervical canal). Protocol details are provided in Table 1. Spasmolytics were not administered.
MRI protocol used for primary staging and quantitative imaging evaluation.
*Some changes in protocol and sequence parameters occurred during the study period: in 18 patients, ADC was calculated using slightly different b-values: 0, 188, 750. In two patients, ADC was calculated using b-values 0, 200, 1000.
ADC, apparent diffusion coefficient; DWI, diffusion-weighted imaging; EPI, echo planar imaging; FSE, fast spin echo; MRI, magnetic resonance imaging; SPAIR, spectral attenuated inversion recovery; T2W, T2-weighted.
FDG-PET/CT imaging
FDG-PET/CTs were performed on a hybrid PET/CT scanner (either Gemini TF 16 or Gemini TF Big Bore, Philips Healthcare). After 6 h of fasting (target blood glucose level <10 mmol/L), patients received an intravenous 2-deoxy-2-[18F]fluoro-D-glucose (FDG) bolus of 180 MBq (for body mass index [BMI] ≤28 kg/m2) or 240 MBq (BMI >28 kg/m2), followed by an accumulation period of 60 ± 5 min. Scanning ran from the skull base to upper thighs, with 2 min/bed position (reconstructed to 4-mm slices, 4 × 4 mm pixels), with a non-enhanced CT (120–140 kV, target energy 40 mAs with automatic dose modulation) for attenuation correction and anatomical correlation (reconstructed to 2 mm and 5 mm slices, 1.17 × 1.17 mm pixels).
Tumor segmentation
A board-certified abdominal radiologist (DL, with ±7 years of pelvic MRI experience) manually segmented whole-tumor volumes on the primary staging axial T2W-MRI and subsequently (in the same session) on high b-value DWI, using dedicated open-source software (3D Slicer, version 4.8.1). To assess inter-observer variation, a random subset of 15 cases was additionally segmented by a second board-certified radiologist (MM, with similar experience level), blinded for the results of the first reader.
FDG-PET/CT images were processed by a nuclear medicine physician (WV, with 13 years of experience), using the same software. The FDG-avid tumor volume was first manually segmented, while carefully excluding adjacent structures with high physiological FDG signal (e.g. urine in the bladder). The metabolic tumor volume was derived semi-automatically from this segmentation using a threshold of 42% of the SUVmax (MTV42%, according to methods previously reported (11,13,29)).
Quantitative image analysis
Using the open-source software PyRadiomics (version 2.1.0 (30)) the following quantitative parameters were extracted from the segmented volumes: T2W volume; DWI volume; mean T2W signal (T2W-signalmean); mean apparent diffusion coefficient (ADCmean); the SD of the ADC (ADCSD); SUVmax; and MTV42%. These specific parameters were selected as they represent relatively simple (first-order) parameters reflecting tumor size, cellularity, heterogeneity, and metabolism, all of which have shown potential for response prediction in previous studies (10,16,17,24). ADC values were calculated using a mono-exponential decay model including three b-values (b = 0, b = 88/200, b = 750/800/1000 s/mm2). MRI data were resampled to account for variation in the voxel dimensions, and signal intensity was normalized (by subtracting the image mean value and dividing it by the SD).
Outcome definition (early response)
The primary study outcome was the early response to cCRT, defined as the residual tumor volume on MRI (routinely performed for the purpose of brachytherapy planning) after 3–4 weeks of cCRT. This outcome was chosen as the residual tumor volume after EBRT but before brachytherapy has previously been described to correlate with final local control after completion of treatment (19,31,32). The volume threshold was set at <10 cm3, based on a previously reported cut-off (19). The tumor volume after EBRT was segmented on T2W-MRI by the same reader and using the same procedure and segmentation methods as for the initial tumor segmentations. In 29/46 patients, response evaluation was performed on a 1.5-T scanner (Intera Achieva, Philips Healthcare) using similar protocols as the standard 3.0-T protocol described above and in Table 1.
Statistical analysis
Inter-observer variability was determined with the intraclass coefficient (ICC) in a two-way random effects model. The predictive value of the different imaging parameters (T2W volume, T2W signalmean, DWI volume, ADCmean, ADCSD, SUVmax, MTV42%), together with clinical FIGO stage, to predict early response to cCRT was assessed using multivariable logistic regression analysis with bi-directional stepwise selection based on the Akaike Information Criterion (AIC) (33). FIGO stage was derived from the patients’ electronical medical records (according to the 2009 FIGO staging system, routinely used during the inclusion period of the present study) (28). In case of strong correlation between different imaging parameters (Pearson’s correlation coefficient ρ ≥ 0.8), only one of these parameters was used in the variable selection process to avoid effects of multicollinearity. Model performance was determined by calculating the area under the receiver operating characteristic (ROC) curve (AUC), and odds ratios (ORs) were calculated for all parameters selected in the model. To assess model performance in new, “independent” data, leave-one-out cross-validation (LOOCV) was performed. For LOOCV, each patient is left out of the dataset once and a prediction model is generated on the remaining cases (46 iterations for this study). Probability of early response is calculated by the model for the left-out patient. The cross-validated AUC is determined on the total of these predictions. A 95% bootstrap confidence interval for the cross-validated AUC was obtained by repeating the cross-validation process using 1000 bootstrapped samples. Univariable logistic regression analysis for the individual variables was performed independent of the multivariable analysis. Baseline variables were compared between outcome groups using Wilcoxon Mann–Whitney U test for independent samples or Fisher’s exact test.
Statistical analyses were performed using R version 3.4.3 (R Foundation for Statistical Computing, Vienna, Austria).
Results
Baseline characteristics
Baseline patient characteristics are given in Table 2. The median age was 51 years (age range = 27–75 years). Most patients had FIGO stage IIB: 34/46 (74%). In total 30/46 (65%) patients were classified as early responders (residual T2W volume <10 cm3). Total treatment time (EBRT + brachytherapy) was in the range of 5.3–8.3 weeks (median = 6.3 weeks).
Baseline characteristics of included patients (n = 46).
Values are given as n (%) or median (range).
*P > 0.05 was considered statistically significant.
†Stage according to the FIGO staging system of 2009.
‡Fisher’s exact test, others: Wilcoxon Mann–Whitney U test (independent samples).
EBRT, external beam radiotherapy; FIGO, International Federation of Gynecology and Obstetrics; MRI, magnetic resonance imaging; T2W, T2-weighted.
Inter-observer variability
Inter-observer agreement was excellent for all quantitative image parameters, with ICCs in the range of 0.916–0.995 (Table 3).
ICC for MRI parameters.
ADC, apparent diffusion coefficient; CI, confidence interval; DWI, diffusion-weighted imaging; EBRT, external beam radiotherapy; ICC, intraclass correlation coefficient; MRI, magnetic resonance imaging; T2W, T2-weighted.
Univariable analysis
Results of the univariable analysis are provided in Table 4. Among the quantitative imaging parameters, all parameters related to the pre-therapy primary tumor volume (T2W volume, DWI volume, and MTV42%) were significantly associated with early response, with ORs in the range of 0.94–0.95 (per cm3), indicating that a smaller tumor volume is associated with early response to cCRT. Of the other parameters, only ADCSD and T2W-signalmean showed borderline-significant associations (OR = 1.06, P = 0.07 and OR = 0.98, P = 0.09, respectively). FIGO stage category was not significantly associated with early response (OR = 0.60, P = 0.50).
Univariable association with early response.
*P values <0.05 were considered statistically significant.
ADC, apparent diffusion coefficient; CI, confidence interval; DWI, diffusion-weighted imaging; EBRT, external beam radiotherapy; FIGO, International Federation of Gynecology and Obstetrics; MTV, metabolic tumor volume; OR, odds ratio; T2W, T2-weighted.
Multivariable prediction model
Multivariable analysis was performed independently of the univariate results. Since a strong correlation (ρ = 0.8–0.9, P <0.01) was found between the three volume parameters (T2W volume, DWI volume, and MTV42%), T2W volume was chosen as the only volume-based parameter entered in the variable selection process because it corresponds best with the outcome definition (volume on T2W-MRI after EBRT). Results are summarized in Table 5. The multivariate prediction model included T2W volume (OR = 0.94, P = 0.003) and SUVmax (OR = 1.15, P = 0.18) as the selected predictors. Median pre-CRT T2W volume was 24.8 cm3 (range = 7.2–53.7 cm3) for the early responders versus 64.1 cm3 (range = 21.2–134.9 cm3) for the remaining patients. Median SUVmax for the early responders was 15.3 (range = 6.7–26.1) versus 15.9 (range = 4.8–33.6) for the remaining patients. The model’s performance to predict early response was AUC 0.82 within the current dataset. AUC by cross-validation (LOOCV) was 0.68 (95% CI = 0.41–0.81).
Multivariable logistic prediction model.
*There was a strong correlation (ρ = 0.8–0.9, P < 0.01) between T2W volume, DWI volume, and MTV42%. Therefore, only T2W volume was used in the variable selection process.
†LOOCV with bootstrapping was used to account for overfitting and to estimate how the model would perform on new, independent data.
ADC, apparent diffusion coefficient; AUC, area under the receiver operating characteristic curve; CI, confidence interval; LOOCV, leave-one-out cross-validation; OR, odds ratio; T2W, T2-weighted.
Discussion
Previous studies have shown that pre-therapy quantitative imaging markers from FDG-PET/CT and MRI have potential to predict “late” outcomes including the final treatment response (17–22,24–26) and survival (9–15) in patients with LACC. In addition to these previous works, the present study specifically assessed whether quantitative imaging parameters from baseline staging MRI (including DWI) and FDG-PET/CT may also have potential as predictors of early response to cCRT and could thus play a possible role in future optimization of treatment regimens. Our exploratory results indicate that the pre-therapy tumor volume is the best predictor (with similar results for volumes derived from T2W-MRI, DWI, and FDG-PET/CT in univariable analysis) for early response to EBRT (i.e. response before onset of brachytherapy), with smaller baseline volumes (median 24.8 vs. 64.1 cm3) for the early responders. Of the other parameters, only SUVmax may have some complementary value, though the SUVmax values between both outcome groups overlapped considerably. FIGO stage was not identified as a significant predictor for early response, nor were any of the DWI-related parameters (ADCmean and ADCSD). Overall, the estimated predictive performance achieved with cross-validation was only moderate (AUC ∼ 0.7) and ORs of the selected parameters were in the range of 0.94–1.15, indicating a limited predictive value, which is typically not considered sufficient for clinical decision-making. The present study was exploratory, however, and further and more in-depth research is obviously required to investigate if other (combinations of) imaging and clinical parameters yield higher predictive potential.
It is difficult to directly compare our current results with those reported by previous authors, because their studies were aimed at later endpoints (disease-free and overall survival) or final response after cCRT, while the current study focuses specifically on predicting early response to EBRT. Furthermore, few studies have combined MRI and FDG-PET/CT parameters for prediction of outcome in cervical cancer. Studies that did, in most cases found FDG-PET-based pre-therapy parameters were associated with the final outcome, including a smaller baseline metabolic tumor volume (11) and higher SUVmax (18) corresponding to a favourable outcome. In contrast, high pre-therapy SUVmax was reported to predict a poor final treatment response by Kidd et al. (16,22) looking at PET parameters only, yet others were unable to confirm an association (10,34). In three MRI-based studies that included the tumor volume on pre-treatment T2W-MRI to predict final treatment response, this parameter was not found to be a significant predictor (15,25,35). However, a study by Schernberg et al. (31) did show a significant association between pre-treatment tumor volumes and survival.
We were unable to detect a significant association between the DWI-derived parameters and early treatment response. Although association of ADCSD nearly reached significance in univariable analysis, this parameter was not identified as an independent predictor in the multivariable analysis. Previous reports on ADC-based parameters to predict response to cCRT in cervical cancer have so far shown conflicting results. Some studies found low pre-treatment ADCmean to be associated with a good final response to cCRT (17,20,24), while others found no significant results (11,15,21,25,35). The role of DWI to predict treatment response in cervical cancer thus remains unclear but appears to be relatively limited.
When tested within our original dataset, the multivariable model (including T2W volume and SUVmax as predictors) yielded an AUC of 0.82. This is likely an overestimation of its actual performance in hypothetical clinical application. Because our limited cohort size did not allow splitting of the dataset into separate training and validation sets, cross-validation of the bi-directional stepwise modelling process was used to simulate model performance on a new independent dataset (i.e. to estimate its clinical potential). This resulted in an AUC of 0.68, which is likely a more realistic approximation of the model’s actual performance. As mentioned, this is certainly not sufficient for clinical decision-making, and further optimization (and larger scale validation) will obviously be required. While FIGO stage—though specifically developed for prognostication—did not contribute to the current model, it will be worthwhile to further explore the added value of clinical factors, as well as additional histological, immunohistochemical, or genetic factors to generate stronger and more comprehensive clinical prediction models. It would also be worthwhile to see if the recently updated FIGO staging system would render different results than the 2009 FIGO staging system routinely used during the clinical inclusion period of the current study. Finally, sophisticated methods of image analysis and postprocessing such as Radiomics or deep learning, often used to assess large sets of variables, may prove to be of added value as the first available publications on these relatively novel methods have shown some promising preliminary results (9,18,36). In the present study, we consciously limited the number of variables to prevent overfitting in a small dataset and identified a small set of relatively intuitive (first-order) features based on previous works. We acknowledge, however, that with this approach other potentially valuable imaging biomarkers may have been overlooked.
The present study has some limitations, in addition to the aforementioned small patient cohort. First, some variations in MR acquisition protocols occurred over time, which are difficult to avoid during retrospective analysis of clinical data. We aimed to account for this by normalizing the MRI signal intensity (T2W and ADC). Second, a validated reference standard to classify early response during cCRT does not exist. We therefore chose the residual volume on T2W-MRI after 3–4 weeks of EBRT as a measure of early response, using a volume threshold of <10 cm3 derived from a study by Mongula et al. (19), who reported this as a cut-off that correlates with local control after brachytherapy. Finally, part of the MRIs used for response classification were acquired at 1.5 T instead of 3.0 T (29/46 patients). Given the comparable visual quality and resolution, this likely had little impact on the tumor volumes used for response classification.
In conclusion, these preliminary data suggest that parameters from baseline MRI and FDG-PET/CT (particularly primary tumor volume) may contribute to prediction of early response to cCRT in LACC, although the predictive performance achieved was limited. Future larger-scale studies are required to expand this research, by combining imaging markers with other potential predictors and by exploring more sophisticated image analysis techniques, to build prediction tools that can truly aid in further treatment personalization in cervical cancer.
Footnotes
Acknowledgements
The authors thank Prof. Dr. Michael Hauptmann, former head of the Biostatistics group of the Netherlands Cancer Institute, for his supervision of the statistical analysis and critical revision of the manuscript, and Chèrita Sombroek, physician assistant at the department of Radiation Oncology of the Netherlands Cancer Institute, for her assistance in cohort identification.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
