Abstract
Background:
We previously demonstrated the clinical utility of using response to therapy variables obtained during the first 2 years of follow-up to actively modify initial risk estimates which were obtained using standard clinic-pathologic staging systems. While our proposed dynamic risk stratification system accurately reclassified patients who demonstrated an excellent response to therapy as low-risk patients, it grouped patients with either biochemical or structural evidence of disease into a single incomplete response to therapy cohort. This cohort included a wide variety of patients ranging from very minor thyroglobulin (Tg) elevations in the absence of structurally identifiable disease to widespread, progressive structural disease. Here we determined whether subdivision of the incomplete response to therapy category more precisely predicted clinical outcomes. We hypothesized that patients with an incomplete response to therapy based on persistently abnormal Tg values alone would have better clinical outcomes than patients having structurally identifiable disease.
Methods:
Following total thyroidectomy and radioactive iodine (RAI) ablation, 192 adult thyroid cancer patients were retrospectively identified as having either a biochemical incomplete response (abnormal Tg without structural evidence of disease) or structural incomplete response (structurally identifiable disease with or without abnormal Tg) as the best response to initial therapy within the first 24 months after RAI ablation. Clinical outcomes evaluated included structural disease progression, biochemical disease progression, and overall survival.
Results:
Sixty-three patients (33%) had a biochemical incomplete response while 129 (67%) had a structural incomplete response. Eleven to 156 months after evaluation of their responses (mean=70 months), patients with structural incomplete response were significantly more likely to have structural evidence of disease at final follow-up (37% vs. 17%, p=0.0004), structural progression (52% vs. 5%, p<0.001), biochemical progression (45% vs. 11%, p<0.001), and death from disease (38% vs. 0%, p<0.0001) than patients demonstrating a biochemical incomplete response. Overall survival was significantly better in patients with either a biochemical incomplete response or a loco-regional structural incomplete response than patients demonstrating a structural incomplete response with distant metastasis (Kaplan-Meier analysis, p<0.0001).
Conclusions:
A structural incomplete response to initial therapy is associated with significantly worse clinical outcome than a biochemical incomplete response to therapy.
Introduction
In a cohort of 588 well differentiated thyroid cancer patients treated with total thyroidectomy and radioactive iodine remnant ablation (RRA), the risk of having persistent or recurrent disease was significantly higher in patients having an incomplete response to therapy (96%) than in patients having an excellent (4%) or acceptable response to therapy (13%) (See Table 1 for definitions of response to therapy) (14). While the incomplete response to therapy category adequately captures patients with persistent/recurrent disease, it is a very heterogeneous cohort which includes patients with a wide variety of clinical conditions that can range from minimal thyroglobulin (Tg) elevations without evidence of structurally identifiable disease to wide spread, macroscopic structural distant metastases. So while an incomplete response to therapy accurately captures patients that had not been cured after initial therapy, the clinical outcomes of the patients in this group can widely vary.
Response to therapy definitions in Tuttle et al. (14).
Tg, thyroglobulin; US, ultrasound; NED, no evidence of disease.
Therefore, the goal of this study was to reanalyze the incomplete response to therapy category in an effort to subdivide this group into more homogeneous cohorts that may better correlate with final clinical outcomes. We hypothesized that patients classified as incomplete response on the basis of serum Tg elevations alone without structurally identifiable disease (biochemical incomplete response) would have significantly better clinical outcomes than patients classified as incomplete response based on persistent or newly identified structural disease (structural incomplete response).
Methods
After obtaining Institutional Review Board approval, we retrospectively reviewed the electronic medical records of 217 adult differentiated thyroid cancer patients evaluated at Memorial Sloan Kettering Cancer Center between January 1994 and December 2004 who demonstrated an incomplete response to total thyroidectomy and RRA (14). Of the 217 potentially eligible patients, 25 were excluded from this analysis for the following reasons: inadequate follow-up information (n=6), Tg determinations performed in different assays overtime (n=16), and interfering antibodies anti-Tg (TgAb) (n=3). A minimum of 3 years of follow-up was required for entry into the study unless one of the clinical end points (recurrence or death) was reached before that time point.
Each patient was risk-stratified using the 7th edition of the American Joint Committee on Cancer/Union Internationale Contre le Cancer staging system (Stage I, II, III, or IV) and the American Thyroid Association risk of recurrence stratification system (low, intermediate, or high-risk of recurrence) (5,11).
With regard to response to therapy assessment, each patient was classified as having either a biochemical incomplete response (suppressed Tg ≥1 ng/mL or stimulated Tg ≥10 ng/mL in the absence of structurally identifiable disease) or structural incomplete response (persistent/recurrent structurally identifiable disease with or without Tg elevation) based on follow-up data obtained within the first 2 years of follow-up after total thyroidectomy and RRA. Persistent/recurrent structural disease was defined as (i) positive cytology/histology; or (ii) highly suspicious lymph nodes or thyroid bed nodules on the neck ultrasound (US) (hypervascularity, cystic areas, heterogeneous content, rounded shape, enlargement on follow-up); or (iii) cross-sectional imaging highly suspicious for metastatic disease. The first fluoro-deoxyglucose (FDG) positron emission tomography (PET)/computed tomography (CT) done during the first 2 years of follow-up in 133/192 patients was used to classify the metastatic disease as either FDG avid (PET positive) or non-FDG avid (PET negative).
Patients were considered to have non iodine avid disease if at the time of the ablation they had evidence of structural disease in the single-photon emission computed tomography (SPECT)/CT or other cross-sectional images and did not have iodine uptake in the post-therapy scan or if they had a negative post-therapy scan anytime within the first 2 years with biochemical or structural evidence of disease.
During follow-up, structural disease progression was defined as: (i) enlargement of previously known metastases; (ii) development of new metastases during the follow-up, defined as findings on radioactive iodine (RAI) scans, 18-FDG-PET scans, or other cross-sectional imaging highly suspicious for metastatic disease or biopsy proven; or (iii) thyroid-cancer related death. Biochemical disease progression was defined as a progressive rise in serum Tg over time (>10% over baseline) measured in the same assay with similar levels of thyroid-stimulating hormone (TSH) suppression.
Patients were considered to have no evidence of disease (NED) at final follow-up if they had no evidence of structural disease confirmed by biopsy (cytology or histology), cross-sectional imaging (US, CT or nuclear magnetic resonance) or functional imaging (RAI scan or 18-FDG-PET scan), and a suppressed Tg <1 ng/mL.
Laboratory studies
Between 1994 and 1997, a variety of Tg assays were used with functional sensitivities of approximately 1 ng/mL. Starting in 1998, all Tg values were measured using the Dynotest-TgS immunoradiometric assay (Brahms Inc., Berlin, Germany, functional sensitivity 0.6 ng/mL normalized to CRM 457). All patients had Tg and TgAb levels while on TSH suppressive therapy at the time of final follow-up. Patients with interfering TgAb were excluded (Dynotest-TgS immunoradiometric assay; Brahms, Inc.).
Statistical analysis
Continuous data are presented as mean and standard deviations with median values. For comparing medians nonparametric Mann–Whitney test was used and for categories we used χ 2 and Fisher's exact tests. Analysis was performed using SPSS software (Version 18.0.1; SPSS, Inc., Chicago, IL).
Results
Of the 192 patients demonstrating an incomplete response to total thyroidectomy and RRA included in this study, 63 (33%) had persistently abnormal serum Tg values without structurally identifiable disease (biochemical incomplete response) while 129 (67%) had structural evidence of persistent disease (structural incomplete response: 54 with only loco-regional disease, 75 with distant metastases with or without loco-regional disease).
Compared with patients with a biochemical incomplete response to therapy, patients with a structural incomplete response were older (50±16 vs. 43±14 years, p=0.002), more likely to be male (47% vs. 22%, p=0.001), more likely to have FDG PET positive disease (45% vs. 0%, p<0.0001), less likely to have RAI avid disease (60% vs. 78%, p=0.015), had a higher median suppressed Tg (20.3 vs. 3.6 ng/mL, p<0.0001) during the first 2 years of follow-up, and received a higher administered activity of RAI for ablation (150 vs. 100 mCi, p<0.0001). Patients with structural incomplete response were also more likely to have distant metastases at diagnosis (AJCC stage IVc disease in 43% vs. 3%, p<0.0001), to have been classified as ATA high risk (67% vs. 21%, p<0.0001), and to have received additional surgery (57% vs. 6%, p<0.0001), external beam radiation therapy (EBRT; 35% vs. 0%, p<0.0001), or systemic therapy after RRA (13% vs. 2%, p<0.0001) (see Table 2).
SD, standard deviation; HCC, Hurthle cell carcinoma; rhTSH, recombinant human thyroid-stimulating hormone; THW, thyroid hormone withdrawal; PET, positron emission tomography; CT, computerized tomography; RAI, radioactive iodine; AJCC, American Joint Committee on Cancer; UICC, Union Internationale Contre le Cancer; ATA, American Thyroid Association; RRA, RAI remnant ablation; EBRT, external beam radiation therapy.
Further, patients with a structural incomplete response to therapy were significantly more likely to have either structural disease progression (52% vs. 5%, p<0.001) or biochemical disease progression (45% vs. 11%, p<0.001) than patients with biochemical incomplete response.
The clinical status at final follow-up was significantly worse in patients having a structural incomplete response to initial therapy than those demonstrating only a biochemical incomplete response (See Table 3). A structural incomplete response was more likely to be associated with structural evidence of disease at final follow-up (37% vs. 17%, p<0.0004) and death from disease (38% vs. 0%, p≤0.0001) than a biochemical incomplete response. Conversely, patients with a biochemical incomplete response to therapy were significantly more likely to have NED at final follow-up than patients with a structural incomplete response (68% vs. 15%, p≤0.0001).
Further, patients classified as having structural incomplete response to therapy because of macroscopic distant metastasis were more likely to have progression of disease or death than patients classified as structurally incomplete on the basis of loco-regional evidence of disease without distant metastases (73.3% vs. 22.2%, p<0.0001). Both biochemical incomplete response to therapy and loco-regional structural incomplete responses were associated with a significantly better overall survival than structural incomplete response to therapy with distant metastases (see Fig. 1, Kaplan–Meier analysis, p<0.0001). Over the course of the study, no deaths were seen in the patients with biochemical incomplete response, while death was the final outcome in 11% of the patients with loco- regional disease and 57% of the patients with distant metastases (p<0.001).

Overall survival is significantly better in patients demonstrating a biochemical (B) incomplete response to therapy or a loco-regional (L-R) structural incomplete response than in patients demonstrating a structural incomplete response to therapy with distant metastases (DM) (Kaplan–Meier analysis, p<0.0001).
As can be seen from Table 3, 62 patients (43 with a biochemical incomplete response and 19 in the structural incomplete response cohorts) were reclassified as NED at the time of their final follow-up. Five of the biochemical incomplete patients had mildly elevated suppressed Tg levels that became undetectable over time without any additional therapy. All of the remaining patients with incomplete response to therapy received additional treatments during follow-up (a median of one additional treatment performed a median of 13 months after initial RRA). These additional treatments included: one additional RAI therapy in 53% (33/62), one additional neck surgery in 10% (6/62), additional neck surgery and RAI therapy in 18% (11/62), and multiple additional RAI treatments in 11% (7/62).
Discussion
Our data clearly demonstrate that two clinically distinct cohorts can be identified within patients classified as having an incomplete response to therapy using our previously proposed restaging system (14). Patients with only biochemical evidence of disease have significantly better clinical outcomes in terms of having a higher likelihood of being NED at final follow-up, a lower likelihood of having biochemical or structural disease progression, and a much lower likelihood of dying from thyroid cancer than patients exhibiting a structural incomplete response to therapy. Interestingly, after a median follow-up of 8 years, structural disease was present at the time of final follow-up in only 17% (8/63 patients) of patients initially classified as having a biochemical incomplete response. Further, patients classified as having a structural incomplete response on the basis of macroscopic distant metastases had much poorer outcomes than either patients with biochemical incomplete response or structural incomplete response based on loco-regional disease without identifiable distant metastases.
Based on these findings, we are proposing a modification of our initial response to therapy classification system to allow a distinction to be made between biochemical incomplete response and structural incomplete response (See Table 4). This modification will allow more accurate prediction of outcomes in patients with an incomplete response to therapy based on the presence or absence of structurally identifiable disease at the time of restaging.
The impact of this subclassification of the incomplete response to therapy cohort is best appreciated when data from our previous study (14) is combined with the results from this study. In Table 5, it is readily apparent that our response to therapy restaging system can identify four separate cohorts of patients that have significantly different clinical outcomes. The best clinical outcomes are seen in those patients demonstrating an excellent response to therapy with undetectable suppressed and stimulated Tg without evidence of structurally identifiable disease. Patients with an acceptable response to therapy also do very well with the majority of patients being NED at final follow-up and a smaller number (13%) having persistent biochemical evidence of disease without identifiable structural disease.
Because the definition of recurrent disease requires a period of NED, patients with persistent disease cannot be classified as having a recurrence.
Patients with a biochemical incomplete response to therapy have higher rates of persistent biochemical (19%) or structural disease (17%), but 68% eventually become NED with no disease specific mortality over a median 8 years of follow-up. Clearly, the worst clinical outcomes are seen in those patients classified as having a structural incomplete response with only 15% being NED at final follow-up, 37% with persistent structural disease, 10% with persistent biochemical evidence of disease, and a 38% disease specific death rate over a median follow-up period of 3.5 years.
In conclusion, our findings demonstrate that restaging within the first 2 years of initial therapy provides important prognostic information in patients with differentiated thyroid cancer. While all patients with an incomplete response to initial therapy are less likely to be classified as NED at final follow-up than patients with either an excellent or acceptable response to therapy, all of the disease specific deaths were restricted to the patients with a structural incomplete response. Therefore, restaging can be used to modify initial risk estimates to provide more accurate and dynamic estimates of risk of recurrence, persistent disease, and disease specific death than static staging systems that are not usually modified over time.
Footnotes
Disclosure Statement
F.V., R.G., and H.T. have nothing to declare. R.M.T. is a consultant to and has received honoraria from the Genzyme Corporation.
