Abstract
Background:
In most staging systems, 45 years of age is used to differentiate low risk thyroid cancer from high risk thyroid cancer. However, recent studies have questioned both the precise 45 year age point and the concept of using a binary cut off as accurate predictors of disease specific mortality.
Methods:
A cohort of 3664 thyroid cancer patients that received surgery and adjuvant treatment at Memorial Sloan Kettering Cancer Center (MSKCC) from the years 1985 to 2010 were analyzed to determine the significance of age at diagnosis as a categorical variable at a variety of age cutoffs (5 year intervals between 30 and 70 years of age). The unadjusted and adjusted hazard ratio for the association between disease-specific survival and age was determined using a Cox proportional hazards model adjusted for other predictive variables sex, histology, and pathological T, N, and M status. Furthermore, predictive nomograms of disease-specific mortality were created and validated on an external dataset of 4551 patients to evaluate the impact of age at diagnosis as both a categorical and continuous variable.
Results:
In the MSKCC cohort, with a median follow-up time of 54 months (range 1–332), there were 59 deaths from thyroid cancer with a 10 year disease-specific survival of 96%. Adjusted hazard ratios for all age cutoffs from age 30 to age 70 years were significant. There was no specific cutoff age which risk stratifies patients with differentiated thyroid cancer (DTC). Categorizing age into five strata (<40, 40–49, 50–59, 60–69 and >70 years) showed a 37-fold increase in hazard ratio from age <40 years to age >70 years. A predictive nomogram using age as a continuous variable with other predictive variables had a high concordance index of 96%. Validation on the external cohort had a concordance index of 73%.
Conclusions:
Mortality from DTC increases progressively with advancing age. There is no specific cutoff age which risk stratifies patients with DTC. A predictive nomogram using age as a continuous variable may be a more appropriate tool for stratifying patients with DTC and for predicting outcome.
Introduction
T
Unlike any other adult cancer, thyroid cancer is the only cancer that has age as a prognostic factor in the TNM staging system. The current staging system uses the age of 45 as the cutoff with patients 45 years of age or over having poorer outcome. The reason why 45 years of age was selected as the prognostic cutoff is unclear. One reason could be based upon the observation that 45 years of age is the median age of patients in multiple series. However, a report by Byar et al. (9) in 1979 was the first to show survival was poorer at the age of 45 years. It is widely thought that this age cutoff is too young and that a better cutoff age may be age over 55 years. There is also some argument that a binary variable for age may not be appropriate and that a staging system utilizing age as a continuous variable may be more appropriate. For example, a 44-year-old man with a T1N1b thyroid cancer is currently staged as stage 1 with a survival of 100% at 5 years. However, a 46-year-old man with an identical tumor is staged as stage 4 with an estimated 5-year survival of 51%. Clearly the difference of 1 year should not move patients from stage 1 to stage 4 disease, reducing the estimate of survival by 49%. The aim of our study was to challenge the traditional concept that there is a distinct age cutoff around which outcomes are significantly different.
Methods
After internal review board approval, a database of 3664 patients with differentiated thyroid cancer who received their primary therapy at Memorial Sloan Kettering Cancer Center (MSKCC) was created. These patients all received surgery and adjuvant treatment at MSKCC during the years 1985 to 2010. Patient, tumor, and treatment characteristics were extracted from patient electronic records by clinicians. Patient, tumor, and treatment characteristics for all patients are summarized in Table 1. Characteristics of an external dataset of 4551 patients, comprising patients from Brazil (n = 646), Toronto (n = 925), Sydney (n = 1129), and University of California, San Francisco (n = 1851) used for validation are also shown.
MSKCC, Memorial Sloan Kettering Cancer Center.
Causal hypothesis
Our causal hypothesis was that cancer-specific survival from thyroid cancer is associated with age after adjusting for other prognostic variables. Our outcome variable was disease (cancer) specific survival (DSS). There were 312 overall deaths, of which 59 were due to thyroid cancer. DSS was calculated using the date of last follow up with a MSKCC physician from the thyroid cancer multidisciplinary team. Details of death were determined from the social security death index and hospital records. All patients were cross-linked to the social security death index. The cause of death in patients who were still under follow up was determined from patient medical records. However, in patients who were lost to follow-up, death from thyroid cancer was defined if the patient had active structural disease at the last follow up date and died within three years of this follow-up date.
Assessment of age as a prognostic factor
Variables predictive of DSS were determined by univariable analysis using the Kaplan Meier method. The significance of age was determined as a categorical variable at a variety of age cutoffs from 30 to 70 years at 5-year intervals. The unadjusted and adjusted hazard ratio (HR) for the association between DSS and age was determined using a Cox proportional hazards model. In this model, age was adjusted for other predictive variables including sex, histology, pathological T (tumor) status, pathological N (node) status, and M (metastases) status. Pathological N status was categorized into N0, Nx, N1a, N1b. N0 was defined as negative pathological neck nodes. Nx was defined as no clinically enlarged or suspicious lymph nodes, N1a was defined as positive pathological nodes in the central neck, and N1b defined as positive pathological lymph nodes in the lateral neck. We further categorized the N class into three categorical variables N0/Nx, N1a, and N1b. M status was categorized into a binary variable M0/Mx and M1. M0/Mx was defined as no distant metastases at presentation, and M1 defined as distant metastases at presentation. To test the proportional hazards assumption, time-dependent variables of the interaction between the predictors, and the logarithm of time to event (DSS time) were introduced into the model. The proportional hazards assumption was reasonable in all models.
Creation of nomograms using Cox proportional regression analysis
Cox proportional hazards models were created for age as a categorical variable (stratified into the following strata age <40, 40–49, 50–59, 60–69 and >70 years) and also as a continuous variable. In each model age was adjusted for other variables prognostic for disease-specific mortality. Nomograms predictive of five-year disease-specific mortality were created from the models. Predictive accuracy was assessed by discrimination (the ability of a model to separate patients with different outcomes). Discrimination was measured with the concordance index, similar to the area under the receiver operating characteristic curve: values range from 0.5 (no discrimination) to 1.0 (perfect discrimination). The final nomogram was validated using the external dataset and concordance index determined.
Statistical analyses
Statistical analysis was carried out using SPSS (version 21, IBM Corporation). Patient, tumor, and treatment characteristics were compared using Pearson's chi-squared test. Survival outcomes were analyzed using the Kaplan-Meier method. Unadjusted and adjusted HRs were calculated using the Cox proportional hazard model. A p-value less than 0.05 was considered significant. R version 3.0.2 (The R Foundation for Statistical Computing) was used to create the nomograms and for validation.
Results
Table 1 describes the patient and tumor characteristics of the MSKCC and external cohorts in detail. The MSKCC cohort had more male patients, more patients over 45 years of age, and had a greater percentage of advanced T stage tumors. However, the MSKCC cohort tended to have fewer patients with positive neck disease and slightly less distant metastases. The distribution of age within all cohorts was similar (Supplementary Figs. S1, S2; Supplementary Data are available online at
Age as a prognostic variable
Table 2 shows the impact that age had on outcome as a binary variable. All cutoff ages from 30 to 70 years showed that the older cohort had poorer DSS (Fig. 1). Multivariable analysis for age at different cutoffs, adjusting for the other predictor variables using Cox proportional hazards regression, also showed that age was significant at all cutoff values from 30 to 70 years. The HRs for adjusted age cutoffs were ranging from 7.09 to 19.03.

Ten year disease-specific survival (DSS) at different age cutoffs from age 30 to age 70 years.
Adjusted for sex, pathology, and T (tumor), N (node), and M (metastases).
Age as a categorical variable
We then categorized age into the following categories: <40, 40–49, 50–59, 60–69, and >70 years and calculated the HR for these age strata after adjusting for the other prognostic variables. The unadjusted and adjusted HRs are shown in Table 3. The data demonstrate that the HR progressively increases with increasing age. Patients between 50–59 years had a 14-fold increased risk of death compared to patients less than 40 years after adjusting for sex, histology, and T, N, and M status. Patients over the age of 70 years had a 37-fold increased risk of death compared to patients less than 40 years after adjusting for sex, histology, and T, N, and M status. The unadjusted and adjusted HRs for the other variables in this model are also shown in Table 3. In our adjusted model, the other important variables predictive of DSS were a nonpapillary pathology (follicular HR = 4.36, Hürthle HR = 3), pathological T4 status (HR = 3.66), and M1 status (HR = 14.85).
Age (categorical) adjusted for sex, pathology, and T, N, and M stage.
Age as a continuous variable
We then calculated the unadjusted and adjusted HRs for age as a continuous variable, again adjusting for the same variables of sex, pathology, and T, N, and M stage. This is shown in Table 4. The adjusted HR was 1.076 indicating that for every additional year the risk of death progressively increased.
Age (continuous) adjusted for sex, pathology, and T, N, and M stage.
Nomograms predictive of disease specific mortality using age as a categorical and continuous variable
A nomogram was created using the age categories described above (<40, 40–49, 50–59, 60–69, and >70 years) (Fig. 2A). The concordance index for this nomogram was 96%. We then constructed a nomogram using age as a continuous variable along with the other prognostic variables. The nomogram for this model is shown in Figure 2B. The concordance index for the nomogram was 96%. Validation of this nomogram with the external dataset had a concordance index of 73%. Using this nomogram, the predicted 5-year mortality for 4 hypothetical patients at different ages and stages is illustrated in Figure 3. This shows how patients with an identical stage of disease have a progressively poorer outcome as age increases. To illustrate how such a nomogram works, Figure 2C illustrates a hypothetical patient, a 50-year-old female with a T2N1bM0 papillary thyroid cancer (PTC) tumor. The total score for this patient is 60, which equates to a 50 year predicted mortality of 0.5%.

(

Five year disease-specific mortality for four hypothetical patients with advancing age.
Discussion
The aim of our study was to investigate the association between age and survival in thyroid cancer to determine if there is a specific age cutoff predictive of outcome. We show that the HR for mortality increased with increasing age. Based on this observation we were able to create statistical models predictive of mortality, one using age as a categorical variable and the other using age as a continuous variable. These nomograms may be a more appropriate method of predicting outcome in patients with PTC and may supersede the current AJCC staging classification system.
The first study that showed age to be important in thyroid cancer was reported by Byar et al. in 1979 (9). Since then, several different staging systems for differentiated thyroid cancer have been described all of which use an age cutoff as a prognostic variable. The Mayo Clinic's Metastasis, Age, Completeness of resection, Invasion, and Size classification has an age of 40 years as the cutoff (10). The Lahey Clinic's Age, Metastases, Extent, and Size system has different cutoffs for men (age 40) and women (age 50) (11). The Grade, Age, Metastases, Extent, and Size staging developed at Memorial Sloan Kettering assigned patients above age 45 a higher score (12). The most important staging system is the AJCC staging protocol for differentiated thyroid cancer (DTC) (Supplementary Table S1). In this system, patients under 45 years are either stage 1 (absence of metastatic disease) or 2 (presence of metastatic disease). In contrast, patients age 45 and above are divided into stages 1 through 4c (8). This 45 year age cutoff has been a component since the second edition of the AJCC Cancer Staging Manual (13). However, there is evidence that age adds to the risk of mortality starting at age 35 (14,15). Furthermore, recent studies by Orosco et al. (16) and by Londero et al. (17) have suggested that although age is an important prognostic factor, there is no specific age cut off which predicts survival. Recent studies by Yang et al. (18) and Banerjee et al. (19) using SEER data have also reported evidence that death increases progressively with increasing age and that age as a continuous variable may be more appropriate.
The initial age division for our multivariable analysis was chosen according to the current AJCC staging protocol (45 years). Our adjusted HR showed this age cutoff to be significant, but all other age cutoffs were also significant. This supports recent studies that have indicated that there may not be a sudden increase in mortality risk at one age point as the AJCC staging implies, but that the relationship between increasing age and risk might be more complex (20,21). It also supports reports that survival disadvantages for DTC patients may come into effect as early as age 35 (9,22). Based on our analysis, it is evident that older age increases the likelihood of thyroid cancer–specific mortality. However, age as a binary variable does not show a specific age cutoff suggesting that while age is an important factor to assess an individual patient's prognosis, a specific binary age cutoff cannot be used to stratify patients into unique risk categories. Indeed, our study shows that mortality increased with older age, with patients over 70 years having a 37-fold increased risk of death compared to patients less than 40 years. The current TNM staging system therefore seems to be insufficient in predicting the prognosis for patients with thyroid cancer. Our data suggests that the utilization of age either as a categorical variable stratified into different age ranges or age as a continuous variable would be a more rational approach for predicting outcome in these patients. Such a system would require the use of age in a predictive model containing the other important prognostic factors such as T status, N status, and M status.
Having established that age—either as a categorical variable or as a continuous variable—is more appropriate for predicting prognosis, we therefore designed two nomograms to create tools for individual risk prediction. Nomograms are statistical tools shown to accurately predict outcome in an individual patient by utilizing multiple variables in addition to the standard TNM variables. These nomograms are created using regression analysis (23). Well-designed nomograms have outperformed the projections of experienced clinicians (24,25) and have been incorporated into clinical trial inclusion criteria and National Comprehensive Cancer Network guidelines (26). Both nomograms developed here have extremely high concordance indexes of over 95% illustrating their predictive accuracy. Moreover, we were then able to validate the nomograms on a separate independent external dataset with a concordance index of 73%. A previous report by Yang et al. (18) has already described a nomogram for death in thyroid cancer using data from the SEER database. We would argue that our nomograms are more accurate because of the inherent limitations in the SEER database. In the MSKCC cohort, all case records were retrospectively reviewed by physicians with an interest in thyroid surgery and cancer. Therefore the T, N, and M status is extremely accurate. Importantly, all pathology has also been reviewed by a pathologist with a special interest in thyroid cancer. In the SEER data, clinical data is not reviewed by physicians and pathology review is not carried out. The data is collected from both academic and also nonacademic centers. Therefore, the SEER data may potentially consist of more poorly differentiated forms of thyroid cancer, anaplastic cancer, medullary cancer. The T, N, and M classifications in the SEER data may also be inaccurate. The data from single institutions specialized in thyroid cancer provides both a high volume of cases as well as expertise in treatment and pathology; this is expected to generate more accurate data but does limit the cohort size. Our study is also of importance in that it describes two nomograms that may have importance in the future risk estimation of individual patients with thyroid cancer. Since the AJCC is now planning on adopting individual risk estimation nomograms into the staging manual for all types of cancer, our study is both timely and relevant for these revisions.
It is important to discuss the limitations of our study. Firstly, this is a retrospective study and is therefore susceptible to the limitations of retrospective studies, such as incomplete data from chart review, and selection bias by physicians regarding surgical and adjuvant therapy; however, at MSKCC the surgeons and physicians treating these patients have many years of experience working as a multidisciplinary team managing several hundred patients annually. This unified treatment approach will therefore limit the selection bias present within the dataset. Secondly, the sample size may not be representative of the United States and international populations of patients with thyroid cancer. MSKCC is a tertiary care cancer center with an international reputation in the treatment of thyroid cancer. The quality of surgical treatment and adjuvant therapy with radioactive iodine or external beam radiation may be superior to that delivered by nonacademic centers and smaller institutions. This may be a threat to validity to our study, making any extrapolation to the general population invalid. To address this, we obtained external datasets from other international cancer centers specialized in thyroid cancer management (Brazil, Toronto, University of California–San Francisco, and Sydney). This external dataset consisted of 4551 patients from which there were 88 disease specific deaths. Comparison of the age distribution of patients with DTC in the internal and external cohorts showed a similar distribution with a median age of 45–46 years (Supplementary Figs. S1 and S2). This external dataset had different clinical and tumor characteristics compared to the MSKCC cohort. Despite this, our nomograms were validated with concordance indices of 73%. This indicates that our nomograms may be translatable to other institutions and populations. However, one could still argue that there is still a threat to external validity since these four external cohorts are also institutions with specialized high-volume thyroid cancer management. Applying our results to the general population may therefore still not be valid. A third limitation relates to the limited number of events. Even with the collaborative effort employed in this study, there were only 59 deaths in the MSKCC cohort and 88 deaths in the external cohort. However, despite limited events, our multivariable model converged and was stable, indicating that the conclusions from the model were justified.
In conclusion, our data show that mortality from DTC increases progressively with advancing age. There is no specific cutoff age which risk stratifies patients with DTC. Risk associated with age may be better expressed as a continuous function or as multiple categories. A predictive nomogram using age as a continuous variable may be a more appropriate tool for stratifying patients with DTC and for predicting outcome.
Footnotes
Acknowledgments
Ian Ganly, R. Michael Tuttle, Snehal G. Patel contributed to study design, data analysis and interpretation, and manuscript writing. Iain J. Nixon, Laura Y. Wang, Frank L. Palmer, Jocelyn C. Migliacci contributed data collection and analysis. Ahmad Aniss, Mark Sywak, Antoine E. Eskander, Jeremy L. Freeman, Michael J. Campbell, Wen T. Shen, Fernanda Vaisman, Denise Momesso, Rossana Corbo, Mario Vaisman, Ashok Shaha, and Jatin Shah participated in data collection and manuscript editing.
Author Disclosure Statement
No competing financial interests exist.
