Abstract
Background:
Regional as well as national series show an increasing incidence of thyroid cancer largely small size papillary thyroid carcinoma (PTC). Prognostic scoring systems have been developed, but these do not take into account the rapidly changing case mix, and adjustments may be required. The purposes of this study were to evaluate treatment outcomes and to analyze the value of older prognostic scoring systems tested on a relatively new, unselected national cohort of PTC patients.
Methods:
This was a national prospective cohort study conducted in Denmark, which has a population of 5.5 million.
Results:
A total of 1350 patients were diagnosed with PTC during 1996–2008, and the median follow-up time was 7.9 years. The 10-year recurrence-free survival rate was 90.2%, and the 10-year crude and cause-specific survival (CSS) rates were 83.7% and 93.8% respectively. By multivariate Cox regression, it was possible to confirm age, metastases (distant and nodal), extrathyroidal extension, and tumor size as predictors of mortality, whereas only nodal metastases, extrathyroidal extension, and tumor size were predictors of recurrence. In analyses of older prognostic scoring systems, a significant correlation between the risk group ranks was found for survival as well as recurrence. The c-index for CSS was highest for MACIS (0.92) and lowest for AMES (0.80). In the TNM, MACIS, and EORTC systems, most patients were classified as stage 1, and for these patients, the 10-year CSS rate was approximately 99.5%, confirming the generally excellent survival.
Conclusion:
This national study provides further evidence that a favorable prognosis is to be expected for patients diagnosed with PTC. Also, it was possible to confirm age, metastases, extrathyroidal extension, and tumor size as predictors of mortality, whereas only nodal metastases, extrathyroidal extension, and tumor size were predictors of recurrence. All the scoring systems evaluated were able to produce a highly significant risk group stratification, showing that in spite of the changes in the case mix of PTC, these systems are still applicable, and in fact contain valuable prognostic information useable for treatment planning.
Introduction
T
In Denmark, all inhabitants have their own personal identification number (obligatory when using the healthcare system), which makes it feasible to trace patients over time at a national level. Using this system, a clinical national database was established in 1996. The database includes information about patient characteristics, diagnostics, treatment, pathology, recurrence, survival, and causes of death for all registered Danish thyroid cancer patients. The purposes of this study were to evaluate treatment outcomes and to analyze the value of older prognostic scoring systems in a relatively new unselected national cohort of PTC patients.
Materials and Methods
This was a prospective national cohort study. In Denmark, which has a population of 5.5 million, all patients diagnosed with thyroid cancer since January 1, 1996, have been registered in a validated national database, DATHYRCA (11). Registration takes place at the five centers for head and neck oncology, located at Copenhagen University Hospital, Herlev Hospital, Odense University Hospital, Aarhus University Hospital, and Aalborg University Hospital. Contributors to the database include head and neck surgeons, pathologists, and oncologists. The DATHYRCA database is part of the Danish Head and Neck Cancer Group (DAHANCA) and prospectively registers clinical, surgical, histopathological, and follow-up data from all Danish thyroid cancer patients on a national level. Due to a unique 10-digit personal identification number (CPR-number, which also contains birth date and sex information), it is possible to trace the individual patient through all their contacts with hospital and clinical outpatient services, thus ensuring a high level of database completeness. To identify patients who might have been missed at initial contact, in early 2010, the DATHYRCA database was cross-checked against three national governmental databases: the Danish Cancer Registry, the Danish Pathology Databank, and the Danish National Patient Register. All cancers in Denmark are reported to the Danish Cancer Registry, and since 1987, reporting of data has been mandatory. The Danish Pathology Databank is a national database that includes all histological and cytological reports in Denmark. The Danish National Patient Register includes codes for clinical diagnoses and surgical procedures for all hospitalizations and outpatient procedures in Denmark.
CPR-number lists from the DATHYRCA database were compared to lists from the Danish National Patient Register and the Danish Cancer Registry according to the ICD-10 thyroid cancer code (C73.9), and those patients not found in the DATHYRCA database were identified. Furthermore, a cross-reference based on topological and morphological codes was made to the Danish Pathology Databank. For patients missing from the DATHYRCA database, medical records were retrieved and reviewed, resulting in the retrospective inclusion of 185 thyroid cancer patients.
This study is based on data available from the DATHYRCA database on May 21, 2012. The inclusion criteria were PTC verified by cytological or histological examination; diagnosed between January 1, 1996, and December 31, 2008; no prior history of thyroid cancer; and alive at the time of diagnosis.
The overall database consisted of 2037 thyroid carcinoma patients, including papillary (67%), follicular (18%), medullary (7%), or undifferentiated (anaplastic) type (8%). A total of 1350 patients met the inclusion criteria.
Predictor variables were chosen from the clinical and histopathological data in the DATHYRCA database and consisted of age, sex, date of diagnosis, TNM status (5th ed.), tumor diameter, and information about the number of tumors at histological examination.
Outcome variables were time to death and time to recurrence. If a patient was registered as being alive, the information was checked in the Central Office of Civil Registration, and when deceased, the information was updated. The follow-up period ended on March 1, 2012, and patients were censored in the event of emigration from Denmark.
Four of the best-known and most used prognostic scoring systems were chosen for evaluation: TNM staging (5th ed.), MACIS, EORTC, and AMES (see Supplementary Table 1; Supplementary Data are available online at
To estimate the most precise classification, the T, N, and M stages were based on a synthesis of clinical and pathological information made available immediately following the conclusion of treatment. Thus, information obtained by clinical examination, imaging, surgery, and pathology could be included. If a discrepancy was observed, the pathological state was used. Most of the distant metastases were diagnosed solely by imaging.
In the database, recurrences were defined as persistent disease or occurrence of disease after the end of primary treatment. This could be confirmed by the following modalities: histology, cytology, and/or imaging.
Descriptive statistics were derived according to data type, that is, categorical variables were displayed as frequencies and respective percentages, whereas continuous variables were analyzed by means, medians, and ranges. Multivariate regression using Cox method was applied to examine predictor variables, under the assumption of proportional hazards and confirmed by test of Schoenfeld residuals. The Kaplan–Meier method was used to evaluate survival. In order to analyze the prognostic scoring systems, log-rank testing was applied, and Harrell's C calculated. The C index estimates the probability of concordance between predicted and observed responses. A value of 0.5 indicates no predictive discrimination, and a value of 1.0 indicates perfect separation of patients with different outcomes (12). The level of significance was 5% (two-sided). The database and analysis system Medlog was used for data registration, and STATA/IC 11 (StataCorp LP, College Station, TX) was used for statistical analyses.
The project was approved by the Regional Ethics Committee and by the National Danish Data Protection Agency.
Results
The study population consisted of 1350 patients. Table 1 shows available data by variables (patient, tumor, surgery) including “unknown,” which is an option if information has been sought but found not to exist. From Table 1, it is evident that the data have a very high degree of completeness, ranging from 97% to 100% in all variables. Median follow-up for the study was 7.9 years (range 0.0–16.2). Mean age was 47.5 years, and the male:female ratio was 1:2.8. In 406 patients, a microcarcinoma was histologically confirmed (≤10 mm). These are described in detail in a separate publication (13).
Recurrence
The database contained 123 cases of recurrences, and thus 9.1% of patients experienced a recurrence during follow-up. Figure 1 shows the pattern of failure, and it can be seen that a nodal site (N) is most commonly involved (65% of cases). In three cases, the site of recurrence was not reported.

Venn diagram of pattern of recurrence of papillary thyroid carcinoma diagnosed in Denmark between 1996 and 2008.
The five-year recurrence-free survival (RFS) was 92.2%, and the 10-year RFS was 90.2%.
Mortality
A total of 202 cases of mortality occurred in the follow-up period. Of these, 70 died from or had active thyroid cancer, which means the study's overall cause-specific mortality was 5.2%.
The five-year crude and cause-specific survival (CSS) rates were 90.1% and 95.9% respectively, and the 10-year crude and CSS rates were 83.7% and 93.8% respectively.
Variables affecting outcome
Univariate analysis showed significance of all predictor variables, except for multifocality. This variable was subsequently omitted from the multivariate testing.
Multivariate Cox regression showed that only extrathyroidal extension, tumor size, and nodal metastases were predictors of recurrence, with nodal metastasis as the strongest predictor after adjusting for other covariates.
For cause-specific mortality, being over 45 years of age was the strongest predictor, but nodal metastases, distant metastases, extrathyroidal extension, and tumor size were also significant predictors. Table 2 shows the results of the regression analyses.
HR, hazard ratio; CI, confidence intervals; CSM, cause-specific mortality.
Prognostic scoring systems
The 1350 patients diagnosed with PTC were classified according to risk stages in the chosen prognostic systems. Not all patients could be classified in all four systems, and in particular, the completeness of resection in the MACIS system led to exclusion of cases because of “unknown” status.
Table 3 shows the allocation into risk groups in the four systems. Log-rank testing showed the significance of the higher risk group in all systems, even for the occurrence of recurrence, for which they have not been developed. The C index was calculated for CSS for all four systems; 1172 patients who could be classified in all four systems were included. MACIS had the highest score—0.92 [CI 0.88–0.96]—followed by TNM—0.86 [CI 0.79–0.92]—and EORTC—0.86 [CI 0.78–0.94]. AMES had the lowest score—0.80 [CI 0.72–0.89]. In the TNM, MACIS, and EORTC systems, most patients were classified as stage 1, and the 10-year CSS in the systems were 99.6%, 99.4%, and 99.4% respectively, confirming the excellent survival for these patients. However, of these stage 1 patients, 6.2%, 4.8%, and 6.8% experienced recurrence during follow-up.
CS, crude survival; CSS, cause-specific survival; RFS, recurrence-free survival.
Discussion
This national and prospective study provides further evidence that a favorable prognosis is to be expected for patients diagnosed with PTC, with a 10-year CSS of 93.8%. Moreover, it was possible to confirm age, metastases (distant and nodal), extrathyroidal extension, and tumor size as predictors of mortality, whereas only nodal metastases, extrathyroidal extension, and tumor size were predictors of recurrence. Sex and multifocality were not found to be predictors. All the evaluated scoring systems were able to produce a highly significant risk group stratification, showing that in spite of changes in the case mix of PTC, these systems are still applicable and contain valuable prognostic information that can be used for treatment planning.
The DATHYRCA database is national, and it has several contributors. This could potentially be a weakness of the study, since the structure allows for some inconsistency in the way data are registered. However, cross-checks to governmental databases were performed in order to secure sufficient catchment, and validation of reported data shows overall good κ values regarding reliability (11), indicating that the DATHYRCA database is valid. Furthermore, the database covers all patients in Denmark, and is unselected and unbiased with regard to social class. Conversely, Denmark is a population of primarily ethnic Caucasians, and the database does not represent a diversity of ethnicity. To our knowledge, this is the first national study of prognosis of PTC from a national prospective, clinical database.
As a clinical database, some biological explanatory variables may exist that are not registered. It has been stated that aggressive subtypes may affect prognosis. Variables concerning this aspect have not been established in the DATHYRCA database, and we have not been able to analyze this characteristic. However, in a recent study, Silver et al. reviewed existing literature on the subject, concluding that the evidence for histological subtype as an independent risk factor was weak, and that the aggressiveness of these carcinomas was probably correlated to other, already known factors, associated with worse prognosis (14). Another limitation of our study is its observational nature, by which change in treatment strategy and quality is not included in the analyses, even though these factors might have influenced prognosis. However, since 2001, all Danish thyroid cancer patients have been treated according to national guidelines (15), which reduces the influence from the treatment aspect, and former studies indicate a generally good quality of thyroid surgery in Denmark (16,17).
The data on mortality in this study were considered as worst case. In a few cases, it was not possible to establish the exact cause of death. In these cases, thyroid cancer was considered the cause of death if the patient had any signs of disease at the last follow-up. This could bias the results, especially the value of age as prognostic indicator, since older patients are more prone to die from other causes.
This study found a favorable rate of recurrence for patients diagnosed with PTC. Compared to data reported by Grogan et al., where the 10-year recurrence rate was 19%, it was 9.8% in our study (18). However, the cohort studied by Grogan et al. consisted of highly selected patients and our unselected series may suggest an overestimation in the cohort of these authors, or it may reflect differences in treatment strategies. Toniato et al. previously reported a recurrence rate of 8.3% after a follow-up of 7.8 years (19), which is similar to that found in this study. The mortality rate found was also favorable and in accordance with previous long-term studies from the Mayo clinic (4) and the University of Chicago cohort (18).
For both recurrence and mortality, more events are to be expected with longer follow-up, which imposes another limitation. This study's median follow-up of 7.9 years, with some patients followed for more than 16 years, should ensure that most events have occurred. Late recurrences are seen, however, and the study by Grogan et al. showed that 11% of recurrences and 17% of deaths occurred after 20 years (18). Further follow-up could thus modify some of the conclusions.
It has previously been reported that age, tumor size, extrathyroidal extension, and nodal and distant metastasis were predictors for mortality (20 –25). The data from our study confirm this. For recurrence, it seems that lymph node metastasis is the most significant predictor, at least for Danish patients. Previous studies have shown conflicting data regarding this subject. In the study by Toniato et al. on 950 consecutive patients from a single institution, nodal metastasis was not found to have significant impact on recurrence (19). Grogan et al., however, found this parameter to be significant. The conflicting evidence may be partly explained by the mode of discovery of nodal metastases. In a study from Kuma Hospital, Japan, it was shown that only clinically evident metastases were significant, as opposed to those found only after histological examination (26). In Denmark, it is not customary to perform prophylactic nodal dissection, apart from exploration/dissection of level VI. Thus, neck dissection is reserved for patients with clinically apparent metastases. Recent data also suggest that the significance is modulated by size and number of metastases and location in either central or lateral neck (27). It was not possible to test this in this national cohort, since data on these parameters were not available.
As for the analysis of scoring systems, no greater differences in outcomes were found among EORTC, MACIS, and TNM, which were all significant and had similar C indexes. AMES also showed significance but with lesser ability to discriminate between patients with a favorable outcome and aggressive cases, with only two risk groups and a slightly lower C index. In general, the C index confirms that the scoring systems are able to separate patients with different outcomes. However, overlapping confidence intervals were found, and no system was significantly superior. When compared to the results of the multivariate regression in this study, TNM would be the preferred system, since it is the only one to incorporate lymph node metastases as a prognostic factor, which is in accordance with the current recommendation of the American Thyroid Association (28). Preferably, evaluation should have been undertaken using the 7th edition of the TMN classification. However, this was not possible because the 7th edition was not used in the early years of registration. A study by Meixner et al. comparing the 5th and 7th editions did not show significant superiority of the 7th edition regarding PTC (29). Implementation of the 7th edition is therefore not expected to change the results significantly.
Earlier, Lang et al. found results comparable to ours when comparing staging systems in a cohort of 589 patients diagnosed at a single institution between 1961 and 2001 (30). Thus, it is confirmed that the “good old” prognostic scoring systems are still going strong and that they are applicable in spite of changing demographics.
Ethics
This study has been approved by the Danish Regional Ethics Committee and by the Danish Data Protection Agency.
Footnotes
Acknowledgments
The authors thank all contributing departments for reporting data to the DATHYRCA database. Funding for the study was provided by Odense University Hospital, University of Southern Denmark, DAHANCA, Becket-Fonden, Else og Mogens Wedell-Wedellsborgs Fond. The study sponsors have had no role in the present study, apart from funding.
Author Disclosure Statement
No competing financial interests exist.
