Abstract
Background:
There remains controversy surrounding the extent of lateral neck dissection required in patients with papillary thyroid cancer (PTC) and suspicious or confirmed metastatic lateral neck lymphadenopathy. The evidence for this clinical dilemma has never been reviewed systematically nor has there been an attempt to meta-analyze the data by lymph node levels to better characterize the pattern of spread.
Methods:
This meta-analysis used MEDLINE and EMBASE including all cohort studies reporting the pattern of lateral neck disease in patients who underwent a neck dissection for clinically, radiographically, or cytologically suspicious or confirmed metastatic lymphadenopathy for PTC. Our main outcome was the number of patients with positive involvement at a given level as a percentage of the cohort of patients with positive lateral neck disease, each level being measured separately.
Results:
Eighteen studies with a total of 1145 patients and 1298 neck dissections were included in our meta-analysis. Levels IIa and IIb had disease in 53.1% [95% confidence interval (CI) 46.6–59.5%] and 15.5% [CI 8.2–27.2%], respectively. Studies that did not distinguish between level IIa and IIb or in which both were collapsed into one category showed a total level II involvement of 53.4% [CI 49.7–57.1%]. Level III and level IV were involved in 70.5% [CI 67.0–73.9%] and 66.3% [CI 61.4–70.9%] of specimens. Studies that did not distinguish between level Va and Vb or in which both were collapsed into one category showed a total level V involvement of 25.3% [CI 20.0–31.5%]. Levels Va and Vb had positivity in 7.9% [CI 2.8–20.0%] and 21.5% [CI 7.7–47.6%], respectively, but had only three studies that could be meta-analyzed.
Conclusions:
This systematic review of the literature and meta-analysis of the pattern of spread indicates significant rates of lymph node metastasis to all lateral neck levels in patients with PTC with regional involvement. This evidence leads us to recommend a comprehensive selective neck dissection of levels IIa, IIb, III, IV, and Vb in patients with lateral neck disease from PTC. The evidence for level Va is lacking, as most studies did not distinguish between levels Va and Vb, and the border between the two levels was inconsistent. Future studies will need to address these sublevels separately.
Introduction
This systematic review and meta-analysis focused on a single outcome: pattern of disease spread in patients with metastatic PTC. The outcome was measured by analysis of the final pathology specimen in patients who underwent a selective lateral neck dissection for suspected or confirmed metastatic PTC. We hypothesize that a meta-analysis in this population will reveal a high incidence of metastasis to all lateral neck levels with the greatest incidence of spread to levels III (middle jugular) and IV (lower jugular), followed by II (upper jugular) and V (posterior triangle), in that order.
Methods
Our review protocol has not previously been published or registered. We were specifically focused on the population of patients with PTC who underwent lateral neck dissection for metastatic lymph node spread.
Search strategy
A literature search was performed using the MEDLINE and EMBASE databases. The search was restricted to human subjects and adults (age≥18 years). The following search terms were used: (neck dissection) and (lateral or cervical lymphadenopathy) and (PTC or thyroid neoplasms or papillary carcinoma) not (medullary or follicular carcinoma). Key terms were explored and exploded in each database separately, and non-MeSH words were included when they were not included in the databases. The date of the last search was November 4, 2011.
Before reviewing any article, a number of techniques were used to ensure that all the relevant references were included in our search. Citations were cross-checked (snowballing) from key publications; citations from existing reviews were assessed (11,12); the related articles to key publications in PubMed were reviewed; forward citations were used; and two experts in the field (J.F. and I.W.) were surveyed to make sure that we were not missing any key references. This systematic review was planned, conducted, and reported in adherence with all of the standards of quality for reporting meta-analyses (MOOSE) (13).
Review process
The search results were reviewed in three stages. All three stages included the same two independent reviewers. The first two stages included a title and abstract review. A kappa statistic for inter-rater reliability was calculated for each stage. The full articles were then acquired for abstracts that had either been chosen by one or both reviewers. These articles were then assessed for study inclusion/exclusion criteria. Another kappa statistic was calculated for this step. The last phase involved two abstractors for each article. Any discrepancies were resolved by consensus. Reasons for exclusions were recorded at every step.
Study inclusion/exclusion criteria
Studies considered for inclusion treated adults with PTC or well-differentiated thyroid cancer (including follicular thyroid cancer) with lateral neck disease treated by a comprehensive neck dissection (levels II–IV or II–V). The study must have described disease spread by level and specifically indicate how many neck dissections from their cohort had at least one positive level on final pathology. For a study to be included, they had to describe their indications for surgery. The search included prospective and retrospective cohort studies.
Review articles, commentaries, and technique articles were excluded, although these publications were used only in the snowballing process. Case reports with 10 patients or less were excluded. Articles that focused on pediatric patients, prophylactic neck dissection, central neck dissections, medullary or anaplastic carcinoma, and studies that did not have adequate data for use in our meta-analysis were also excluded.
Data extraction
Each study was abstracted by the same two independent reviewers who reached a consensus about any discrepancies in the collection process. Authors were contacted when there were any questions about their methodology or results.
Summary measures
The main outcome measure was the number of neck dissections with positive neck node involvement at a given level as a percentage of the cohort of neck dissections with positive lateral neck disease, each level being measured separately. Since studies use a variety of inclusion criteria (clinical evidence of disease, radiographic evidence of disease, and/or fine-needle aspiration-proven neck disease), the denominator used for all studies was the number of neck dissections with pathology-proven metastasis to at least one level. This provided a consistent denominator for studies with slightly differing inclusion criteria and standards of care. By utilizing the number of neck dissections assessed and the event rates at each level, confidence intervals were calculated using the Comprehensive Meta-Analysis program version 2 (Biostat, Inc., Englewood, NJ) utilizing binomial distributions. Findings are presented as a percent, from the cohort of neck dissections, having involved disease at a particular level with its appropriate confidence intervals.
Bias across studies and heterogeneity
Risk of bias across studies may be present, particularly with regard to publication bias. As the topic involves surgical procedures and outcomes, it is very likely that smaller studies or those with unfavorable outcomes may not be published in the literature. A funnel plot was created to assess publication bias, and the Duval and Tweedie's trim-and-fill method was used for the analysis of each level to assess where the missing studies would be plotted.
For many of the reasons mentioned above, the a priori hypothesis was that a large degree of heterogeneity existed among studies. Important areas of variation include % female patients, prospective or retrospective, continent of primary data (North America yes/no), PTC or well-differentiated thyroid cancer, standard of care for neck dissection (II–IV or II–V), year of initiation of study, single- or multisurgeon, and single- or multicenter. The prespecified statistical plan to account for effects of heterogeneity included subgroup and metaregression analyses.
Assessment of study quality
A validated tool to assess bias in observational studies without a control or comparison group does not yet exist and is quite population specific. For this reason and to assess key differences between studies, data were collected on a number of additional study features, which may be considered as surrogates of study quality. Data on study design (retrospective vs. prospective), mean number of lymph nodes per neck, mean number of lymph nodes per level, pathology (PTC vs. well-differentiated thyroid cancer), whether the neck dissection specimen was marked or not, multicenter or single center, multisurgeon or single surgeon, and the institution's standard of care in treating patients with PTC with suspected lateral neck disease were collected. Data on the percentage of female patients in the population and the year of initiation of the study were also collected.
Statistical analyses
Eight overall statistical analyses were performed—one for each neck level (IIa, IIb, II, III, IV, V, Va, and Vb). Due to some anticipated heterogeneity, a more conservative statistical approach applying the random-effects model was chosen for all analyses. This method takes into account within-study variation, between-study variation, and the heterogeneity inherent in comparing results from different centers and surgeons. For each forest plot, the random-effects model summary-effect measure, 95% confidence interval (CI), and the I2 statistic were included. To assess publication bias, a standard funnel plot was used as well as random-effects Duval and Tweedie's trim-and-fill.
Results
Study selection
Search results were reviewed in three stages. The first stage included a title review, which resulted in a reduction from 569 to 217 articles. This yielded an inter-rater reliability kappa statistic of 0.603±0.04. The second stage included an abstract review, which resulted in a reduction from 217 to 94 articles. This yielded an inter-rater reliability kappa statistic of 0.636±0.08. The final stage included an assessment of the full article for preselected inclusion/exclusion criteria that yielded 18 full-text articles for which we could extract our metameters (14 –31). Any disagreements at this stage were resolved by consensus. Figure 1 displays reasons for exclusion and number excluded at each stage.

Selection of studies for inclusion in the meta-analysis.
Study characteristics
The characteristics of the studies included in the meta-analysis are summarized in Table 1. The 18 studies were published between 2002 and 2012. Six of these studies were from North America, three from Europe, two from Israel, and seven from Asia. Each study had between 22 and 185 patients and 22 and 248 comprehensive neck dissections (II–IV or II–V) involving at least one level with positive nodes. This provided a total of 1145 patients and 1298 neck dissections included in this meta-analysis. The chosen metameter was extractable for levels IIa, IIb, II, III, IV, and V from 9, 9, 15, 18, 18, and 14 studies, respectively.
Total number of neck dissections (NDs) with positive disease.
Same first author, but different patient populations described.
NA, not available.
Pattern of spread
The random-effects model meta-analysis summary for each of the neck levels is represented in Tables 2 –7. Level IIa was involved with metastasis in 53.1% [CI 46.6–59.5%] and level IIb in 15.5% [CI 8.2–27.2%]. Studies that did not distinguish between level IIa and IIb or in which both were collapsed into one category showed a total level II involvement of 53.4% [CI 49.7–57.1%]. Level III and level IV were involved in 70.5% [CI 67.0–73.9%] and 66.3% [CI 61.4–70.9%] of cases. Studies that did not distinguish between level Va and Vb or in which both were collapsed into one category showed a total level V involvement of 25.3% [CI 20.0–31.5%]. Levels Va and Vb had three studies each (not shown) and had positivity in 7.9% [CI 2.8–20.0%] and 21.5% [CI 7.7–47.6%], respectively.
I2=52.65
CI, 95% confidence interval.
I2=86.93
I2=30.90
I2=39.34
I2=65.65
I2=75.97
Study heterogeneity
The studies ranged in heterogeneity with I2 statistics of 53, 87, 31, 39, 66, and 76 for levels IIa, IIb, II, III, IV, and V, respectively. This ranges from moderate to high range of heterogeneity between studies.
Assessment of study quality and bias
Important study characteristics that are surrogates for study quality are summarized in Table 8. None of the studies report on exact technique of neck dissection nor did any of the studies report on the number of pathologists that reviewed their specimens.
LNs, lymph nodes; R, retrospective; P, prospective; PTC, papillary thyroid cancer; WDTC, well-differentiated thyroid cancer; op, operatively.
Subgroup analyses and metaregression
The heterogeneity could not be explained by subgroup analyses of the prospective or retrospective nature of the study, continent of primary data (North America yes/no), whether the study focused on PTC or well-differentiated thyroid cancer, whether it was a single- or multisurgeon study, and whether it was a single- or multicenter study. A metaregression of percent female patients and of the year of initiation of the study also did not reveal any significant variation.
Discussion
This study is the first systematic review and meta-analysis of the literature on patients undergoing comprehensive lateral neck dissection for metastatic PTC. We demonstrated high rates of metastasis to all lateral neck levels. Included in our analysis were 18 unique studies from 2002 to 2012 with 1145 patients who underwent 1298 neck dissections. Level IIa, IIb, II, III, IV, and V had metastatic involvement in 53%, 16%, 53%, 71%, 66%, and 25% of neck dissections, respectively. Level Va and Vb only had three studies in each meta-analysis and demonstrated 8% and 22% positivity, respectively. Assessment of bias, subgroup analyses, and metaregression could not explain the moderate-to-high levels of heterogeneity between studies detected in our analysis.
Our findings, particularly at levels II and V, demonstrate the need to perform a comprehensive neck dissection in PTC patients with metastasis to the lateral neck. These systematically reviewed findings support the guidelines put out by the ATA in 2009 and the Triological Society in 2010, which were largely based on expert opinion and small retrospective cohort studies (10,11).
The true impact of local lymph node metastasis on survival in PTC remains controversial. Nonetheless, there are ample data in the literature associating lymph node metastasis with decreased disease-specific survival and increased risk of local recurrence. Outcome data from large institutional cohorts from the United States, Canada, and Germany have shown a significant and independent negative impact on survival with lymph node metastasis (32 –34). Analysis of large cancer registry databases from the United States and Sweden has similarly shown poorer outcomes and increased mortality rate in patients with locoregional lymph node metastases (8,35). Additionally, the presence of local lymph node metastasis has been associated with an increased risk of future local recurrence (4). As a result, lateral neck metastasis is currently considered a significant prognosticator according to the latest thyroid cancer TNM classification (36). Lymph node metastasis is not, however, a predictor in several institutional risk stratification schemes, including AMES, AGES, and MACIS (37 –39).
Considering the available evidence, lateral neck dissection for excision of metastatic lymph nodes may improve survival and decrease regional recurrence. However, the potential survival benefit must be weighed against the morbidity associated with neck dissection surgery. The most significant and common of these complications is shoulder dysfunction resulting from manipulation and injury of the spinal accessory nerve with surgery to levels II and V (9). Dissection of the posterior triangle (level V) is associated with higher rates of shoulder dysfunction (40); nonetheless, transection of the nerve remains a rare complication in experienced hands (41). In addition, there are no differences in health-related quality of life secondary to shoulder dysfunction between modified radical neck dissection and selective neck dissection at 1 year postoperatively (42). Synthesis of this literature suggests that the increased risk of recurrence and poorer survival outweigh the risk of complications from neck dissection.
The limitations of the study need to be considered. Whenever performing a meta-analysis to estimate an outcome or effect from a group of similar studies, the measure of heterogeneity informs whether the combined estimate is meaningful. Our a priori hypothesis of significant heterogeneity in a meta-analysis of observational studies proved to be true based on our moderate-to-high I2. It has been suggested that quantification of heterogeneity is only one aspect of variability assessment across studies (43). The studies included in our meta-analysis have some diversity in their clinical and methodological aspects (Tables 1 and 8), but subgroup analyses and meta-regression on these factors could not explain or decrease the observed heterogeneity estimates. In our opinion, the clinical implication of the observed degree of inconsistency across studies is not significant. The results of our quantitative analyses were consistent with the qualitative trends across the individual studies. This is also evidenced by the fairly narrow confidence intervals around our estimates.
In this meta-analysis, the distribution of lateral neck disease involvement demonstrates the need for comprehensive neck dissection in patients presenting with confirmed PTC metastasis to any lateral neck level. Our findings corroborate the guidelines put out by the ATA in 2009 and the Triological Society in 2010 (10,11).
Footnotes
Disclosure Statement
The authors declare that no competing financial interests exist.
