Abstract
Background
Lung cancer is one of the most common cancer types in the world. An accurate diagnosis of lung cancer is crucial for early treatment and management.
Purpose
To perform a comprehensive meta-analysis to evaluate the diagnostic performance of dual time point 18F-fluorodexyglucose position emission tomography/computed tomography (FDG-PET/CT) and single time point 18FDG-PET/CT in the diagnosis of pulmonary nodules.
Material and Methods
PubMed (1966-2011.11), EMBASE (1974-2011.11), Web of Science (1972-2011.11), Cochrane Library (-2011.11), and four Chinese databases — CBM (1978-2011.11), CNKI (1994-2011.11), VIP (1989-2011.11), and Wanfang Database (1994-2011.11) — were searched. Summary sensitivity, summary specificity, summary diagnostic odds ratios (DOR), and summary positive likelihood ratios (LR+) and negative likelihood ratios (LR–) were obtained using Meta-Disc software. Summary receiver-operating characteristic (SROC) curves were used to evaluate the diagnostic performance of dual time point 18FDG-PET/CT and single time point 18FDG-PET/CT.
Results
The inclusion criteria were fulfilled by eight articles, with a total of 415 patients and 430 pulmonary nodules. Compared with the gold standard (pathology or clinical follow-up), the summary sensitivity of dual time point 18FDG-PET/CT was 79% (95%CI, 74.0–84.0%), and its summary specificity was 73% (95%CI, 65.0–79.0%); the summary LR+ was 2.61 (95%CI, 1.96–3.47), and the summary LR– was 0.29 (95%CI, 0.21–0.41); the summary DOR was 10.25 (95%CI, 5.79–18.14), and the area under the SROC curve (AUC) was 0.8244. The summary sensitivity for single time point 18FDG-PET/CT was 77% (95%CI, 71.9–82.3%), and its summary specificity was 59% (95%CI, 50.6–66.2%); the summary LR+ was 1.97 (95%CI, 1.32–2.93), and the summary LR– was 0.37 (95%CI, 0.29–0.49); the summary DOR was 6.39 (95%CI, 3.39–12.05), and the AUC was 0.8220.
Conclusion
The results indicate that dual time point 18FDG-PET/CT and single time point 18FDG-PET/CT have similar accuracy in the differential diagnosis of pulmonary nodules. Dual time point 18FDG-PET/CT appears to be more specific than single time point 18FDG-PET/CT.
According to WHO statistics, lung cancer has become the world's most commonly fatal cancer, with an approximately 1.2 million annual increase in patients. The morbidity and death rate of this cancer have become an enormous social and economic burden (1). Over the last several decades, there have been no breakthroughs in the treatment of lung cancer, and effective treatment is closely correlated with early diagnosis. Currently, the more commonly used methods include X-ray and computed tomography (CT); positron emission tomography (PET) and PET/CT are also used at times. Among these methods, low-resolution X-ray and PET are not conducive to the detection of small lesions. The resolution of CT can reach 2 mm, which permits the timely detection of small lesions, but the specificity of this method is low; thus, the accurate diagnosis of lung cancer still has certain limitations.
There are studies in the literature (1, 2) reporting that 18-fluorodeoxyglucose (18FDG)-PET/CT has greater value than other methods (X-ray, CT, PET) in the diagnosis of lung cancer. Using 18FDG as a tracer for glucose metabolism, lesions can be found before morphologic changes are detectable, and accurate qualitative diagnoses can be achieved. Because one of the characteristics of malignant tumors is high glucose metabolism, these tumors can accumulate 18FDG. Conventional 18FDG-PET/CT often uses a single time point standardized uptake value (SUV) to identify benign and malignant tumors, but the SUVs of benign and malignant lesions overlap greatly, resulting in incorrect judgements; certain malignant tumors appear to be false-negative, while certain benign tumors appear to be false-positive. Because the SUV can be affected by many factors, such as plasma glucose levels, the patient's weight and height, dual time point imaging can be used. This technique conducts additional measurements at a second time point after the single time point, with a 1.5–4 h delay. This delay provides dual time point imaging with advantages compared to single time point imaging regarding the problem of differentiating benign tumors from malignant tumors because the metabolic SUV of malignant tumors increases significantly after this delay, whereas benign lesions do not change.
Several studies (1–3) have reported that dual time point 18FDG-PET/CT is superior to single time point 18FDG-PET/CT imaging at identifying lung malignant lesions and greatly improves diagnostic accuracy, but other research has reported the opposite conclusion. Based on these differing reports, this study aims to use meta-analysis to evaluate the diagnostic value of dual time point 18FDG-PET/CT and single time point 18FDG-PET/CT in differentiating benign from malignant pulmonary nodules. The study also aims to determine whether dual time point imaging is superior to single time point imaging.
Material and Methods
Execution of data collection and statistical analysis
The methods employed for the execution of data collection and the statistical calculations were decided on before the start of the study. The statistical analyses were performed independently by the authors of this study. Inconsistencies in these analyses were resolved by discussion and consensus.
Data sources and searches
PubMed (1966-2011.11), EMBASE (1974-2011.11), Web of Science (1972-2011.11), the Cochrane Library (-2011.11) and four Chinese databases — CBM (1978-2011.11), CNKI (1994-2011.11), VIP (1989-2011.11), and Wanfang Database (1994-2011.11) — were searched for English- and non-English-language literature. The searches included the following keywords: (‘positron emission tomography/computed tomography’ OR ‘PET/CT’ OR ‘positron emission tomography-computed tomography’ OR ‘PET-CT’ OR ‘fluorodeoxyglucose’ OR ‘FDG’ OR ‘18F-FDG’ OR ‘18FDG’ OR ‘FDG-F18’) AND (‘dual time point’ OR ‘double phase’) AND (‘sensitivity’ OR ‘specificity’). All of the possible pertinent articles returned by searches; the final bibliography was distributed to two experts in the field of radiology to identify missing or unpublished studies. Duplications and overlapping articles were excluded; meeting abstracts were also excluded because they did not provide sufficient detail with regard to data and results.
Study selection
Publications were included if they fulfilled the following criteria: (i) used dual time point 18FDG-PET/CT and single time point 18FDG-PET/CT as diagnostic tests for pulmonary nodules; (ii) used pathology or clinical follow-up as the reference standard; and (iii) reported absolute numbers of true-positive (TP), true-negative (TN), false-positive (FP), and false-negative (FN) results or reported sufficient data to derive these values. Publications were eligible regardless of the dual time point 18FDG-PET/CT and single time point 18FDG-PET/CT technique, as were animal studies, phantom studies, and studies with <10 patients.
Data extraction and quality assessment
The following data were extracted for each study: first author, journal title, year of publication, total number of patients, mean age, scanner brand, type of study (retrospective or prospective), dual time technique, patient selection, imaging assessment, definitions of benign versus malignant pulmonary nodules, numbers and sizes of pulmonary nodules, early and delayed pulmonary nodule SUV values, and type of follow-up reference standards (pathology or clinical follow-up). With regard to the dual time technique, the extracted data included radiotracer doses, time delay between scans, and the determination of regions of interest (ROIs) for maximal SUV. The statistical data were derived from the numbers of TPs, TNs, FPs, and FNs reported in the individual studies. To assign a quality score to each selected study, two investigators (ZL and WYZ) independently assessed the methodological quality using the Quality Assessment of Diagnosis Accuracy Studies (QUADAS) score tool (4), which consists of 11 questions answered ‘yes’, ‘no’, or ‘unclear’. The following phrases represents the complete text of the 11 questions in QUADAS: disease spectrum composition, selection criteria, disease progression bias, partial reference bias, multiple reference bias, mixed bias, interpretation of test bias, gold standard interpretation bias, clinical interpretation bias, and results difficult to interpret withdrawn from case evaluation.
Statistical analysis
The statistical analyses described here were performed using Metadisc, version 1.4. The reviewers independently constructed 2*2 contingency tables for each study. The sensitivity, specificity, positive likelihood ratio (LR+), negative likelihood ratio (LR–), diagnostic odds ratio (DOR), and the 95% confidence intervals (CIs) for these parameters were calculated from the original data. A summary receiver-operating characteristic curve (SROC), the area under the curve (AUC), and the Q* index were calculated (the Q* index is a best statistical method that reflects the diagnostic value at the point on the SROC at which the sensitivity and specificity are equal). The heterogeneity of the results was assessed graphically using forest plots and the value of inconsistency index in statistics (I2), which describes the percentage of total variability across studies that is attributable to heterogeneity rather than to chance. The value of I2 is calculated using the following equation: I2 = 100([Q-df]/Q), where Q is Cochran's heterogeneity, and a value >50% can be considered significant heterogeneity. We grouped the studies and pooled the data in a randomized meta-analysis when they exhibited high heterogeneity; when the data exhibited low heterogeneity, the fixed-effect model was used. When there were more than nine studies included, we used funnel plots to assess bias (5).
Results
Literature identification and eligibility
On the basis of the computer search and extensive cross-checking of the reference lists, a total of 587 potentially relevant studies were initially retrieved. Eight studies met all of the inclusion and exclusion criteria after the full texts of the relevant studies were reviewed (Fig. 1).
Flowcharts for the identification of studies
Study characteristics
The results of methodological quality assessment
Four studies were prospective cohort studies, and four were retrospective cohort studies. The QUADAS quality assessment results are shown in Fig. 2.
QUADAS quality assessment results
The results of the meta-analysis
Dual time point 18FDG-PET/CT compared with the gold standard
The dual time point 18FDG-PET/CT compared with the gold standard (95% CI)

Forest plot

Forest plot

SROC curve
Single time point 18FDG-PET/CT compared with the gold standard
The single time point 18FDG-PET/CT compared with the gold standard (95% CI)

Forest plot

Forest plot

SROC curve
Reporting bias analysis
The Cochrane handbook (5) suggests using funnel plots to assess bias when there are more than nine included articles. Because our study included only eight articles, we did not use funnel plots.
Discussion
For dual time point 18FDG-PET/CT, three of these eight articles (1, 7, 8) used a percentage of the retention index (RI = [delayed SUVmax – early SUVmax]/early SUVmax × 100%) >10% as a diagnostic threshold to differentiate benign from malignant pulmonary nodules. Sathekge et al. (8) used delayed SUVmax >5.5 as a diagnostic threshold. Yao et al. (2) used RI ≥15% to differentiate benign from malignant lesions, whereas Zhao et al. (6) used RI > 24.66% and Kim et al. (10) used RI >−2.3%. Yang et al. (3) used delayed SUVmax >2.5 as a diagnostic threshold because their retention index was not statistically significant.
There were four articles (2, 3, 9, 10) that used multiple thresholds to analyze or obtain the best diagnostic threshold. For single time point 18FDG-PET/CT, five studies (1, 3, 7, 8) out of eight used SUVmax >2.5 for the diagnostic threshold to differentiate benign from malignant pulmonary nodules. Zhao et al. (6) used SUVmax >2.45 as the diagnostic threshold, and Kim et al. (10) used SUVmax >2. Yao et al. (2) did not clearly indicate the diagnostic threshold they used (we attempted to contact the authors but received no reply).
Qian et al. (11) demonstrated that chest CT scanning was a good predictive indicator of malignancy in pulmonary nodules. The pooled sensitivity of this method was 0.89 (0.87–0.91), and the pooled specificity was 0.69 (0.66–0.72). Barger et al. (12) reported that for dual time point FDG-PET in the diagnosis of pulmonary nodules, the summary sensitivity was 0.85 (0.82–0.89), and the summary specificity was 0.77 (0.72–0.81). Gould et al. (13) reported in a meta-analysis of single time point FDG-PET that the sensitivity and specificity were 96.8% and 77.8%, respectively. Our study showed that the pooled sensitivity of dual time point 18FDG-PET/CT was 79% (95% CI, 74.0–84.0%) and the specificity was 73% (95% CI, 65.0–79.0%). The pooled sensitivity of single time point 18FDG-PET/CT was 77% (95% CI, 71.9–82.3%), and the specificity was 59% (95% CI, 50.6–66.2%). The pooled specificity was 59%, which might have been due to the fact that benign lesions, inflammation, and tuberculosis comprised the majority of cases in the studies that were included in this analysis.
Benign and malignant lesions can be differentiated through glucose metabolism, but many benign or inflammatory lesions can also appear as areas of increased FDG uptake, resulting in false-positive findings. In contrast, certain malignant tumors with low glycometabolism can appear as false-negative. The reasons for the pathology and frequency of non-FDG-avid lung cancer and FDG-avid benign lesions were as follows: false-negatives occurred because, first, the sizes of certain lesions were smaller than the imaging equipment could detect using focal resolution; and second, certain tumors (such as carcinoid tumors, mucous carcinomas, and bronchioloalveolar carcinomas) have very low FDG uptake rates that are not sufficient to yield abnormal concentration ranges upon imaging. Because FDG uptake is not tumor-specific, many benign lesions, such as granulomas, tuberculosis, and sarcoidosis, can also demonstrate FDG uptake, leading to errors in judging imaging results. Some increases in FDG uptake values are primarily caused by the presence of inflammatory cells (neutral polynuclear granulocytes, lymphocytes, or macrophages) and glucose metabolic activity enhancement. Activated white blood cells exhibit facilitated diffusion of FDG into the cell interior; hexokinase function, which phosphorylates FDG to form FDG-6-PO4; and the subsequent retention of this compound in the cells; these types of cells exhibit particularly high levels of glucose transport and phosphorylation processes (14). Another complicating factor is radiotherapy. During radiotherapy, lung inflammatory lesion uptake values can increase.
This meta-analysis demonstrated heterogeneity for several reasons. The thresholds chosen to distinguish benign from malignant nodules were likely a primary source of heterogeneity in the studies included in this meta-analysis. Therefore, a comprehensive evaluation of different diagnostic thresholds should be conducted to inform clinicians regarding the threshold that is most useful for diagnosis. The method of SUV calculation might also have been a source of heterogeneity. Other elements potentially affecting heterogeneity included different sizes of pulmonary nodules, different ages of the patients, and various brands of instruments.
This study also found that with a unique one-off body check, 18FDG-PET/CT can comprehensively evaluate both primary lung cancer and local or distant metastasis. Zhao et al. (6) reported that dual time point 18FDG- PET/CT detected 21 lung cancer lymph nodes, as well as distant metastasis in 15 patients. Hu et al. (1) reported finding 48 metastases in 31 patients. This detection of metastases is an advantage of PET/CT that is unequalled among other current diagnostic methods.
This study is subject to several potential biases: (i) this study only included Chinese- and English-language literature; thus, a language bias might exist; (ii) this study attempted to search a number of key databases and use more comprehensive search terms, but tables could not be extracted from three articles because the necessary data were not included (15–17); (iii) the articles that were included in this study demonstrated several differences among their data, such as in the scan parameters and the brands of time delay imaging instruments that were used. If the included articles had used a uniform scanning protocol, then the results would have exhibited greater reliability (11); (iv) FDG uptake is not tumor-specific and is subject to many factors, such as plasma glucose level, the patient's weight and height; and (v) this study included a small sample, which resulted in lower test performance.
In conclusion, this meta-analysis demonstrates that dual time point 18FDG-PET/CT and single time point 18FDG-PET/CT have the similar accuracy compared with the gold standard in the differential diagnosis of pulmonary nodules. Dual time point 18FDG-PET/CT appears to be more specific than single time point 18FDG-PET/CT. Nevertheless, certain limitations remain with respect to the diagnosis of lung cancer; thus, larger samples are needed to confirm the findings of this meta-analysis in further studies.
