Abstract
The purpose of the study was to explore the differences of conventional ultrasound characteristics, thyroid imaging reporting and data system (TI-RADS) category and shear wave speed (SWS) measurement between follicular adenoma (FA) and follicular thyroid carcinoma (FTC). Twenty-eight FTCs and 67 FAs proven by surgery were retrospectively included for analysis. Conventional ultrasound and point-shear wave elastography (p-SWE) were performed in all of the included patients. The ultrasound features, American Thyroid Association (ATA) TI-RADS category and American College of Radiology (ACR) TI-RADS category, SWS measurement were compared between the two groups. Receiver operating characteristic (ROC) curve was performed and area under ROC curve (AUC) was obtained for significant features. There were no statistical differences in mean age (46.9±15.7years vs. 48.6±13.6years, P = 0.639), gender (9 males, 32.1% vs. 18 males, 29.0%, P = 0.766) and mean diameter (28.3±16.2 mm vs. 33.8±11.9 mm, P = 0.077) between FTCs and FAs. Hypoechogenicity, lobulated or irregular margin, macrocalcification were more common in FTCs than FAs (all P < 0.05). Mean SWS of FTCs (2.29±0.64 m/s) was slightly higher than that of FAs (1.94±0.68 m/s) (P = 0.023). The AUCs were 0.655, 0.744, and 0.744 with the cut-off SWS≥1.89 m/s, ACR TI-RADS category 4 and intermediate suspicion of ATA TI-RADS category. The sensitivity and AUC were 82.1% and 0.812 with combined ultrasound features of hypoechogenicity, lobulated or irregular margin and macrocalcification. In Conclusion, SWS measurement and TI-RADS categories were useful for the identification of FTCs from FAs.
Keywords
Introduction
Follicular thyroid carcinoma (FTC) is the second malignancy in thyroid cancers (10–22%), followed by papillary thyroid carcinoma (PTC) (80–90%) [1, 2]. High resolution ultrasound examination is the preferred method for thyroid nodules [2]. The differential diagnosis for thyroid malignancy from all the thyroid nodules depends on the suspicious features on ultrasound images including solid, hypoechoic, irregular of microlobulated margin, taller than wide shape and microcalcifications with sensitivity of 26–87% and specificity of 53–93% [3, 4]. However, it remains difficult to distinguish FTC from follicular neoplasms (FN) before surgery in thyroid nodules management.
Thyroid imaging reporting and data system (TI-RADS) based on the above-mentioned features has been widely recognized in clinical practice. TI-RADS category has been widely used with sensitivity 75.0–89.6% for American Thyroid Association (ATA) TI-RADS category and 74.7–92.0% for American College of Radiology (ACR) TI-RADS category [5–8]. However, these TI-RADS categories are extracted from overall thyroid carcinomas in which most of them are PTCs, it is unknown whether they are applicable for the diagnosis of FTC. Kim et al. reported that there were no statistically significant differences in tumor size and growth rate between FTC and FA, leading to that short-time follow up by US had no effect for differentiation of FTC and FA [9]. Another study reported that presence of calcifications, heterogeneous echo texture and solid component were risk factors for FTC, with odds ratios, 25.6, 24.9, and 9.4, respectively [10].
Radionuclide thyroid scan is recommended when the patient’s serum TSH is subnormal with or without thyroid nodules [2]. Focal [18F] fluorodeoxyglucose positron emission to mography (18FDG-PET) uptake is not recommended for those nodules >10 mm because of increased risk of thyroid cancer [2]. The utilization of SPECT/CT in FTC patients can detect the presence of metastases (lung, bone, adrenal, liver, and brain) and exclude potential false-positive lesions [11]. Contrast-enhanced ultrasound (CEUS) is a valuable supplemental method for thyroid carcinoma with 90% sensitivity and 86% specificity, while characteristics of FTC and FA had not been mentioned [12–14].
Fine-needle aspiration (FNA) is the standard method to detect thyroid malignancy before surgery with sensitivities of 54–90% and specificities of 60–96% [2]. However, FNA is expensive and invasive and 17–22% of FNA results are indeterminate and unavailable. In addition, FNA cytology IV (follicular neoplasm or suspicious for a follicular neoplasm) indicates that FNA was unsuitable for FTC.
Elastography can be used to evaluate the hardness of tissue and hardness is considered to be related with malignancy. For shear wave elastography (SWE), transient pulses from the linear array probe are used to generate propagating shear waves in the region of interest (ROI) and shear wave speed (SWS) is calculated, which can quantitatively evaluate the tissue stiffness [15]. Point shear wave elastography (p-SWE) can obtain the mean SWS of the 6×5 mm sampling box by transmitting on point shear wave. P-SWE and/or combination of TI-RADS are proved to be useful in differential diagnosis between benign nodules and malignant nodules with sensitivity of 57–97% and specificity of 62–100% [16–19]. Besides, it was reported that CEUS and SWE had similar diagnostic performance (accuracy 86.8% vs. 84.3%) and accuracy was increased to 93.1% from combination of CEUS and SWE [20].
Samir et al reported that SWE was helpful for thyroid malignancy in those nodules with fine needle aspiration (FNA) cytological results of FNA III (follicular lesion of undetermined significance or atypia of undetermined significance) and FNA IV (follicular neoplasm or suspicion for follicular neoplasm). The sensitivity and specificity was 82% and 88% with cut-off 22.3 kPa [21]. Therefore, we hypothesize that SWS measurement is statistically different between FTC and FA and SWS measurement is useful for the diagnosis of FTC.
Therefore, in this retrospective study, we assessed the differences of clinical features, conventional ultrasound features, ATA TI-RADS category, ACR TI-RADS category and SWS measurement between FTC and FA, the purpose of which was to find out the potential significant parameters and analyze their diagnostic performance for differentiation between FTC and FA.
Methods and materials
Patient selection
This retrospective study was approved by the Ethical Committee of the university affiliated hospital (approval number 19K100) and informed consent was waived. The inclusion criteria were as following: (a) patient age was 18–80 years old; (b) the patient received conventional ultrasound and p-SWE examination before surgery; (c) proven by surgery and pathological result was FTC or FA. 128 patients met the inclusion criteria. Patients were excluded when one of the following situations appeared: (a) the nodule was mostly cystic and the solid portion was smaller than 6×5 mm (n = 10); (b) the image data was incomplete (n = 17); (c) the pathological result of the target nodule was unclear or uncertain when the patient had multifarious pathological results (n = 11).
For one patient, only one thyroid nodule was included for analysis in the study. If nodules were multiple, the largest one was included for evaluation. Finally, 90 nodules (mean diameter, 32.1±13.6 mm; range, 8–63 mm) from 90 patients (27 males and 63 females; mean age, 48.0±14.2 years; range, 18–70 years) were included for analysis, including 28 FTCs (mean diameter, 28.3±16.2 mm; range, 9–58 mm) and 62 FAs (mean diameter, 33.8±11.9 mm; range, 8–63 mm).
Conventional ultrasound examination
All the incorporated patients underwent conventional ultrasound examination before surgery using a linear transducer (frequency: 10–12 MHz) from Siemens S2000 (Siemens Medical Solutions, Mountain View, Calif, USA) or Siemens S3000 (Siemens Medical Solutions, Mountain View, Calif, USA). The patients were lying in the examination couch and make sure the neck skin fully exposed. The coupling medium was used to reduce the interference of air between the linear transducer and skin. For each patient, transverse and longitudinal plane examination of gray scale and color Doppler flow image were performed. The diameter of tumor was the maximal diameter measured in longitudinal plane.
P-SWE examination
The target nodule was confirmed before p-SWE examination. ROI was placed on the solid area without macrocalcifications or cystic parts. The operator was asked to hold the transducer gently with minimum pressure on the patient neck skin. The patient was required to hold the breath during the acquisition of p-SWE. Generally, ROI was placed at the solid area without macrocalcifications. P-SWE was evaluated by SWS (m/s) measurement and effective SWS measurement was acquired at least 7 times. The SWS ranges from 0.5 to 8.4 m/s. The numbers displayed as “X.XX m/s” was replaced by 8.4 m/s corresponding to the solid portion and deleted when corresponding to the cystic portion. The mean SWS of the nodule was calculated after maximum and minimum were deleted. The whole process of conventional ultrasound and p-SWE examination lasted about 10 minutes.
Interpretation of conventional ultrasound features
The analyzable ultrasound features were as followed (classified according by thyroid ultrasound reporting lexicon from ACR): nodule composition (mixed cystic and solid, solid), echogenicity (hyperchoic, isoechoic, hypoechoic, mixehoic), shape (wider than tall shape, taller than wide shape), margin (smooth margin, ill-defined margin, lobulated or irregular margin), calcifications (none or large comet-tail artifacts, macrocalcifications, peripheral calcifications, microcalcifications). The echogenicity was compared with the surrounding thyroid tissue. Microcalcifications were defined as foci less than 1.5 mm. Vascularity was evaluated in the color Doppler flow imaging with velocity scale of 5–10 cm/s.
Interpretation of ATA and ACR TI-RADS category
ATA TI-RADS category [2] was classified as benign (risk of malignancy: <1%), very low suspicion (risk of malignancy: <3%), low suspicion (risk of malignancy: 5–10%), intermediate suspicion (risk of malignancy: 10–20%), high suspicion (risk of malignancy: 70–90%). ACR TI-RADS category [22] was classified as category 1 (0 point), category 2 (2 points), category 3 (3 points), category 4 (4–6 points), category 5 (≥7 points) according ultrasound features. The ultrasound features of thyroid nodules were divided into benign features (0 point each, including cystic or almost completely cystic, spongiform, large comet-tail artifacts and smooth margin), minimally suspicious features (1 point each, including hyperchoic or isoechoic, mixed cystic and solid, macrocalcifications), moderately suspicious features (2 points each, including solid or almost completely solid, hypoechoic, lobulated or irregular margin, peripheral calcifications) and highly suspicious features (3 points each, including markedly hypoechoic, extra-thyroidal extension, microcalcifications, taller than wide shape).
Statistical analysis
All the statistical analysis in the study was performed using SPSS 23.0 software. Continuous variables were expressed as mean±standard deviation (SD) and range (age, diameter, SWS). Enumeration data and categorical data were expressed as numbers (gender, ultrasound features, et al). The measurement data were compared using independent-samples t test while enumeration data using χ2 test (Fisher’s exact test was performed when one of numbers less than 5), and categorical data using non-parametric tests (echogenicity, calcifications, TI-RADS category). The receiver operating characteristic (ROC) curves of significant features were plotted and the the area under ROC curve (AUC) and its 95% confidence interval (CI) were obtained. The comparison of AUCs of different features was performed by using Z test. The cut-off value was acquired with the maximum of Youden index (sensitivity + specificity-1). Sensitivity, specificity, accuracy, positive predictive valve (PPV) and negative predictive value (NPV) of the significant features were calculated. A two-tailed P-value < 0.05 indicated statistically significant difference.
Results
Basic characteristics
The clinical information of FTC and FA was shown in Table 1. Finally, 28 FTCs (mean diameter, 28.3±16.2 mm) from 28 patients (9 males and 19 females) and 62 FAs (mean diameter, 33.8±11.9 mm) from 62 patients (27 males and 35 females) were included in the study. There were no statistically differences in mean diameter, gender and mean age (P = 0.077, 0.766, 0.639 respectively) between FTC and FA. Four FTCs had cervical lymph node metastasis and one of them also had distant metastasis (lung). 27 patients of the 28 FTC patients had single FTC while one patient (lung metastasis) had PTC and FTC in the two lobes respectively.
Comparison of clinical features, ultrasound features and SWS between follicular thyroid carcinoma and follicular adenoma
Comparison of clinical features, ultrasound features and SWS between follicular thyroid carcinoma and follicular adenoma
Note: SWS, shear wave speed; No., numbers;*, statistical significance.
The ultrasound features and SWS of FTC and FA were presented in Table 1. There were statistical differences in echogenicity, margin and calcifications between FTC and FA. Hypoechogenicity, lobulated or irregular and macrocalcification were more common in FTCs than FAs (all P < 0.05) (Fig. 1). However, the highly suspicious characteristics such as taller than wide shape and microcalcification had no significant differences between FTCs and FAs (P = 0.073 and P = 0.311 respectively). Taller than wide shape and microcalcification were uncommon in FTCs. Mean SWS of FTCs was higher than that of FAs (2.29±0.64 m/s vs. 1.94±0.68 m/s, P = 0.023).

Ultrasound features of follicular thyroid carcinoma and follicular adenoma. A, the nodule is hypoechoic (FTC); B, the nodule is hypoechoic and irregular in shape (FTC); C, the hypoechoic nodule has marocalcifications (FTC); D, the mixed cystic and solid has smooth margin (FA); E, the nodule is solid and isoechoic (FA); F, the nodule is solid and hyperechoic (FA). FTC, follicular thyroid carcinoma; FA, follicular adenoma.
ATA TI-RADS category and ACR TI-RADS category of FTC and FA were presented in Table 2. There were statistical differences in ATA TI-RADS category and ACR TI-RADS category between FTCs and FAs (all P < 0.001) (Figs. 2 and 3). The highest frequency was low suspicion of ATA TI-RADS category (77.4%) and ACR TI-RADS category 3 (58.1%) in FAs while was high suspicion of ATA TI-RADS category (39.3%) and ACR TI-RADS category 4 (46.4%) for FTCs.

Follicular adenoma in a 59 years old man. A, the 37×24 mm nodule in left lobe of thyroid is solid and isoechoic on gray scale ultrosound; B, the nodule shows peripheral flow on color doppler flow imaging. The nodule is low suspicious of ATA TI-RADS category and is ACR TI-RADS category 3; C, SWS is 1.10 m/s on p-SWE measurement; D, the nodule is proven to be thyroid follicular adenoma pathologically (H&E staining, magnification, 10×10). ATA, American Thyroid Association; TI-RADS, thyroid imaging reporting and data system; ACR, American College of Radiology; SWS, shear wave speed; p-SWE, point shear wave elastography.

Follicular thyroid carcinoma in a 29 years old woman. A, the 13×8 mm nodule in right lobe of thyroid shows isoechogenicity, irregular margin and peripheral (rim) calcifications on gray scale ultrosound; B, the nodule shows peripheral flow on color doppler flow imaging. The nodule is intermediate suspicious on ATA TI-RADS category and is ACR TI-RADS category 4; C, SWS is 2.21 m/s on p-SWE measurement; D, the nodule is proven to be minimally invasive follicular thyroid carcinoma pathologically (H&E staining, magnification, 10×10). The capsule is incomplete and infiltrative by blood vessel (arrow). ATA, American Thyroid Association; TI-RADS, thyroid imaging reporting and data system; ACR, American College of Radiology; SWS, shear wave speed; p-SWE, point shear wave elastography.
Comparison of ATA TI-RADS category and ACR TI-RADS category between follicular thyroid carcinoma and follicular adenoma
Note: ACR, American College of Radiology; ATA, American Thyroid Association; TI-RADS, Thyroid Imaging Reporting and Data System;#, significant features were hypoechogenicity, lobulated or irregular margin and macrocalcifications;*, statistical significance.
The diagnostic performance of conventional ultrasound features, SWS measurement and TI-RADS categories is presented in Table 3. Lobulated or irregular shape, macrocalcifications and SWS≥1.89 m/s had moderate diagnostic values with AUCs ranging 0.5–0.7. Hypoechogenicity, ATA TI-RADS category≥intermediate suspicion and ACR TI-RADS category≥4 had good diagnostic values with AUCs ranging 0.7–0.9.
Diagnostic performance of significant parameters
Diagnostic performance of significant parameters
Note: SWS, shear wave speed; ACR, American College of Radiology; ATA, American Thyroid Association; TI-RADS, Thyroid Imaging Reporting and Data System; PPV, positive predictive value; NPV, negative predictive value; AUC, area under receiver operating characteristic curve.
The AUC of combined application in ultrasound significant feature (one or more suspicious features appeared) was statistically higher than that of macrocalcifications (0.812 vs. 0.630, P = 0.024), lobulated or irregular shape (0.812 vs. 0.661, P = 0.046), and SWS≥1.89 m/s (0.812 vs. 0.655, P = 0.034), while no statistical difference was found for hypoechogenicity (0.812 vs. 0.717, P = 0.200), ATA TI-RADS category≥intermediate suspicion (0.812 vs. 0.744, P = 0.354) and ACR TI-RADS category≥4 (0.812 vs. 0.744, P = 0.354). The highest valve belonged to the combined application of hypoechogenicity, lobulated or irregular shape and macrocalcifications with sensitivity 82.1% and AUC 0.812.
The present study compared the clinical features, conventional ultrasound characteristics, SWS measurement, ATA and ACR TI-RADS category between FTC and FA, and explored the diagnostic performance of significant parameters in distinguishing FTC from FA. There were statistical differences in hypoechogenicity, lobulated or irregular margin, macrocalcifications, SWS measurement, ATA and ACR TI-RADS category between FTC and FA. The combined application of hypoechogenicity, lobulated or irregular margin, macrocalcifications had the highest sensitivity and AUC.
In previously studies [8, 24], malignant thyroid nodules were smaller than benign nodules and more common in females and younger person. However, there were no differences in gender, mean age and mean diameter between FTCs and FAs in this study. Cystic degeneration was considered as a characteristic of FA [10] and the similar result was present in our study while no statistical difference (15 of 62 in FAs vs. 3 of 28 FTCs, P = 0.114). Most FAs were solid or almost completely solid, so solid component was not a differential point for FTCs. Hypoechogenicity, lobulated or irregular margin, macrocalcifications were the significant features on conventional ultrasound for FTC in the present study. It can be hypothesized that out of control of follicular cells growth leading to reduction of echogenicity and lobulated or irregular margin. In ATA TI-RADS category, hypoechogenicity was intermediate suspicion, lobulated or irregular margin was high suspicion and macrocalcifications was not specified. In ACR TI-RADS category, hypoechogenicity and lobulated or irregular margin was intermediate suspicion (2 points), macrocalcifications was low suspicion (1 point). Therefore, ATA and ACR TI-RADS category had higher classification in FTCs than FAs (P < 0.001).
Microcalcifications and taller than wide shape, which were highly suspicious features in thyroid nodules were rare in FTC (3.6% and 24.3%), markedly less than in PTC (40.4% and 51.2%) [8]. Thus, 54.2% malignant thyroid nodules were high suspicion for ATA TI-RADS category while only 39.3% FTCs in our study. For ACR TI-RADS category, 52.2% malignant thyroid nodules were category 5 while only 25.0% in this study [5]. ATA and ACR RI-RADS category had equivalently diagnostic value for FTC and FA, the sensitivity was 71.4% and AUC was 0.744 with intermediate suspicion of ATA TI-RADS category and ACR TI-RADS category 4. ATA and ACR TI-RADS category were useful for FTC and FA (sensitivity 71.4%), while the diagnostic performance was inferior to the previous studies that TI-RADS categories were performed in overall nodules (Sensitivity 74.7–98.3%) [25–28]. The sensitivity and AUC were improved to 82.1% and 0.812 when hypoechogenicity, lobulated or irregular margin and macrocalcifications were combined.
It was reported that 37.2% PTC had multifocal malignant tumors, multifocal PTC had higher invasiveness, cervical lymph node metastasis, TNM staging and recurrence [29, 30]. Literatures about multifocal FTC had not been found, likewise in our case there had no multifocal FTC. It is interesting to note that one patient with PTC in the left and FTC in the right resulted had lung metastasis. It was unclear whether multiple types of thyroid cancer promote the occurrence of metastasis.
SWS measurement of p-SWE can quantitatively evaluate the stiffness of thyroid nodules and the tissue stiffness is associated with malignancy. In this study, the mean SWS of FTCs were statistical higher than FAs, similar to the result of previous study that SWS of malignant nodules was larger than that of benign nodules [17]. However, the difference of mean SWS between FTC and FA (2.29±0.64 m/s vs. 1.94±0.68 m/s) was much smaller than that of malignant nodules and benign nodules (4.90±2.74 m/s vs. 2.10±1.04 m/s) [18]. Besides, the diagnostic performance was unsatisfactory with sensitivity 75.0%, specificity 53.2% and AUC 0.655 compared with the previous study with most of the malignancy being PTCs (sensitivity 82%, specificity 90% and AUC 0.86) [18]. The reason probably was that the pasmmoma bodies and fibrosis which markedly increase the stiffness of tissue were uncommon in FTC than PTC.
It was difficult to diagnose FTC from FN before surgery for that FNA cytology and frozen pathology had a large proportion of false negative rate [31–33]. However, in the present study, there were statistical differences in hypoechogenicity, lobulated or irregular shape, macrocalcification, SWS measurement, ATA and ACR TI-RADS category between FTC and FA. Therefore, comprehensive evaluation of these significant ultrasound features will be helpful to preoperatively predict malignancy and surgical planning in FNs, meanwhile possible disputes with patients can be reduced or avoided.
There had several limitations in this study. Firstly, it was a retrospective study. Prospective studies with large sample and long time follow-up results were needed for further analysis. Secondly, the sample size was small (28 FTCs and 62 FAs). More cases and long-term follow-up were needed for further study. In addition, the study didn’t include some relevant technologies in thyroid management (contrast-enhanced ultrasound, fine needle aspiration, gene mutation detection, et al).
Conclusion
Hypoechogenicity, lobulated or irregular shape, macrocalcification, SWS measurement, ATA and ACR TI-RADS category were useful for differential diagnosis between FTC and FA. The combinations of suspicious features, SWS measurement, ATA and ACR TI-RADS category may provide differential diagnosis value and help surgeons in making decisions regarding with selection of thyroid surgery or operation methods.
Conflicts of interest
The authors declare that there are no conflicts of interest regarding the publication of this article.
Footnotes
Acknowledgments
This work was supported in part by the National Natural Science Foundation of China (Grants 81671695, 81601502, 81725008, and 81927801), Fundamental Research Funds for the Central Universities (Grants 22120190021 and 22120190137), Shanghai Hospital Development Center (Grants 16CR3061B and SHDC12014229), and the Science and Technology Commission of Shanghai Municipality (Grants 19441903200, 16411971100 and 14441900900).
