Abstract
OBJECTIVE:
To explore the value of color-coded virtual touch tissue imaging (CCV) using acoustic radiation force pulse technology (ARFI) in diagnosing malignant thyroid nodules.
METHODS:
Images including 189 thyroid nodules were collected as training samples and a binary logistic regression analysis was used to calculate regression coefficients for Thyroid Imaging Reporting and Data System (TI-RADS) and CCV. An integrated prediction model (TI-RADS+CCV) was then developed based on the regression coefficients. Another testing dataset involving 40 thyroid nodules was used to validate and compare the diagnostic performance of TI-RADS, CCV, and the integrated predictive models using the receiver operating characteristic (ROC) curves.
RESULTS:
Both TI-RADS and CCV are independent predictors. The diagnostic performance advantage of CCV is insignificant compared to TI-RADS (P = 0.61). However, the diagnostic performance of the integrated prediction model is significantly higher than that of TI-RADS or CCV (all P < 0.05). Applying to the validation image dateset, the integrated predictive model yields an area under the curve (AUC) of 0.880.
CONCLUSIONS:
Developing a new predictive model that integrates the regression coefficients calculated from TI-RADS and CCV enables to achieve the superior performance of thyroid nodule diagnosis to that of using TI-RADS or CCV alone.
Keywords
Introduction
Providing patients with appropriate clinical management is inextricably linked to the accurate diagnosis of thyroid nodules [1, 2]. The advantages of real-time, convenience, non-radioactive and non-invasive are why conventional ultrasound (US) is the preferred choice [3, 4]. The US diagnoses thyroid nodules by echogenicity, margins, microcalcifications, etc. The Thyroid Imaging Reporting and Data System (TI-RADS), used by many radiologists, was introduced by the American College of Radiology (ACR) in 2017. Based on TI-RADS, radiologists classified thyroid nodules into five categories (TR1 to TR5) [5–7]. However, thyroid nodules may be misdiagnosed or missed when benign and malignant nodules present with similar US features. As a result of the extensive malignancy of TR4 and TR5 thyroid nodules, the management options are controversial. Patients may choose to have a fine needle aspiration (FNA) or surgical thyroid nodule removal. Unnecessary FNA or surgery can increase patient anxiety and medical costs [7, 8].
According to previous studies, malignant thyroid nodules often have more stiffness than benign ones. Acoustic radiation force impulse (ARFI) imaging, an elastography technique, can be used to assess the hardness of thyroid nodules. ARFI has been investigated as a diagnostic tool for thyroid nodules, mainly for Virtual Touch Tissue Quantification (VTQ) or Virtual Touch Tissue Imaging and Quantification (VTIQ). VTQ and VTIQ are quantitative methods of ARFI to measure the target tissue’s shear wave velocity (SWV). As the hardness of the tissue increases, the value of SWV also increases. However, it may be that the VTQ or VTIQ often displays X.XX m/s because the shear wave is of poor quality or exceeds the upper limit of the measurement. As a result, VTQ and VTIQ cannot be used widely. It is also possible to assess the stiffness of a nodule at a local level with VTQ or VTIQ but not at the whole nodule level [9–11].
As a qualitative assessment method for ARFI, virtual touch tissue imaging (VTI) uses acoustic pulses to investigate tissue displacement. VTI has the following advantages: the ROI size can be adjusted according to the size of the nodule; the measurement range is unlimited; it reflects the overall hardness of the nodule; and the hardness of the nodule can be analyzed semi-quantitatively by the scoring method. Using VTI technology, tissue hardness can be displayed in grayscale or color. Studies on VTI have been limited, and most studies use grayscale VTI to diagnose thyroid nodules. Compared to the ability to distinguish different colors, the human eye is significantly weaker in determining the degree of grayscale [12–16].
Therefore, we aim to investigate the ability of color-coded VTI (CCV) to distinguish benign from malignant thyroid nodules. The added value of CCV for TI-RADS will also be explored simultaneously.
Materials and methods
We have received ethical approval for this retrospective study. Due to the anonymity of all patient data, informed consent was not required.
Patients
From January 2016 to December 2021, we included a group of patients with thyroid nodules in this retrospective study, all of whom underwent US and CCV before surgery or biopsy. The decision to biopsy or operate depends on US findings, patient requests, or risk factors. The results of the CCV test do not affect the decision to perform surgery or biopsy.
The samples for analysis in the training set were 189 thyroid nodules from 159 patients from January 2016 to December 2019. As a validation set, we collected 40 thyroid nodules from January 2021 to December 2021. The flow chart of the study is shown in Fig. 1.
Following are the exclusion criteria: (1) Data from the US and CCV is incomplete; (2) There were incomplete pathological findings in thyroid nodules; (3) Patients who have had a previous thyroidectomy or biopsy of a thyroid nodule; (4) Any cancer patient receiving chemotherapy or radiation therapy.

Flowchart of the inclusion of patients in the training and validation sets. US = ultrasound. CCV= color-coded virtual touch tissue imaging.
Siemens Acuson S2000 (Siemens Medical Solutions Inc., CA, USA) and equipped with a 4–9 MHz linear transducer, was used as the equipment for US and CCV examinations in this study.
Image acquisition
All image acquisition was done by an experienced radiologist (an expert in thyroid diagnosis with more than ten years of experience). We first performed a US examination of each thyroid nodule and recorded the nodule’s characteristics, such as size, composition, margins, shape, and calcification. After the US exam was completed, we used the VTI function to check for thyroid nodules. We adjust the size and location of the region of interest (ROI) in VTI mode to include the thyroid nodule and part of the surrounding thyroid tissue. If the nodule’s size exceeded the ROI’s maximum range, we placed the ROI at the junction of the nodule and the surrounding thyroid tissue. In addition to grayscale maps, VTI also displays maps in color. When VTI is displayed in grayscale mode, the tissue in the bright areas is less hard than in the dark places. In the color-coded VTI mode, the order of hardness from high to low is red, orange, green, blue, and purple. Grayscale VTI was not used in this study, only color-coded VTI.
Image analysis
We randomized all thyroid nodules to be sorted after masking the patient’s name, gender, and age. Two radiologists reviewed the US and CCV images of the thyroid nodules together. Both radiologists had more than five years of experience in thyroid diagnosis. The US and CCV images were not acquired by two radiologists who were unaware of the pathological findings of thyroid nodules. An outcome was reached by discussion when two radiologists disagreed on the definition of image features.
Two radiologists evaluated US features of thyroid nodules according to ACR TI-RADS. In each US feature, a score was assigned, and those with the same score were merged and analyzed. We determined the classification based on the total score obtained for thyroid nodules, as follows: benign (0 points, TR1); no suspicion (2 points, TR2); mildly suspicious (3 points, TR3); moderately suspicious (4–6 points, TR4); and highly suspicious (7 points or more, TR5) [17].
In CCV, we assessed the hardness of both the nodule and the surrounding tissue. In this study, CCV scores were performed for each thyroid nodule as follows: If no red or orange is seen in the nodule, indicating low hardness (score 1); A nodule with an orange primary color and a green secondary color is rated as medium hard (score 2); Score 3 were assigned to nodules whose predominant colors were red and orange, followed by other colors, indicating high hardness; When nodules and surrounding tissues have a dominant color of red or orange, the nodules and surrounding tissues are considered to be hard (score 4) [15].
Interobserver agreement analysis
We randomly selected 30 cases of thyroid nodules for interobserver agreement analysis. Two radiologists with different experiences performed TI-RADS classification and CCV score on 30 thyroid nodules, for which they were unaware of the pathological findings. Both radiologists had six and three years of experience, respectively.
Statistical analysis
In this study, we used MedCalc (version 18.0) and SPSS software (version 26.0) for statistical analysis. Count data were analyzed using the chi-square test or Fisher’s exact test. We analyzed continuous variables using a t-test or Mann-Whitney U-test. Spearman’s rank correlation was used to assess the correlation between TI-RADS and CCV. We developed the logistic regression equation based on a binary logistic regression with TI-RADS and CCV as independent variables and pathological outcomes as dependent variables.
The equation includes are two regression coefficients X1 and X2, and one constant X0. The Hosmer and Lemeshow test was used to test the model’s goodness of fit. The integrated prediction model (TI-RADS+CCV) was defined as TI-RADS+X1/X2×CCV [14, 18]. An evaluation and comparison of TI-RADS, CCV, and integrated prediction model diagnostic performance were conducted using receiver operating characteristic curves (ROC). The validation set data were used to validate the diagnostic performance of the integrated prediction model [19, 20]. We included 40 thyroid nodules in the validation set, which met the same criteria as the training set. We assessed interobserver agreement using the Kappa statistic (poor agreement, K-value less than 0.40; moderate agreement, K-value between 0.40 and 0.60; substantial agreement, K-value between 0.60 and 0.80; and almost perfect agreement, K-value greater than 0.80). Comparisons between independent proportions were made using chi-square tests. Differences were statistically significant if the P-value was less than 0.05.
Results
Participant characteristics
In this study, 199 patients with 229 thyroid nodules were included. Table 1 summarizes the essential features, TI-RADS classification, and CCV scores of patients with thyroid nodules. The training set (n = 189) and validation set (n = 40) did not differ significantly in essential features, TI-RADS classification, or CCV scores (P > 0.05).
Basic characteristics, TI-RADS, and CCV in the training and validation sets
Basic characteristics, TI-RADS, and CCV in the training and validation sets
a = Determined with the χ2 test. b = Determined with Fisher’s exact test. c = Determined with the t-test. Notes. Unless otherwise noted, data points are the number of lesions, and numbers in parentheses are percentages. TI-RADS = thyroid imaging report and data system. CCV = color-coded virtual touch tissue imaging.
The training set included 46 malignant and 143 benign nodes, all confirmed by biopsy or surgery. The 46 malignant nodules included 43 papillary carcinomas and three follicular adenocarcinomas. And among 143 benign nodules, 129 were nodular goiter, 13 were follicular adenoma, and 1 was subacute thyroiditis.
The validation set included nine malignant and 31 benign nodes, all confirmed by biopsy or surgery. The nine malignant nodules had eight papillary carcinomas and one follicular adenocarcinoma. And among the 31 benign nodules, 26 were nodular goiter, and 5 were follicular adenomas.
The univariate analyses for the training set
Table 2 shows the US characteristics and CCV scores of thyroid nodules. Significant differences in US features between benign and malignant nodules included echogenicity, composition, margins, shape, and microcalcifications (all P < 0.001). In this study, malignant thyroid nodules were predominantly TR5 (P < 0.001). Benign nodules had a significantly lower CCV score than malignant nodules (P < 0.001) (Figs. 2 –5).
Sonographic features, TI-RADS, and CCV of benign and malignant thyroid nodules in the training dataset
Sonographic features, TI-RADS, and CCV of benign and malignant thyroid nodules in the training dataset
a = Determined with the χ2 test. b = Determined with Fisher’s exact test. Notes. Unless otherwise noted, data points are the number of lesions, and numbers in parentheses are percentages. A/T = anteroposterior/transverse diameter. TI-RADS = thyroid imaging report and data system. CCV = color-coded virtual touch tissue imaging.

Comparison of conventional US (left) and CCV (right) images of a follicular adenoma with a CCV score of 1 and a TI-RADS classification of TR3. US = ultrasound. CCV= color-coded virtual touch tissue imaging. TI-RADS = thyroid imaging reporting and data system.

Comparison of conventional US (left) and CCV (right) images of a case of nodular goiter with a CCV score of 2 and a TI-RADS classification of TR3. US = ultrasound. CCV= color-coded virtual touch tissue imaging. TI-RADS = thyroid imaging reporting and data system.

Comparison of conventional US (left) and CCV (right) images of papillary thyroid carcinoma with a CCV score of 3 and a TI-RADS classification of TR5. US = ultrasound. CCV= color-coded virtual touch tissue imaging. TI-RADS = thyroid imaging reporting and data system.

Comparison of conventional US (left) and CCV (right) images of papillary thyroid carcinoma with a CCV score of 4 and a TI-RADS classification of TR5. US = ultrasound. CCV= color-coded virtual touch tissue imaging. TI-RADS = thyroid imaging reporting and data system.
In the evaluation of thyroid nodules, a strong correlation was found between TI-RADS and CCV (R = 0.710, P < 0.001).
Multivariate analysis of TI-RADS and CCV in the training set
Both TI-RADS (odds ratio [OR]: 3.736; 95% confidence interval [CI]: 1.775, 7.863) and CCV (OR: 9.397; 95% CI: 3.620, 24.396) were independent predictors of malignant thyroid nodules(all P < 0.05) (Table 3). The regression equation was written as follows:
Multivariate logistic regression analysis based on TI-RADS and CCV in the training dataset
Multivariate logistic regression analysis based on TI-RADS and CCV in the training dataset
TI-RADS = thyroid imaging report and data system. CCV = color-coded virtual touch tissue imaging. CI = confidence intervals. B = regression coefficient. SE = standard error. Wald = chi-square value. The Hosmer & Lemeshow test was 0.893.
Diagnostic performance of TI-RADS, CCV, or both for thyroid nodules in the training dataset
Note. Unless otherwise indicated, the data point is a percentage, and the number in parentheses is 95% CI. TI-RADS = thyroid imaging report and data system. CCV = color-coded virtual touch tissue imaging. CI = confidence intervals. SEN = sensitivity. SPE = specificity. * = data are percentages and raw data in brackets. a = Compared with TI-RADS, P < 0.05. b = Compared with CCV, P < 0.05. c = Compared with TI-RADS+CCV, P < 0.05.
Additionally, Hosmer and Lemeshow test had a value of 0.893.
In the present study, the area under the curve (AUC) was higher for CCV than for TI-RADS, but the difference was insignificant (P = 0.6104). The integrated prediction model had the largest AUC, whether compared with TI-RADS or CCV (all P < 0.05) (Fig. 6).

ROC analysis of TI-RADS, CCV, and TI-RADS+CCV in the training set. ROC = receiver operating characteristic curves. TI-RADS = thyroid imaging reporting and data system. CCV = color-coded virtual touch tissue imaging.
Table 5 shows the variables used in the validation set for the integrated prediction model. The integrated prediction model performed equally well for diagnosis in the validation set, with AUC, sensitivity, and specificity of 0.880, 88.89%, and 77.42%, respectively (Table 6) (Fig. 7).
TI-RADS, CCV, or both for thyroid nodules in the validation dataset
TI-RADS, CCV, or both for thyroid nodules in the validation dataset
a = Determined with Fisher’s exact test. b = Determined with the t-test. Notes. Unless otherwise noted, data points are the number of lesions, and numbers in parentheses are percentages. TI-RADS = thyroid imaging report and data system. CCV = color-coded virtual touch tissue imaging.
Diagnostic performance of TI-RADS+CCV in the validation dataset
Note. Unless otherwise indicated, the data point is a percentage, and the number in parentheses is 95% CI. TI-RADS = thyroid imaging report and data system; CCV = color-coded virtual touch tissue imaging; CI = confidence intervals; SEN = sensitivity; SPE = specificity. * = data are percentages and raw data in brackets.

ROC analysis of TI-RADS+CCV in the validation set. ROC = receiver operating characteristic curves. TI-RADS = thyroid imaging reporting and data system. CCV = color-coded virtual touch tissue imaging.
Our results show that radiologists with different experiences can obtain substantial agreement using the TI-RADS (K-value: 0.658 and 95% CI: 0.442, 0.874) and CCV (K-value: 0.633 and 95% CI: 0.396, 0.869).
Discussion
In the training set, malignant nodules were mainly scored as 3 and 4, while benign nodules were mainly scored as 1 and 2. When the hardness of thyroid nodules and surrounding tissues increases, the nodule may be malignant. Because the invasion of cancer cells may increase the hardness of the surrounding tissues. The CCV score effectively diagnosed malignant thyroid nodules with a sensitivity, specificity, and AUC of 89.13%, 79.72%, and 0.907, respectively. Our results are consistent with those of previous elastography studies [21, 22].
Conventional US remains the primary method for diagnosing thyroid nodules despite the presence of elastography. In the present study, the area under the curve (AUC) was higher for CCV than for TI-RADS, but the difference was insignificant (P = 0.6104), similar to previous studies [23, 24].
There were limitations when the CCV was used alone. The results of CCV may vary based on the pulsations of the large blood vessels in the neck. The CCV does not provide information about the thyroid nodule or surrounding tissue other than its hardness. Benign nodules and malignant nodules can have the same level of hardness.
The TI-RADS classification is a semi-quantitative comprehensive evaluation method that classifies thyroid nodules based on US characteristics. Malignant thyroid nodules that are small in size and have an aspect ratio of less than 1, or benign nodules with internal calcification, can be misdiagnosed by the TI-RADS classification alone.
The single application of CCV or TI-RADS classification produces a high rate of misdiagnosis, whereas the combined application allows for complementarity and improved diagnostic performance. Based on the results of previous studies, the combination of elastography and conventional US could improve the diagnostic accuracy of thyroid nodules [25–27]. Our results showed that the combined prediction model had the largest AUC (all P < 0.05) compared to either TI-RADS or CCV.
In this study, we generated a prediction model (TI-RADS+CCV) using the training set data and validated them using an external validation set. The validation set does not contain duplicate data, and the training set and validation set do not overlap. In the validation set, TI-RADS+CCV showed good diagnostic power with a sensitivity of 88.89%, specificity of 77.42%, and AUC of 0.880.
There were the following limitations in this study. Our study was a retrospective study and might have some selectivity bias. We only included patients with thyroid nodules who underwent surgical resection or biopsy for the study and, therefore, might have resulted in a higher malignancy rate that was not representative of the overall population incidence. The proportion of malignant nodule cases was small, and the pathological types were unevenly distributed; in the follow-up study, we will expand the sample size and increase the pathological types. The same radiologist did all acquisitions of conventional US images and CCV images in our study. There was no comparison of differences in image acquisition by different radiologists. Similarly, the diagnostic performance of other radiologists who use CCV was not analyzed, and future studies should address and improve these issues.
Conclusion
Based on the results of this study, we thus recommend CCV as an adjunct to TI-RADS to improve diagnosis because CCV provides additional information on hardness.
Footnotes
Acknowledgments
The authors gratefully acknowledge Wenxia Lin, Xiaohuan Zhu, and Xiao-E Huang from the Department of Ultrasound, The First Affiliated Hospital of Shantou University Medical College, Shantou, Guangdong Province, P.R. China.
Conflict of interest
The authors declare that they have no conflict of interest.
Ethical approval
This retrospective study was applied for review and approved by the Institutional Review Board (The first affiliated Hospital of Medical College of Shantou University-Ethical review-No. B-2021-020).
Informed consent
The institutional review board approved waivers of informed consent.
Funding
This study was supported by the Shantou Bureau of Science and Technology (NO. 210413116491209).
