Abstract
Background
The relevance of Epstein–Barr virus (EBV) in gastric carcinoma has been represented by the existence of EBV-encoded small RNA (EBER) in the tumor cells and has prognostic significance in gastric cancer, while gastric adenocarcinoma represents the most frequently occurring gastric malignancy.
Purpose
To observe the capacity of radiomic features extracted from contrast-enhanced computed tomography (CE-CT) images to differentiate EBER-positive gastric adenocarcinoma from EBER-negative ones.
Material and Methods
A total of 54 patients with gastric adenocarcinoma (EBER-positive: 27, EBER-negative: 27) were retrospectively examined. Radiomic imaging features were extracted from all regions of interest (ROI) delineated by two experienced radiologists on late arterial phase CT images. We distinguished related radiomic features through the two-tailed t test and applied them to construct a decision tree model to evaluate whether EBER in situ hybridization positive had appeared.
Results
Nine radiomics features were significantly related to EBER in situ hybridization status (P < 0.05), four of which were used to build the decision tree through backward elimination: Correlation_ AllDirection_offset7, Correlation_ angle135_offset7, RunLengthNonuniformity_ AllDirection_offset1_SD, and HighGreyLevelRunEmphasis_ AllDiretion_offset1_SD. The decision tree model consisted of seven decision nodes and six terminal nodes, three of which demonstrated positive EBER in situ hybridization. The specificity, sensitivity, and accuracy of the model were 84%, 80%, and 81.7%, respectively. The area under the curve of the decision tree model was 0.87.
Conclusion
Radiomics based on CE-CT could be applied to predict EBER in situ hybridization status preoperatively in patients with gastric adenocarcinoma.
Introduction
Gastric carcinoma is one of the most common gastrointestinal malignancies around the globe (1). In 2015, gastric cancer ranked ninth for cancer incidence and eighth for cancer deaths in the United Kingdom (2). Since 1997, Epstein–Barr virus (EBV) has been regarded as causally related to gastric carcinomas in a small portion of patients (3–5). Studies have found that EBV-associated gastric carcinoma is related to improved survival and can be efficiently treated with chemotherapy (6). EBV-encoded RNA (EBER) is the expression product of EBV, which exists in high nuclear numbers in nuclei. It is applied as an accurate supplementary meant to diagnose EBV positive malignancies histopathologically (16). Since gastric adenocarcinoma represents the most frequently occurring gastric carcinoma, the diagnosis of EBV-positive gastric adenocarcinoma is of great significance, both for treatment decision and prognosis of gastric cancers. However, the identification of EBER in situ hybridization is usually possible through postoperative testing of biopsy tissues or surgical specimens, while the opportunity to confirm EBER status preoperatively is limited. Thus, the modality to confirm EBER in situ hybridization status preoperatively is of great significance for efficient and precise treatment of gastric adenocarcinoma.
Multidetector computed tomography (MDCT) is shown to be a successful tool for preoperative staging work as well as detection of distant nodal metastasis for gastric carcinoma. It has fine spatial resolution and can quantify biological and physical characteristics of the microenvironment that attenuates X-ray (7–10).
Radiomics, which can be applied to different imaging modalities including ultrasound, positron emission tomography, radiography, CT, and magnetic resonance imaging (MRI), is a non-invasive technique that involves extraction of a large quantity of high-throughput imaging features. These features quantify tumoral heterogeneity by measuring spatial variation in gray-level intensities in imaging information. Radiomics on CT has demonstrated promising result on the prediction of pathological staging, therapy response, and occult peritoneal carcinomatosis of gastric carcinoma (11,14,15). However, there is still no study on the application of radiomics from CT as a modality to predict EBER in situ hybridization in gastric adenocarcinoma. Therefore, the aim of the present study was to examine the potential of CT radiomics to identify EBER status of gastric adenocarcinoma.
Material and Methods
Study population
The Clinical Research Ethics Committee approved the study and the requirement for written informed consent was waived. A total of 83 patients diagnosed pathologically with EBV-positive gastric adenocarcinoma from January 2015 to December 2018 and 106 patients diagnosed pathologically with EBV-negative gastric adenocarcinoma from January 2015 to December 2015 are included in the study. The inclusion criteria were as follows: (i) pathological diagnosis of gastric adenocarcinomas by gastroscopic biopsy or surgery; (ii) pathologically confirmed with positive EBER; (iii) available contrast-enhanced CT (CE-CT) examination before therapy; (iv) patient did not undergo any preoperative chemotherapy or surgery before the examination. The exclusion criteria were as follows: (i) lesions without absolute borders for region of interest (ROI); and (ii) poor CT image quality due to artifact. Finally, a total of 54 patients (27 patients with EBV-positive adenocarcinoma and 27 patients with EBV-negative adenocarcinoma; age range = 24–75 years; mean age = 53 years) were included in the study. The flow chart of this study is shown in Fig. 1. The clinicopathological characteristics of the study cohort are shown in Table 1.

Flow of patients through the study.
Demographic and clinical characteristics of the study population (n = 54).
*Values are given as median (range).
EBV, Epstein–Barr virus.
CT acquisition
An informed consent was signed by each patient for CT examination, and patients were required to fast from solid food for at least 2 h before the examination and were instructed to drink 500 mL of water to reach gastric distension. Two 64-section and one 128-section spiral MDCT scanners were used. The patients were instructed to hold their breath during the scan. All patients were in the supine position while the scan covered the entire or upper abdomen. After the unenhanced scan, a high-pressure syringe (Medrad Stellant CT; Bayer HealthCare LLC, Whippany, NJ, USA) was used to inject 100–120 mL of iodinated contrast medium (Optiray 320 mg iodine/mL; Jiangsu Hengrui Medicine, Lianyungang, Jiangsu, PR China) sampler system at a velocity of 2.5 mL/s; late arterial phase with a post-injection delay of 35–40 s. The parameters of the CT included in this study were: tube current = 250–300 mA; tube voltage = 120 kVp; slice interval = 5 mm; slice thickness = 5 mm; matrix = 512 × 512; field of view = 36–42 cm; rotation time = 0.5–0.8 s; pitch = 0.844–0.993. CT acquisition was performed in the form of a spiral dataset; meanwhile, imaging review was performed using 5-mm continuous axial reconstruction.
Pathological evaluation
In accordance with the World Health Organization (WHO) categorization of tumors of the Digestive System (2010 edition), tumor differentiation through endoscopic biopsy was evaluated as one of the preoperative indicators in our retrospective study. The EBER in situ hybridization status was confirmed through needle biopsy. Among the 54 patients with gastric adenocarcinomas, 52 patients received radical gastrectomy (whole or partial) and the remaining two received palliative resection. Every patient had one identified lesion. A pathologist with 30 years of experience in gastrointestinal pathology retrospectively analyzed the postoperative histopathological specimens. Lauren classification, the level of differentiation, and neural as well as vascular invasion of every tumor were assessed and noted down according to the WHO categorization. The results of the gastric carcinomas were as follows: well differentiated, n = 0; moderately differentiated, n = 8; poorly to moderately differentiated, n = 16; and poorly differentiated, n = 30 poorly differentiated. There were 24 diffuse, 15 intestinal, and 15 mixed types in accordance with the Lauren categorization; there were 35 and 19 gastric adenocarcinoma cases with and without neural invasion, respectively; and there were 31 and 23 gastric adenocarcinoma cases with and without vascular invasion, respectively. The average tumor size in the training and validation cohorts was 38 cm3 (range = 7–73 cm3) and 22 cm3 (range = 9–55 cm3). Furthermore, the amount of gastric cancer with lymphoid stroma (GCLS) was also noted down as another primary pathological characteristic of EBV gastric carcinoma (27). Generally, there were 19 lesions with a small amount of GCLS, 17 lesions with a medium amount of GCLS, and 18 lesions with a high amount of GCLS. The pathology images of GCLS are demonstrated in Fig. 2, and the pathological characteristics of EBER are shown in Fig. 3.

Example of gastric cancer with lymphoid stroma (white arrow) and cancer cells (black arrow) under 200× microscope.

Comparison of EBER-negative gastric adenocarcinoma cell (black arrow) and EBER-positive gastric adenocarcinoma cell (white arrow) under 400× microscope.
Feature selection
Two radiologists segmented the ROI from late arterial CT images independently. They were blinded to the pathological and clinical information of the patients including age, gender, name, TNM stage (tumor, lymph node, and metastasis), degree of tumor differentiation, histological type, neural and vascular invasion of the lesion, and Lauren classification, as this is the effective phase for identifying the mucosal enhancement for early gastric cancers (Fig. 4) (26).

Example of segmentation of gastric adenocarcinoma in late arterial phase images. (a) Original image. (b, c) Segmentation by two experienced radiologists. (d) Ground truth tumor image showing the area of overlap between the two readers.
Late arterial phase CE-CT images for all patients were derived from the picture archiving and communication system (PACS), after which the images were uploaded into in-house software (Artificial Intelligence Kit; GE, Boston, MA, USA). The two radiologists manually segmented a polygonal ROI along the edge of the tumor on each slice the lesion involved. The gastric cavity and artifacts were carefully avoided when drawing the ROIs.
Feature extraction was performed on all ROIs in late arterial phase CT images of the 54 patients. The final tumor area was determined by the overlapping regions of two ROIs that were segmented separately by the two radiologists. Areas with uncertainty were not included in the delineation.
Through manual delineation by two radiologists, a total of 385 radiomic features were extracted from the tumor region. The feature extraction was processed using in-house software (Artificial Intelligence Kit; GE). Quantitative radiomic features were separated into three subcategories: first-order statistical features; textural features; and shape- and size-based features. First-order statistical features demonstrate distribution of voxel intensities within the CT images through basic and commonly used indicators (e.g. entropy, kurtosis, energy). Textural features were calculated using gray-level co-occurrence (GLCM) and gray-level run-length texture matrices (GLRLM). Gray-level co-occurrence (GLCM) is an effective texture feature type that utilizes second-order statistics to distinguish two adjacent pixel values at specific sites. The GLCM features applied in our research were energy, entropy, correlation, inertia, inverse different moment, cluster shade, cluster prominence, and Haralick correlation (22–24). GLRLM is another texture feature type that can extract the spatial plane features of each pixel. In our study, 10 GLRLM features were selected, i.e. long run emphasis, short run emphasis, run length non-uniformity, gray-level non-uniformity, high gray-level run emphasis, low gray-level run emphasis, short run low gray-level emphasis, long run low gray level emphasis, short run high gray level emphasis, long run high gray level emphasis (25,26). Features based on shape and size provide an indication of the tumor’s compactness, volume, and area, as well as the tumor’s spherical, circularity, or slenderness (17).
The first-order statistical, textural, and shape- and size-based features account for 42, 324, and 19 features, respectively.
Statistical analysis
The Kolmogorov–Smirnov normality test was applied to evaluate the distribution characteristics of the CT radiomics parameters of gastric adenocarcinomas. Depending on the normality test results, univariate analysis was performed by the Mann–Whitney U test for continuous characteristics and Pearson’s chi-square test or Fisher’s exact test for categorical characteristics. Statistical power and 95% confidence intervals (CI) of the difference of statistically significant CT texture parameters in univariate analysis were calculated. The selected features were applied to construct a discriminant model through the decision tree method to predict the presence of EBER in situ hybridization. The model was constructed through the classification and regression tree method, which is based on empirical statistical techniques for recursive partition analysis and generate binary decision trees. With decision tree analysis, diagnostic threshold values were iteratively decided to maximize the accuracy of diagnosis while minimizing false-positive EBER in situ hybridization. Unless the Gini index does not improve by > 0.001, the tree does not expand its node. In order to limit overfitting, the minimum number of cases for the node is set to nine parent nodes and three child nodes. Therefore, if the output branch of the decision node contains three or fewer patients in any diagnostic category, no further separations of the tree were required. We assessed the efficacy of the final decision tree through receiver operating characteristic (ROC) curve and specificity, sensitivity, and accuracy values. Statistical analysis was processed using SPSS (IBM Corp., Armonk, NY, USA). P < 0.05 was regarded as statistically significant.
Results
The volumes of the ROIs were calculated as VolumeCC in the shape- and size-based feature. The mean volume of ROI in the training cohort and validation cohorts was 35 cm3 (range = 5–68 cm3) and 26 cm3 (range = 4–60 cm3), respectively.
Nine radiomics features were found significantly related with EBER in situ hybridization status (P < 0.05). Four features were selected from the nine features through backward selection to build the decision tree according to the classification regression tree. The selected features are listed in Table 2, comparing the distribution of the four features according to EBER in situ hybridization status. Statistical significance was identified in four radiomic features between EBER-positive patients and EBER-negative patients (P < 0.05). The Correlation_AllDirection_offset7 (0.00109 ± 0.00098; P = 0.04859), Correlation_ angle135_offset7 (0.00085 ± 0.00092; P = 0.04418), RunLengthNonuniformity_AllDirection_offset1_SD (375.79 ± 457.97; P = 0.02111), HighGreyLevel RunEmphasis_AllDiretion_offset1_SD (8.11 ± 7.39; P = 0.04661) had a significant ability to differentiate positive EBER in situ hybridization from negative EBER in situ hybridization (0.00064 ± 0.00064, 0.00043 ± 0.00052, 147.77 ± 172.72, and 4.93 ± 3.03). The box plot in Fig. 5 shows the differences in the four imaging features.
Comparison of tumor radiomic features parameters between EBV-positive and EBV-negative patients.
Values are given as mean ± SD.
EBV, Epstein–Barr virus.

A box plot comparing the distribution of EBV-positive gastric carcinoma with that of EBV-negative gastric carcinoma. (a) HighGreyLevelRunEmphasis_ AllDiretion_offset1_SD (375.79 ± 457.97; P = 0.02111). (b) Correlation_angle135_offset7 (0.00085 ± 0.00092; P = 0.04418). (c) RunLengthNonuniformity_ AllDirection_offset1_SD (375.79 ± 457.97; P = 0.02111). (d) Correlation_ AllDirection_offset7 (0.00109 ± 0.00098; P = 0.04859). The central line in the box plot indicates the median value of the data. The lower and upper boundary lines of the central box represent the 25% and 75% quartiles. The box indicates the 95% confidence interval.
The final decision tree consisted of seven nodes (Fig. 6). The first node of the decision tree used the Correlation_angle135_offset7 with a threshold value of 0.000585785. Second, the Correlation_angle135_ offset7 and HighGreyLevelRunEmphasis_ AllDirection_offset1_SD were evaluated. Third, the RunLengthNonuniformity_AllDirection_offset1_SD and Correlation_ AllDirection_offset7 were assessed. Finally, the RunLengthNonuniformity_AllDirection_ offset1_SD and Correlation_ angle135_offset7 were assessed. Therefore, the final decision tree consisted of seven decision nodes and six terminal nodes, three of which distinguished positive EBER in situ hybridization and the remaining three identified negative EBER in situ hybridization. The area under the curve (AUC) was 0.87 (95% CI = 0.766–0.975; P < 0.001), showing the model had a relatively high efficacy in differentiating EBER-positive gastric adenocarcinomas from EBER-negative ones. The proposed model demonstrated a fine specificity, sensitivity, and accuracy, which were 85%, 89%, and 87%, respectively. The ROC curve is shown in Fig. 7.

Decision tree for identification of EBER in situ hybridization status in patients with gastric adenocarcinoma using CT imaging features of late arterial phase.

Receiver operating characteristic curve showing the performance of the decision tree model.
Discussion
In the present study, the decision tree analysis was used for the prediction of EBER in situ hybridization status through late arterial phase of CE-CT imaging features to determine the therapeutic strategy and predict the prognosis of gastric adenocarcinoma. Formerly, invasive biopsy or surgeries are required to confirm the status of EBER in situ hybridization in clinical procedures. CE-CT imaging-based radiomics could be utilized as a non-invasive diagnostic modality to overcome these limitations.
Several studies attempted to study CT radiomics characteristics as a biomarker for gastric cancer diagnosis as well as prognosis. Giganti et al. (14) found significant associations between pretreatment CT-based texture analysis and treatment response in gastric carcinoma, with a specificity, sensitivity, and accuracy of 71%, 75%, 74% for entropy, and a specificity sensitivity, and accuracy of 73%, 64%, and 67% for volume. Kim et al. (15) show that preoperative CT texture features can be used as a predictive tool for occult peritoneal carcinomatosis in patients with advanced gastric cancer. The specificity, sensitivity, and AUC of that study were 80.5%, 75.6%, and 0.77, respectively. Liu et al. (12) found CT texture analysis useful for identifying pathological and histological features of gastric cancers; the specificity, sensitivity, and accuracy were 83.3%, 78.7%, and 81.3%, respectively, with an AUC of 0.774. Similarly, Liu et al. (11) reported CT texture analysis of gastric carcinoma demonstrated significant relationships with TNM staging. However, to our knowledge, this is the first study on the relationship between CE-CT radiomics features and EBER in situ hybridization of gastric adenocarcinoma. CT imaging holds the advantages of fewer artifacts and higher spatial resolution, which is suitable for clinical staging and texture feature extraction compared with MRI on the diagnosis of gastric carcinoma. Previous studies have shown different clinical treatment and prognosis of EBER-positive gastric cancer from EBER-negative patients, while still there is no study to determine the relationship between CT imaging characteristics and EBER in situ hybridization status. Therefore, in this preliminary study, the decision tree model built through CT radiomics features can be applied as a non-invasive tool to support clinical decisions. Moreover, this study selected a total of three commercial models of CT machines commonly used in clinical practice. Therefore, the results of the study are more robust, with greater reference for future transformation in clinical practice.
For the prediction of EBER in situ hybridization status, we used a decision tree model, which is able to efficiently handle large, complex datasets without imposing complex parameter structures. The decision tree model is both easy to understand and perform. By applying decision tree analysis based on CT radiomics, we were able to identify the EBER in situ hybridization status simply through related imaging features. The specificity, sensitivity, and accuracy of the decision tree to differentiate EBER-positive and EBER-negative gastric adenocarcinomas are 85%, 89%, and 87%, respectively. Compared with previous studies, the model applied in this study performed better as a biomarker for gastric cancer.
The present study has some limitations. First, prospective studies are needed for patients whose data are not used to build models in the future, and external validation of the proposed models is equally required. Second, the population included in this research is relatively small, so the reliability of the constructed model requires being testified in prospective studies with more cases included. Lastly, radiomic features were merely extracted from late arterial phase CE-CT images in this study. Radiomics analysis can be improved with the involvement of other CE-CT images (such as portal phase images and delayed phase images).
In conclusion, the present study shows that late arterial phase CE-CT imaging has the potential to be a non-invasive and preoperative modality to predict the EBER in situ hybridization status in patients with gastric adenocarcinoma. The proposed model can also assist with deciding on appropriate therapeutic strategies for patients with gastric adenocarcinoma.
Footnotes
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
