Skip to main content

Virtual biopsy using CT radiomics for evaluation of disagreement in pathology between endoscopic biopsy and postoperative specimens in patients with gastric cancer: a dual-energy CT generalizability study



To develop a noninvasive radiomics-based nomogram for identification of disagreement in pathology between endoscopic biopsy and postoperative specimens in gastric cancer (GC).

Materials and methods

This observational study recruited 181 GC patients who underwent pre-treatment computed tomography (CT) and divided them into a training set (n = 112, single-energy CT, SECT), a test set (n = 29, single-energy CT, SECT) and a validation cohort (n = 40, dual-energy CT, DECT). Radiomics signatures (RS) based on five machine learning algorithms were constructed from the venous-phase CT images. AUC and DeLong test were used to evaluate and compare the performance of the RS. We assessed the dual-energy generalization ability of the best RS. An individualized nomogram combined the best RS and clinical variables was developed, and its discrimination, calibration, and clinical usefulness were determined.


RS obtained with support vector machine (SVM) showed promising predictive capability with AUC of 0.91 and 0.83 in the training and test sets, respectively. The AUC of the best RS in the DECT validation cohort (AUC, 0.71) was significantly lower than that of the training set (Delong test, p = 0.035). The clinical-radiomic nomogram accurately predicted pathologic disagreement in the training and test sets, fitting well in the calibration curves. Decision curve analysis confirmed the clinical usefulness of the nomogram.


CT-based radiomics nomogram showed potential as a clinical aid for predicting pathologic disagreement status between biopsy samples and resected specimens in GC. When practicability and stability are considered, the SECT-based radiomics model is not recommended for DECT generalization.

Critical relevance statement

Radiomics can identify disagreement in pathology between endoscopic biopsy and postoperative specimen.

Graphical abstract

Key points

  • Radiomics can identify disagreement in pathology between endoscopic biopsy and postoperative specimen.

  • Multiple machine learning algorithms were applied to construct radiomics models.

  • The SECT-based pathology-specific radiomics signatures cannot be stably generalized to DECT.


Gastric cancer (GC) is the third leading cause of cancer-related mortality globally owing to its high heterogeneity [1]. Endoscopic biopsy is the current standard approach for GC diagnosis, which guides clinicians' decisions regarding the implementation of subsequent treatment for patients with GC. Reliable biopsy results can avoid unnecessary surgery and considerably decrease health care costs. However, endoscopy only captures limited tumor information from a small portion of the tumor tissue, providing an incomplete characterization of the tumor; therefore, biopsy results sometimes deviate from those of postoperative macroscopic pathology specimens [2]. In addition, studies have shown that the increasing popularity of endoscopy has increased the workload of pathologists and affected the accuracy of cancer diagnosis [3, 4]. Owing to these shortcomings, transitional or complementary tools to differentiate between endoscopic biopsy and postoperative specimen diagnosis are warranted.

Radiomics-enabled imaging biomarkers provide insights into the properties of the tumor phenotype that are imperceptible to human eyes and correlate intratumor heterogeneity with clinical outcomes [5, 6]. Therefore, the use of radiomics in cancer has expanded in recent years, triggering many research projects worldwide. Currently, the relationship between biopsy and postoperative specimens has been investigated for GC [7], rectal lesions [8], prostate cancer [9], spinal lesions [10], and skin carcinoma [11]. However, no study has focused on evaluating the pathologic disagreement between the two specimens in GC using a radiomics approach.

In medical imaging, it has been proved that dual-energy computed tomography (DECT) is superior to single-energy CT (SECT) because its spectral information contains unique attenuation properties that improve the visualization of biological processes and tissue characterization [12, 13]. Radiomics has also been successfully applied to the analysis of DECT images and has shown promising diagnostic and prognostic power in tumor research [14,15,16]. Brendlin et al. revealed that DECT radiomics approaches yield a remarkable additive value over SECT radiomics analysis in noninvasively predicting immunotherapy response in patients with stage IV melanoma [12]. Nevertheless, few attempts have been made to apply radiomics models trained on SECT to the validation cohort of DECT, which probably hinders radiomics generalizability in the field of CT imaging.

In this study, we aimed to develop and test SECT-based radiomics nomogram to identify the disagreement frequency between pathologic assessment based on an endoscopic biopsy and a postoperative specimen in GC patients, and to compare the performance of different machine learning algorithms for building radiomics signatures (RS). We also explored the feasibility of generalizing RS trained on SECT cohort to the DECT cohort. We hypothesize that RS based on SECT images at 120 kVp allows validation on 70 keV virtual monochromatic images (VMIs) derived from DECT, given the fact that they have similar photon energy levels and visual equivalence.

Materials and methods

Study population

This single-center observational study involved a retrospective single-energy imaging study for the training and testing of predictive models and a prospective dual-energy imaging validation study to assess the generalizability of the model.

Cohort 1 included patients who underwent SECT between May 2020 and May 2021. Cohort 2 prospectively recruited eligible patients who underwent DECT examination from March 2022 to September 2022. A total of 181 patients were recruited for the present study in accordance with the enrollment criteria (Additional file 1: Appendix. E1). Patients in the retrospective cohort were randomly divided into a training set (n = 112) and a test set (n = 29) at a ratio of 8:2, whereas all patients in the prospective cohort were included in the validation cohort (n = 40). The clinical data, enhanced pretreatment CT images, pathological results of endoscopic biopsies prior to treatment, and gross specimens after gastrectomy of patients with advanced GC who were treated with radical operations at our institutions were collected.

All patients underwent endoscopy and gastrectomy for GC. Both endoscopic biopsy specimens and gross specimens resected from the primary site after surgery were pathologically analyzed to determine the Lauren classification using hematoxylin–eosin staining. Those with consistent Lauren classification of endoscopic biopsies and postoperative specimens were classified into the pathologic agreement group, and those with different Lauren classifications were classified into the disagreement group (Fig. 1). This study was approved by the Institutional Ethics Review Board, and the need for informed consent was waived owing to the observational design.

Fig. 1
figure 1

Typical original CT images, tumor segmentation, endoscopic biopsy and postoperative pathological images of two patients in pathologic disagreement and agreement group

Image acquisition and reconstruction parameters

All patients underwent abdominal dual-phase (arterial and venous)-enhanced CT after preparation for examination. Detailed patient preparation, imaging scheme, and acquisition parameters are summarized in Additional file 1: Appendix E2 & Table S1.

The dual-energy imaging-specific monoenergetic image subtypes were reconstructed at 70 keV energy level using a commercially available workstation (Advantage workstation, Version 4.7, GE Healthcare).

Tumor segmentation and extraction of radiomics features

All CT DICOM images of the venous phase were transferred to prototypical software (Syngo Frontier, Radiomics 1.3.0, Siemens Healthineers, Germany) for tumor segmentation and radiomics feature extraction. A radiologist with 10 years of experience in gastrointestinal radiology performed the tumor segmentation. The details are provided in Additional file 1: Appendix E3. The intraclass correlation coefficients (ICCs) were calculated to assess reproducibility of the extracted radiomic features, and only those features with an ICC greater than 0.75 were retained for further analysis.

Radiomic feature selection and radiomics signature building

All extracted radiomic features from each patient were normalized using the z-score method. Feature selection was carried out according to the following approach: Hierarchical analysis was performed based on Pearson’s correlation analysis, and the redundancy with correlation coefficients > 0.90 was eliminated. Subsequently, we performed feature selection using an analysis of variance (ANOVA) F-test statistic to select the top 30% features ranked by F-value (each feature has individual F-values related to target events).

Based on the final selection of features from the training set, five RS were constructed using different machine learning classifiers: Logistic Regression (LR), Support Vector Machine (SVM), Decision Tree (DT), Stochastic Gradient Descent (SGD), and K-Nearest Neighbors (KNN). The grid search method was used to select the optimal combination of hyperparameter values during training of each classification model.

Best radiomics signature evaluation and its DECT generalization testing

The discriminative power of the five radiomics signatures obtained using the area under the receiver operating characteristic curve (AUC) and the 95% confidence intervals (CI) were computed. Delong test was used to evaluate the statistically significant differences in AUC values: (1) between five RS; (2) each RS in different cohorts. The best RS was defined as having the highest AUC value with no difference between the training and test sets, such RS had excellent discrimination and stability.

For cohort 1, the diagnostic value of the best RS was evaluated for all patients and subgroups defined by clinicopathological factors. Associations between the best RS and pathologic disagreement were assessed using Mann–Whitney U test analysis. Additionally, the performance of the best RS was validated in the DECT cohort to test its dual-energy generalization ability.

Development and validation of the clinical-radiomics nomogram

Univariate analysis was used to assess the clinical variables for discriminating pathologic disagreement status in the training set, and those significant variables (p < 0.05) were included in multivariate logistic regression analysis to determine independent clinical predictors of disagreement status. A predictive model was developed by integrating the selected clinical predictor and the best RS using multivariable logistic regression. Moreover, the predictive model was visualized as a clinically applicable individualized nomogram. The discriminative power of the nomogram was assessed by using AUC value. Decision curve analysis (DCA) was performed to estimate the clinical usefulness of the nomogram by quantifying the net benefits at different threshold probabilities. To quantify the calibration of the nomograms, calibration curves accompanied by the Hosmer–Lemeshow test were plotted.

Statistical analysis

Independent t test or the Wilcoxon rank-sum test was used to compare continuous variables between the two groups, while the chi-squared or the Fisher’s exact test was used to compare categorical variables. The AUCs, sensitivity, specificity, and positive and negative predictive values (PPV&NPV) were computed by using ROC curve analysis. AUC range 0.6–0.7 was considered as an indication of poor classification accuracy, 0.7–0.8 as fair, 0.8–0.9 as good, and 0.9–1.0 as excellent classification accuracy. The optimal cutoff point in the ROC curve was determined using Youden’s index. Moreover, the F1 score was computed for each model because of the imbalance between the two groups. A two-sided p < 0.05 was considered to indicate statistical significance.

Statistical analyses, model construction and evaluations were performed using the Deepwise Multimodal Research Platform version 2.1 (, Beijing Deepwise & League of PHD Technology Co., Ltd, Beijing, China.) and the R software package (version 4.1.2).


Clinicopathological characteristics of the investigated patients

Table 1 lists the detailed clinicopathological characteristics of the patients in the training (n = 112), test (n = 29), and validation (n = 40) sets. Among the 181 patients enrolled in this study, 134 (74.0%) were males, who were aged 31 to 86 years. There were 64 pathologic agreement cases, while 117 were pathologic disagreement cases. Under the diagnostic approach of endoscopy, intestinal-type GC was diagnosed in the largest number of patients (84/181), while mixed-type GC was diagnosed under gross pathology in the largest number of patients (72/181).

Table 1 Characteristics of patients in the training set, test set, and validation cohort

Radiomics feature selection and radiomics signature building

Of the 854 quantitative features extracted from the CT images, 541 features showed an ICC > 0.75 in the reproducibility analysis. After omitting redundancy using the Pearson’s correlation analysis, 158 features from each patient were used for further selection. By applying the F-test, 48 independent radiomic features were determined as predictive features to build the RS. These features included 13 first-order features, 2 shape features, and 33 texture features, as shown in Additional file 1: Table S2.

Table 2 summarizes the predictive performance of the five RS in the training and test sets. Moreover, the ROC curves of the signatures are shown in Fig. 2.

Table 2 Performance of five radiomics signature on the training and test sets
Fig. 2
figure 2

ROC curves for radiomics signatures derived from five machine learning classifiers in (A) training set, and (B) test set, respectively

Best radiomics signature evaluation and its DECT generalization testing

A comparison of the discrimination of the five RS is shown in Additional file 1: Table S3. In the test set, only SVM radiomics signature achieved good prediction accuracy (AUC > 0.80). Furthermore, the SVM signature possesses favorable stability between the training and test sets (Delong test: p = 0.372); thus, it was selected as the best signature for evaluating disagreement status in the Lauren classification. The optimum cutoff value of the SVM signature determined by ROC curve analysis in the training set was 0.391.

The Mann–Whitney U test showed good correlation between the best RS derived from SVM and pathological disagreement status in the training set (p < 0.001) and test set (p = 0.03). Moreover, subgroup analysis showed that the performance of the best radiomics imaging biomarker was not affected by age, sex, tumor location, longest diameter, or Lauren classification determined by endoscopy, indicating its independent diagnostic value in different types of GC populations (Fig. 3). Interestingly, the AUCs for distinguishing the pathologic agreement group and disagreement group in the DECT validation cohort were 0.71 (95% CI:0.535–0.875). The Delong test showed a significant difference between the AUC in the training set and validation cohorts (p = 0.035), indicating that although the performance of the RS trained on SECT imaging was fair, it still could not be stably generalized to DECT imaging in GC. ROC curves and detailed performances of the best RS in the validation cohort are illustrated in Additional file 1: Fig S1 and Table S4.

Fig. 3
figure 3

Subgroup analysis of the best radiomics signature (SVM) for all the SECT cohort patients (n = 141). Subgroup analysis were performed on age (A), sex (B), tumor location (C), longest diameter (D), and Lauren classification determined by endoscopy (E, F)

Development and validation of the clinical-radiomics nomogram

In the training set, the age of the disagreement group was significantly lower than that of the agreement group (p < 0.01). Multivariable analysis identified this clinical variable as an independent predictor of pathological disagreement (p = 0.01) (Additional file 1: Table S5). A clinical-radiomics nomogram was subsequently developed using age and SVM RS (Fig. 4). Notably, given that the SECT-based SVM signature cannot be stably generalized to the DECT validation cohort, we constructed only the nomogram in the SECT cohort. The proposed nomogram showed powerful discriminative ability, with AUC of 0.92 (95% CI: 0.859–0.976) and 0.83 (95% CI: 0.667–0.989) in the training and test sets, respectively (Fig. 5). The optimum cutoff value determined by ROC curve analysis in the training set was 0.38. The sensitivity, specificity, F1 score, NPV, and PPV are summarized in Table 3. The calibration curves (Fig. 6A, B) and DCA (Fig. 6C) confirmed the good calibration and clinical utility of the nomogram.

Fig. 4
figure 4

The nomogram based on best radiomic signature and clinical factor

Fig. 5
figure 5

ROC curves of the nomogram in training and test set (A), and the boxplots of patients’ nomogram scores between agreement and disagreement groups (B)

Table 3 Clinical-radiomics nomogram performance
Fig. 6
figure 6

Calibration curves and decision curve analysis of the nomogram. Calibration curves of the nomogram showed good agreement between the predicted and observed pathologic disagreement probability in both the training (A) and test set (B). The Hosmer–Lemeshow test yielded a nonsignificant statistic (p-value = 0.115, and 0.226), suggesting there is no significant departure. Decision curves analysis (C) for training set and validation set indicated that when the nomogram is used to predict the risk (probability) of disagreement pathological status, patients could obtain better clinical benefits within wide range of risk (probability) threshold in both sets. The gray curve represents the hypothesis that all pathological status were discordant. The black line represents the hypothesis that no disagreement pathological status


In this observational study, we explored the challenging task of identifying GC patients whose Lauren classification was misdiagnosed via endoscopic biopsy, using a radiomics approach. The purpose was to reduce errors in pathological findings caused by unsuccessful biopsy tissue sampling, providing a potential tool for the preoperative diagnosis of GC. Our proposed nomogram, which incorporates radiomics and clinical signatures, successfully stratified biopsy-misclassified patients in the training and test sets.

Two previous studies compared the Lauren classification in matched biopsy and resection specimens of patients with GC and showed that the overall histological diagnostic disagreement between the two specimens reached 17% (65/382) and 26% (26/100), respectively [17, 18]. However, the current study showed a higher proportion of disagreement classification (35.4% [64/181]). To our knowledge, few studies have explored the clinical risk factors for disagreement in pathology, whereas our study innovatively mined disagreement-associated radiomic imaging biomarkers. In recent years, radiomics has proven to be effective in the characterization of GC pathology, including depth of tumor infiltration assessment [19], determination of the Lauren classification subtype [20], evaluation of the tumor immune microenvironment [21], prediction of lymph node metastasis [22], lymphovascular invasion [23], and HER2 status [24], suggesting that radiomics may hold potential for evaluating the pathologic agreement between the two types of specimens in GC. Therefore, we conducted the present study.

Some scholars have used radiomics features originating from multiple types of CT images to construct radiomics models, including multi-phase images [25], multi-material density images [16], and multi-dimensional images (2D images and 3D images) [26]. However, to our knowledge, few studies have been conducted on the construction of models comprising multiple algorithms based on the same set of radiomics features. Zhang et al. adopted three machine learning classifiers to develop CT-based RS to predict EGFR mutation status in lung adenocarcinoma. They found that an optimal model was developed using an SVM algorithm [27]. Mao et al. compared the performance of five machine learning classifiers in differentiating primary liver cancer from metastatic liver cancer. Their results showed that LR outperformed the other classifiers, with an accuracy of 0.843 ± 0.078 [28]. In this study, although the SVM classifier (linear kernel) was evaluated as the best prediction signature, no significant difference was observed between its performance and that of the LR classifier. This is most likely because our feature selection steps effectively select radiomic features that are more sensitive to current clinical tasks, allowing the linear model to perform well.

An attractive innovation of our study is that, in validating the best RS trained on the SECT cohort, we performed external validation of the signature by including a dataset of dual-energy imaging. Although radiomics has been extensively studied in cancer settings, it is not fully understood whether radiomics can be transferred between dual-energy CT and single-energy CT. Theoretically, single-energy CT images at 120 kV have an image quality equivalent to that of dual-energy VMIs at 70 keV. However, in a phantom study, Chen et al. found that the majority of radiomic features were not reproducible between SECT and DECT [29]. As they did not perform actual clinical task validation, we conducted a pilot study. We found that the radiomic model trained on SECT could not be stably generalized to the DECT cohort. Furthermore, in the clinic, the acquisition and processing time of dual-energy CT images are notably higher than those of single-energy CT images. Thus, from a pragmatic point of view, the generalization of the radiomics model between different CT images is not worth promoting.

Age is closely related to the clinicopathologic and molecular features of GC [30]. Lee et al. revealed that age, sex, tumor size, and tumor location were risk factors for pathological discordance in patients with GC [7]. Although the current study only found that age was significant in the univariate analysis (p < 0.05), we also attempted to include the other clinical factors mentioned above in the regression analysis. The final result suggested that age was the only clinical predictor of pathological disagreement. A slight improvement in predictive power was observed when age, a more easy-to-obtain clinical variable, was added to the best RS. We speculate that the slight association between clinicopathological features and pathologic disagreement in the present study could be because of the stronger correlation between intratumoral heterogeneity and pathologic disagreement. Consequently, intratumoral heterogeneity associated with disagreement could be reflected, to a large extent, by the beyond-visual radiomics features extracted from the CT images.

The present study has some limitations. First, because of the retrospective nature of the training and test sets, different scanners were used in the SECT dataset, which may result in some inherent bias. Future studies need to focus on the standardization of radiomics to obtain high-quality data. Second, tumor segmentation in our study was performed on the largest slice of 2D CT images, which might not be sufficiently representative of the whole tumor. However, some scholars have demonstrated that 2D segmentation can avoid more image noise and the interference of effective information, and even 2D radiomics model predictive performance in GC is not inferior to that of 3D radiomics model [19, 31]. Third, this was a single-center study with an insufficient sample size; therefore, the heterogeneity of GC among different regional populations was not considered. We have an ongoing collaboration with other centers to recruit large samples of patients with GC. Additionally, in the comparison of the models, we only analyzed them in terms of predictive performance but neglected to pay attention to how time-consuming it is to develop the models.

In conclusion, we developed a clinical-radiomics nomogram that allows noninvasive evaluation of pathological disagreement status in GC. Such imaging biomarkers may hold promise as transitional tools between endoscopic biopsy and gross pathology. Moreover, the proposed SECT cohort-based radiomics model obtained with SVM classifier could not be stably generalized to DECT cohort.

Availability of data and materials

The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.



Area under the receiver operating characteristic curve


Decision curve analysis


Dual-energy computed tomography


Decision tree


Gastric cancer


K-Nearest Neighbors


Logistic regression


Negative predictive value


Positive predictive value


Receiver operating characteristic


Radiomics signatures


Single-energy computed tomography


Stochastic gradient descent


Support vector machine


Virtual monochromatic images


  1. Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA, Jemal A (2018) Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin 68(6):394–424

    Article  PubMed  Google Scholar 

  2. Okasha HH, Naguib M, El Nady M et al (2017) Role of endoscopic ultrasound and endoscopic-ultrasound-guided fine-needle aspiration in endoscopic biopsy negative gastrointestinal lesions. Endoscopic Ultrasound 6(3):156–161

    Article  PubMed  PubMed Central  Google Scholar 

  3. Noh CK, Lee E, Lee GH et al (2021) Association of intensive endoscopic screening burden with gastric cancer detection. JAMA Netw Open 4(1):e2032542

    Article  PubMed  PubMed Central  Google Scholar 

  4. Metter DM, Colgan TJ, Leung ST, Timmons CF, Park JY (2019) Trends in the US and Canadian pathologist workforces From 2007 to 2017. JAMA Netw Open 2(5):e194337

    Article  PubMed  PubMed Central  Google Scholar 

  5. Yu Y, Tan Y, Xie C et al (2020) Development and validation of a preoperative magnetic resonance imaging radiomics-based signature to predict axillary lymph node metastasis and disease-free survival in patients with early-stage breast cancer. JAMA Netw Open 3(12):e2028086

    Article  PubMed  PubMed Central  Google Scholar 

  6. Bera K, Braman N, Gupta A, Velcheti V, Madabhushi A (2022) Predicting cancer outcomes with radiomics and artificial intelligence in radiology. Nat Rev Clin Oncol 19(2):132–146

    Article  CAS  PubMed  Google Scholar 

  7. Lee IS, Park YS, Lee JH et al (2013) Pathologic discordance of differentiation between endoscopic biopsy and postoperative specimen in mucosal gastric adenocarcinomas. Ann Surg Oncol 20(13):4231–4237

    Article  PubMed  Google Scholar 

  8. Liu M, Lu ZH, Wang QX et al (2019) Diagnostic value, safety, and histopathologic discrepancy risk factors for endoscopic forceps biopsy and transrectal ultrasound-guided core needle biopsy in rectum lesions. Ann Transl Med 7(21):607

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  9. Yıldızlı ÖO, Üntan İ, Demirci D (2021) What is the consistency between the results of needle biopsy and prostatectomy specimen pathology results? A pilot study. Turk J Med Sci 51(3):1360–1364

    Article  PubMed  PubMed Central  Google Scholar 

  10. Liang Y, Liu P, Jiang LB et al (2019) Value of CT-guided core needle biopsy in diagnosing spinal lesions: a comparison study. Orthop Surg 11(1):60–65

    Article  PubMed  PubMed Central  Google Scholar 

  11. Stewart N, Rodríguez Bandera AI, Lee S (2021) Histopathological discrepancy between biopsy and Mohs micrographic surgery in keratinocyte carcinoma: A 464 case series in an Australian population. Australas J Dermatol 62(1):41–46

    Article  PubMed  Google Scholar 

  12. Brendlin AS, Peisen F, Almansour H et al (2021) A Machine learning model trained on dual-energy CT radiomics significantly improves immunotherapy response prediction for patients with stage IV melanoma. J Immunother Cancer 9(11):e003261

    Article  PubMed  PubMed Central  Google Scholar 

  13. Nagayama Y, Inoue T, Oda S et al (2020) Adrenal adenomas versus metastases: diagnostic performance of dual-energy spectral CT virtual noncontrast imaging and iodine maps. Radiology 296(2):324–332

    Article  PubMed  Google Scholar 

  14. Choe J, Lee SM, Do KH et al (2019) Prognostic value of radiomic analysis of iodine overlay maps from dual-energy computed tomography in patients with resectable lung cancer. Eur Radiol 29(2):915–923

    Article  PubMed  Google Scholar 

  15. Li J, Dong D, Fang M et al (2020) Dual-energy CT-based deep learning radiomics can improve lymph node metastasis risk prediction for gastric cancer. Eur Radiol 30(4):2324–2333

    Article  PubMed  Google Scholar 

  16. Liu YY, Zhang H, Wang L et al (2021) Predicting response to systemic chemotherapy for advanced gastric cancer using pre-treatment dual-energy CT radiomics: a pilot study. Front Oncol 11:740732

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  17. Jónasson L, Hallgrímsson J, Olafsdóttir G (1994) Gastric carcinoma: correlation of diagnosis based on biopsies and resection specimens with reference to the Laurén classification. APMIS 102(9):711–715

    Article  PubMed  Google Scholar 

  18. Flucke U, Mönig SP, Baldus SE et al (2002) Differences between biopsy- or specimen-related Laurén and World Health Organization classification in gastric cancer. World J Surg 26(2):137–140

    Article  PubMed  Google Scholar 

  19. Meng L, Dong D, Chen X et al (2021) 2D and 3D CT radiomic features performance comparison in characterization of gastric cancer: a multi-center study. IEEE J Biomed Health Inform 25(3):755–763

    Article  PubMed  Google Scholar 

  20. Wang XX, Ding Y, Wang SW et al (2020) Intratumoral and peritumoral radiomics analysis for preoperative Lauren classification in gastric cancer. Cancer Imag 20(1):83

    Article  Google Scholar 

  21. Jiang Y, Wang H, Wu J et al (2020) Noninvasive imaging evaluation of tumor immune microenvironment to predict outcomes in gastric cancer. Ann Oncol 31(6):760–768

    Article  CAS  PubMed  Google Scholar 

  22. Wang Y, Liu W, Yu Y et al (2020) CT radiomics nomogram for the preoperative prediction of lymph node metastasis in gastric cancer. Eur Radiol 30(2):976–986

    Article  PubMed  Google Scholar 

  23. Chen X, Yang Z, Yang J et al (2020) Radiomics analysis of contrast-enhanced CT predicts lymphovascular invasion and disease outcome in gastric cancer: a preliminary study. Cancer Imag 20(1):24

    Article  Google Scholar 

  24. Wang Y, Yu Y, Han W et al (2021) CT Radiomics for distinction of human epidermal growth factor receptor 2 negative gastric cancer. Acad Radiol 28(3):e86–e92

    Article  PubMed  Google Scholar 

  25. Chen M, Yin F, Yu Y, Zhang H, Wen G (2021) CT-based multi-phase radiomic models for differentiating clear cell renal cell carcinoma. Cancer Imag 21(1):42

    Article  Google Scholar 

  26. Mingzhu L, Yaqiong G, Mengru L, Wei W (2021) Prediction of BRCA gene mutation status in epithelial ovarian cancer by radiomics models based on 2D and 3D CT images. BMC Med Imag 21(1):180

    Article  Google Scholar 

  27. Zhang G, Cao Y, Zhang J et al (2021) Predicting EGFR mutation status in lung adenocarcinoma: development and validation of a computed tomography-based radiomics signature. Am J Cancer Res 11(2):546–560

    CAS  PubMed  PubMed Central  Google Scholar 

  28. Mao B, Ma J, Duan S, Xia Y, Tao Y, Zhang L (2021) Preoperative classification of primary and metastatic liver cancer via machine learning-based ultrasound radiomics. Eur Radiol 31(7):4576–4586

    Article  PubMed  Google Scholar 

  29. Chen Y, Zhong J, Wang L et al (2022) Robustness of CT radiomics features: consistency within and between single-energy CT and dual-energy CT. Eur Radiol 32(8):5480–5490

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  30. Seo JY, Jin EH, Jo HJ et al (2015) Clinicopathologic and molecular features associated with patient age in gastric cancer. World J Gastroenterol 21(22):6905–6913

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  31. Zhao H, Li W, Lyu P et al (2021) TCGA-TCIA-based CT radiomics study for noninvasively predicting epstein-barr virus status in gastric cancer. AJR Am J Roentgenol 217(1):124–134

    Article  PubMed  Google Scholar 

Download references


This study was supported by National Natural Science Foundation of China (Grant no. 81971615).

Author information

Authors and Affiliations



Conceptualization/Formal analysis/Funding acquisition/Project administration/Resources/Writing—review were contributed by JG; Conceptualization/Data curation/Formal analysis/Investigation/Methodology were contributed by YL, SZ and ZW; Data curation/Software/Visualization/Methodology were contributed by HL, XC and CH; Data curation/Investigation were contributed by HL, MY and XX; Visualization was contributed by CLL and CCL. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Jianbo Gao.

Ethics declarations

Ethics approval and consent to participate

This study was approved by the Institutional Review Board of Zhengzhou University, and written informed consent was waived because of the observational nature.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1.

Appendix E1: Study population. Appendix E2: Image acquisition and reconstruction parameters. Appendix E3: Tumor segmentation and extraction of radiomic features. Supplementary Table S1: Imaging Scheme and Acquisition Parameters. Supplementary Table S2: Final retained 48 independent radiomics features. Supplementary Table S3.1: Comparison of discrimination (AUC value) of five models in training set. Supplementary Table S3.2: Comparison of discrimination (AUC value) of five models in test set. Supplementary Table S4: Detailed Performance of the SVM model in the DECT validation cohort. Supplementary Table S5: Multivariate Logistic Regression Analysis for Significant Clinical Variable. Supplementary Figure S1. ROC curves of the SVM model in the DECT validation cohort.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit

Reprints and Permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Liu, Y., Zhao, S., Wu, Z. et al. Virtual biopsy using CT radiomics for evaluation of disagreement in pathology between endoscopic biopsy and postoperative specimens in patients with gastric cancer: a dual-energy CT generalizability study. Insights Imaging 14, 118 (2023).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI:


  • Single-energy CT imaging
  • Dual-energy CT imaging
  • Gastric cancer
  • Radiomics