A virtual biopsy study of microsatellite instability in gastric cancer based on deep learning radiomics

Jiang, Zinian; Xie, Wentao; Zhou, Xiaoming; Pan, Wenjun; Jiang, Sheng; Zhang, Xianxiang; Zhang, Maoshen; Zhang, Zhenqi; Lu, Yun; Wang, Dongsheng

doi:10.1186/s13244-023-01438-1

Original Article
Open access
Published: 07 June 2023

A virtual biopsy study of microsatellite instability in gastric cancer based on deep learning radiomics

Zinian Jiang¹^na1,
Wentao Xie²^na1,
Xiaoming Zhou³,
Wenjun Pan¹,
Sheng Jiang¹,
Xianxiang Zhang²,
Maoshen Zhang²,
Zhenqi Zhang⁴,
Yun Lu^1,2,5 &
…
Dongsheng Wang ORCID: orcid.org/0000-0002-0314-251X^1,2

Insights into Imaging volume 14, Article number: 104 (2023) Cite this article

2246 Accesses
2 Citations
2 Altmetric
Metrics details

Abstract

Objectives

This study aims to develop and validate a virtual biopsy model to predict microsatellite instability (MSI) status in preoperative gastric cancer (GC) patients based on clinical information and the radiomics of deep learning algorithms.

Methods

A total of 223 GC patients with MSI status detected by postoperative immunohistochemical staining (IHC) were retrospectively recruited and randomly assigned to the training (n = 167) and testing (n = 56) sets in a 3:1 ratio. In the training set, 982 high-throughput radiomic features were extracted from preoperative abdominal dynamic contrast-enhanced CT (CECT) and screened. According to the deep learning multilayer perceptron (MLP), 15 optimal features were optimized to establish the radiomic feature score (Rad-score), and LASSO regression was used to screen out clinically independent predictors. Based on logistic regression, the Rad-score and clinically independent predictors were integrated to build the clinical radiomics model and visualized as a nomogram and independently verified in the testing set. The performance and clinical applicability of hybrid model in identifying MSI status were evaluated by the area under the receiver operating characteristic (AUC) curve, calibration curve, and decision curve (DCA).

Results

The AUCs of the clinical image model in training set and testing set were 0.883 [95% CI: 0.822–0.945] and 0.802 [95% CI: 0.666–0.937], respectively. This hybrid model showed good consistency in the calibration curve and clinical applicability in the DCA curve, respectively.

Conclusions

Using preoperative imaging and clinical information, we developed a deep-learning-based radiomics model for the non-invasive evaluation of MSI in GC patients. This model maybe can potentially support clinical treatment decision making for GC patients.

Graphical abstract

Key points

MSI is an important biomarker for immunotherapy in gastric cancer.
Quantitative radiomics features were closely related to MSI in gastric cancer.
Combining clinical and radiomics features with deep learning could evaluate MSI noninvasively.

Introduction

Gastric cancer (GC) is a highly heterogeneous malignancy caused by multiple factors and is a global public health problem. The incidence of GC varies according to geographical location and is particularly high in Asia (age-standardized [global] incidence: 32.5 per 100,000 men; 13.2 per 100,000 women) [1, 2]. MSI-positive GC is one of the major molecular subtypes of GC as defined by the Cancer Genome Atlas Group and accounts for 10–22% of all GC patients [3]. The deletion of any of the mismatch gene repair proteins (MLH1, PMS2, MSH2, and MSH6) leads to microsatellite instability (MSI/MSI-H) [4, 5]. In recent years, immune checkpoint inhibitors have shown great potential in the treatment of progressive GC [6]. MSI is an important predictive biomarker for evaluating the effect of anti-programmed cell death-1 (PD-1) immunotherapy in GC patients. Several clinical trials have confirmed that the objective response rate and survival were significantly better in the MSI group than in the microsatellite stabilization (MSS) group in anti-PD-1 immunotherapy for advanced GC [7,8,9]. However, there is a large individual variation in the efficacy of anti-PD-1 therapy [10], so it is important to select patients who are most likely to benefit. Currently, immunohistochemical staining (IHC) and PCR molecular testing are mainly used to detect MSI expression levels [5, 11, 12]. Given the spatial heterogeneity of MSI expression, a small piece of tissue obtained by invasive biopsy may not be sufficiently representative of the entire tumor region [13], thus affecting the assessment of MMR protein expression. Although universal testing for MSI has been recommended in the NCCN guidelines for patients with GC [14], many patients are not tested due to the invasive, time-consuming and expensive nature of tissue biopsy. Therefore, there is an increasing need to develop a non-invasive method for the holistic assessment of MSI expression in GC.

With the development of computer-aided medicine, accurate assessment of tumor’s pathological features has been achieved by combining radiomics and deep learning to extract and analyze quantitative radiological features, known as 'virtual biopsies,' which provide a reference standard for conventional biopsies [15,16,17]. Several studies have demonstrated the potential role of deep-learning-based radiomics in predicting lymph node metastasis [18] or response to neoadjuvant chemotherapy [19, 20] in GC, so this study explored the non-invasive assessment of GC biomarkers based on such methods using preoperative computed enhanced tomography (CECT) images and clinical data. It was also visualized as a nomogram to evaluate the potential application as a virtual biopsy tool in clinical auxiliary diagnosis.

Methods

Patient selection and clinical variables collection

This study was approved by the ethics committee of our medical center, and the requirement for informed consent was waived due to the retrospective nature of this study. A total of 223 patients were enrolled in this study with MSI confirmed by postoperative IHC of GC by searching the medical database of our hospital from January 2020 to March 2022, including 182 MSS patients and 41 MSI patients. All samples were divided into a training set (n = 167) and testing set (n = 56) according to the 3:1 random allocation principle. The inclusion criteria were as follows: 1) aged 18–80 years; 2) first gastric cancer surgery; 3) histological type adenocarcinoma; 4) dynamic contrast-enhanced CT of the upper abdomen within 2 weeks before surgery; 5) tumor morphology identified in medical images; and 6) no neoadjuvant chemoradiotherapy performed before surgery. The details of cohort inclusion are shown in Fig. 1.

In this study, we collected baseline clinicopathological data and laboratory parameters measured by venous blood collection within 1 week before surgery, as detailed in Additional file 1: Table 1. According to the clinical application standards of our hospital, the tumor location was reported by professional radiologists after reading the preoperative CECT. According to the eighth edition of the AJCC staging report, the clinical T stage of the tumor was determined.

Table 1 Univariate logistic regression analysis of training and testing sets characteristics

Full size table

MSI status definition and revaluation

In this study, the MSI status was obtained by immunohistochemistry (IHC), and the immunohistochemical sections were formalin-fixed paraffin embedding with a thickness of 2–3 µm, passed through a standard streptavidin–biotin–peroxidase procedure, and stained by an automatic immunohistochemical staining machine (Leica Bond-Max, Leica Biosystems). Two pathology experts in the field of gastrointestinal tumors (with 8 years and 10 years of work experience, respectively) reanalyzed the expression of four MMR proteins, MLH1, PMS2, MSH2, and MSH6, in IHC sections to characterize MSI. They were unaware of the clinical and pathological information of the samples in advance, and if the results were different, they reached an agreement through consultation. Loss of expression of any MMR protein was defined as defects of mismatch repair and divided into the MSI/MSI-H group; expression of all MMR proteins was defined as professional MMR and divided into the MSS/MSI-L group.

Protocol of CECT image acquisition

All patients underwent CECT with a 64-slice multislice CT scanner, which covered the entire upper abdomen. The specific parameters of the scanner are detailed in Additional file 1: Table 2. To ensure an empty stomach, all patients fasted for 8 h. Before the examination, they drank more than 800 mL of purified water to fill the stomach cavity. During the examination, the patients were placed in a supine position and asked to hold their breath. A nonionic contrast agent (Iohexol-350 Injection; Starry Pharmaceutical) was pumped into the antecubital vein through an automated high-pressure pump injection system (Medrad Vistron Plus, Bayer Healthcare) at a dose of 1.5 mL/kg, the injection speed was 3 mL/s, and the portal venous phase CECT image was acquired 60 s after the contrast agent was injected. All CT images were reconstructed with an axial thickness of 5 mm. Then, the DICOM format image files were retrieved and exported from the image archiving and communication system and medical imaging workstation and stored for further image segmentation and analysis.

Table 2 Selected optimal features in radiomics

Full size table

Tumor segmentation

All CT images were independently reviewed by two radiologists with 5 years (reader 1) and 8 years (reader 2) of experience in gastrointestinal oncology radiology who were blinded to the IHC results. If they disagree on the diagnosis, the final result will be decided by a chief radiologist with more than 20 years of experience in diagnosing abdominal tumors. Using the 3D Slicer software (4.11, www.slicer.org) in reading the CECT image, set to the abdomen window (width: 350 HU; horizontal: 40 HU). Afterward, Reader 1 and Reader 2 utilized the Segmentation Wizard plugin in 3D Slicer to achieve semi-automatic segmentation of tumor boundaries in all axial portal CECT to obtain the region of interest (ROI). After full-slice annotation, a three-dimensional (3D) image was generated to directly reflect the ROI shape, as shown in Fig. 2. During the labeling process, intragastric air, surrounding adipose tissue, areas of tumor necrosis, and perigastric lymph nodes were carefully excluded. Before feature extraction, all images are z-scores normalized separately and resampled at a pixel spacing of 1*1*1 mm.

Radiomic feature extraction

PyRadiomics (http://www.radiomics.io/) was used to extract radiomics features from all segmented CT images [21]. The extracted feature types included: 1) shape-based (SB); 2) first-order statistics (FOS); 3) gray-level co-occurrence matrix (GLCM); 4) gray-level run-length matrix (GLRLM); 5) gray-level size zone matrix (GLSZM); and 6) gray-level correlation matrix (GLDM), which is consistent with previous studies [22, 23]. To reduce the effect of overfitting of radiomic features on the performance of prediction models, we applied intraclass correlation coefficients (ICCs) to ensure the robustness of radiomic features, and 30 randomly selected CT images from the original reader 1 and reader 2 annotated CT images were used to assess interobserver ICC. After a 4-week interval, reader 1 redrew the ROI of the drawn random sample, extracted the radiomic features with the same process, and calculated the radiomic features of the two repeats of reader 1 to evaluate the intraobserver ICC. Usually, ICC > 0.75 is defined as good consistency, so we discard the features of intragroup and intergroup ICC < 0.75 to ensure robustness.

To remove redundant features, we employ variance and correlation filters. The specific steps are as follows: If the normalized standard deviation of a feature is less than 0.1, the feature will be discarded because it is invalid; at the same time, the Pearson correlation coefficient of each pair of features is calculated. If the Pearson correlation coefficient between two features is greater than 0.9, the two features are highly similar, excluding one of the two features. All the features were standardized with min–max normalization in both cohorts using the min and max deviation of the training cohort feature data. To make the screened features more predictive, we used a deep learning algorithm, multilayer perceptron (MLP), to quantify the optimal radiomic features and established a radiomic feature score (Rad-score) for each patient (see Fig. 3).

Development of prediction models and nomogram

To explore whether there is an additional gain in clinical information for predicting MSI status, we performed a univariate logistic regression analysis on clinical data. Then, clinical risk predictors were screened based on LASSO regression, and three independent MSI prediction models (clinical model, radiomics model, and clinical imaging model) were developed using the Rad-score and clinical risk predictors in training set and independently verified in the testing set. Model prediction performance was assessed by the area under the receiver operating characteristic (ROC) curve (AUC), specificity, and sensitivity. Moreover, we visualized the hybrid model as a nomogram using logistic regression to increase the clinical application value. Nomogram performance was evaluated using a calibration curve, and the clinical utility of the nomogram was evaluated by decision curve analysis (DCA) to calculate the maximization of net gain within range thresholds.

Statistical analysis

IBM SPSS Statistics (26.0; IBM Corp.) was used to conduct a univariate logistic regression analysis, the Chi-square test was used to analyze categorical variables, and the two-sided independent samples t test was used for continuous variables subject to normal distribution. Continuous variables that did not have a normal distribution were analyzed using the Mann‒Whitney U test. A two-sided p value of < 0.05 was considered statistically significant. MLP model development, prediction models, nomogram construction, and performance evaluation were all developed through the R (version 3.6.1; http://www.R-project.org) software package.

Results

Clinical and pathological characteristics

Univariate logistic regression analysis was performed on clinical and pathological characteristics, and details of the relationship between patient characteristics and MSI status are shown in Table 1. The results showed that there were significant differences between MSI and clinical T stage (p value < 0.05) and degree of differentiation (p value < 0.05) in the two sets. In the training set, there were significant differences in MSI status in patients of different ages (p value = 0.016) and of either sex (p value = 0.015). However, tumor location and carcinoembryonic antigen (CEA) level did not show a significant correlation with MSI status in either cohort.

Radiomics feature selection

A total of 982 radiomic features were extracted from each ROI, and 365 features with ICC ≤ 0.75 were excluded after the consistency detection of the features. After that, variance and correlation filters were used to eliminate redundant features, and 197 radiomic features were retained. Finally, using LASSO regression with tenfold cross-validation, 15 optimal radiomic features were screened (Fig. 4A, B), including two original features, one 3D skewness feature, four first-order wavelet features, two GLDM features, three GLSZM features, two GLCM features and one GLRM feature (Table 2).

Construction of the radiomics signature

In the MLP model developed based on a deep learning algorithm, the Rad-score represents quantified radiomic features. We then investigated the distribution of each Rad-score in the two sets. The average Rad-score in the MSI group was significantly higher than that in the MSS group (Fig. 4C, D), which was demonstrated in the testing set. The AUC of the radiomics model based on the independent Rad-score was 0.856 [95% confidence interval (CI): 0.792–0.919] in the training set and 0.753 [95% CI: 0.606–0.901] in the testing set (Fig. 5A, B).

Evaluation of the models and nomogram

In univariate analysis, age, sex, clinical T stage, and degree of differentiation were found to be closely related to MSI status, and subsequent LASSO regression analysis identified age, sex, clinical T stage, and Rad-score as independent predictors of MSI (Fig. 5C, D). Ultimately, we constructed three prediction models (clinical model, radiomics model, and clinical image model) based on the above independent predictors and visualized the clinical imaging model as a nomogram (Fig. 6).

To evaluate the predictive performance of the three models, we plotted the ROC curves and calculated the AUC for comparison (Fig. 7A, B). The AUCs of the clinical models in the training and testing sets were 0.725 [95% CI: 0.611–0.840] and 0.738 [95% CI: 0.586–0.889], respectively. As shown in Fig. 7C, the calibration curve for the nomogram shows good agreement between the observed and predicted results. To assess the clinical applicability of the nomogram, we performed a clinical decision curve analysis on the nomogram (Fig. 7D). At a threshold probability of 20%–70%, the nomogram model showed a greater net benefit than treating all patients or no treatment compared to the stand-alone radiomics model and at the 0%–20% and 70%–100% ranges with similar net gains.

Compared with separate clinical and radiomics models, the clinical image model combining preoperative clinical features and Rad-score showed better predictive performance in classifying MSI and MSS status; AUCs of 0.883 [95% CI: 0.822–0.945] in training set and 0.802 [95% CI: 0.666–0.937] in testing set were achieved. The prediction performance details of the three prediction models are shown in Table 3.

Table 3 Comparison of the prediction performance of three models for MSI status

Full size table

Discussion

We used clinical information and pretreatment CECT images of 223 GC patients from our medical center to develop a virtual biopsy model supported by deep learning algorithms. To the best of our knowledge, we are the first to apply deep learning algorithms to the optimization of radiomics feature building in the GC MSI radiomics. Clinical imaging models supported by MLP effectively improve the ability to identify MSI/MSI-H in GC patients (AUC: training set 0.883, testing set 0.802). Accurate MSI identification is critical for individualized systemic treatment of GC patients, which can benefit patients receiving anti-PD-1 immunotherapy and improve GC patient survival [9, 24, 25].

All valid samples in a unit period were covered in this study, avoiding selection bias as much as possible. The incidence of MSI in GC patients in the study was 18.4%, which is consistent with the current Global epidemiology on GC-related MSI (10–22%) [3, 5, 26]. LASSO regression analysis of clinical baseline data found that age, sex, and clinical T stage were independent clinical risk factors closely related to MSI expression in GC patients. Considering the relationship between advanced age [27,28,29,30] and female sex [28, 30], GC patients and MSI are consistent with previous related studies. Current evidence suggests that the MSI-H phenotype in patients with sporadic GC is closely associated with hypermethylation of the promoter CpG island causing silencing of the hMLH1 gene, which leads to progressive loss of MLH1 protein expression [31,32,33]. Similar findings were found in our recruited patients, with more than 83% (n = 35) of MSI-expressing patients having a deletion in the expression of the mismatch repair protein MLH1. Interestingly, researchers such as Nakajima et al. [34] and Kim et al. [35] found that the methylation of the mismatch repair gene hMLH1 was age dependent, and its incidence was positively correlated with age. hMLH1 methylation is more common in elderly gastric cancer patients, which seems to explain why MSI/MSI-H was more common in older GC patients in our study. Compared with MSS/MSI-L, GC with the MSI/MSI-H phenotype is less aggressive, representing a better prognosis in the early stage of GC [27, 36]. At the molecular level, recent studies have found that changes in the genetic and epigenetic characteristics of the GC genome often occur in the early stages of the tumor [37, 38], which supports that hMLH1 promoter methylation-induced MSI-H is more likely to be seen in the early stage (TNM stages I–II) presumed in GC patients, thus defining MSI as an early molecular event of GC. This is also confirmed in the reports of Polom et al. [30] and Jahng et al. [39]. However, we did not observe the previously demonstrated significant relationship between tumor location [29, 30, 36] and MSI, which may be related to differences in the patient population or regional prevalence of our recruitment. Taken together, these potentially characteristic clinical factors reflect higher histological heterogeneity in MSI tumors.

The development of radiomics technology makes it possible to capture the heterogeneity of tumor molecules in clinical images, which can provide objective and quantitative support for cancer molecular biological detection and personalized treatment [40]. We finally selected 15 radiomics features (including 1 first-order feature, 1 shape-based feature, and 13 filtered features) from the filtered and unfiltered images to build the model. The first-order feature describes the distribution of voxel intensities in CT images, and the shape-based feature is a description of the two-dimensional tumor regions’ size and shape. The filtered features are first-order statistical features and texture features extracted from the filtered images and reconstructed by transforming the Laplacian of the Gaussian spatial bandpass filter or wavelet filter, where the texture features include the gray-level co-occurrence matrix (GLCM), gray-level run-length matrix (GLRLM), gray-level size zone matrix (GLSZM), and gray-level dependence matrix (GLDM). With these two different filtering strategies, the specific structure of the original image is enhanced. These features reflect the differences in spatial morphology, pixel intensity, and texture of tumor regions and may represent spatial and temporal heterogeneity in tumor tissue and characteristic biological phenotypes [23, 41]. This may explain the ability of the Rad-score established in this study to exhibit differentiated GC MSI in both cohorts. Additionally, accurate and efficient tumor region segmentation methods are important to ensure the quality of quantitative image features. It has been confirmed that 3D Slicer has better segmentation algorithms and higher segmentation accuracy than other image segmentation software while being more accessible as free open-source software [42]. Moreover, the radiomics features extracted after image segmentation using 3D Slicer have higher robustness and are, therefore, recommended for use in high-throughput data mining efforts for medical oncology imaging [43].

The exploration of radiomics in the field of GC biomarkers has started. Li et al. [44] used radiomics to construct a predictive model for detecting GC HER-2 expression, with an AUC of 0.799 [95% CI: 0.704 − 0.894]. Based on the information of 189 patients, Liang et al. [45] first constructed a predictive model based on logistic regression analysis to explore the feasibility of predicting GC-related MSI status, and the AUC was 0.8228 [95% CI: 0.7355–0.9101]. However, traditional radiomic method brings challenges in image segmentation, standardization, acquisition, and reconstruction. As an emerging means of quantitative image analysis, deep learning can optimize such limitations and improve the accuracy and reliability of prediction models [16, 46, 47]. The combination of deep learning and radiomics has shown promising results [18,19,20, 48]. In this study, we first attempted to apply deep learning algorithms to the calculation and reconstruction of GC-related MSI radiomic features. Compared with previous [45] research, our results were encouraging and obtained a higher AUC (0.883 and 0.802). This may be attributed in the optimization of radiomic features by deep learning algorithm, the increase in sample size, and the level of image segmentation in our study. Additionally, we tried to add pathological features to the combined model, but the final result did not significantly improve the model's predictive performance, reflecting the independent value of pretreatment radiomics in predicting MSI status in GC patients. To increase clinical practicability, we visualized the clinical image model into a nomogram based on logistic regression to generate the prediction probability of MSI so that it could be used as a virtual biopsy tool to support clinical medical decision making.

It should be noted that our study also has limitations. First, this is a single-center retrospective study, and inevitably, there is a patient selection bias. Although we tried to include a relatively more number of cases (n = 223), the study sample is still small, considering the high incidence of gastric cancer in Asia, which may affect the generalizability of the model. Second, due to the inherent black box property of machine learning, the process between model data input and output is difficult to interpret, and this lack of transparency has implications for clinical practice, so we plan to apply more transparent and interpretable medical algorithms in future [49, 50]. Finally, we also noted the place of dual-energy CT (DECT) in radiomics, and in future, we plan to collect cases in DECT centers to study the benefit of DECT in GC MSI virtual biopsy studies.

Availability of data and materials

The datasets analyzed during the current study are available from the corresponding author on reasonable request.

Abbreviations

95% CI:: 95% Confidence interval
AUC:: Area under the curve/area under the receiver operating characteristic
CA19-9:: Carbohydrate antigen 19–9
CEA:: Carcinoembryonic antigen
CECT:: Contrast-enhanced CT
DCA:: Decision curve analysis
DECT:: Dual-energy CT
GC:: Gastric cancer
ICCs:: Intraclass correlation coefficients
IHC:: Immunohistochemical staining/immunohistochemistry
LASSO:: Least absolute shrinkage and selection operator
MLP:: Multilayer perceptron
MMR:: Mismatch repair
MSI:: Microsatellite instability
MSS:: Microsatellite stability
PD-1:: Programmed cell death-1
Rad-score:: Radiomic score
ROC:: Operating characteristic curve
ROI:: Regions of interest
FOS:: First-order statistics
GLCM:: Gray-level co-occurrence matrix
GLDM:: Gray-level dependence matrix
GLRLM:: Gray-level run-length matrix
GLSZM:: Gray-level size zone matrix
SB:: Shape-based

References

Cao W, Chen HD, Yu YW, Li N, Chen WQ (2021) Changing profiles of cancer burden worldwide and in China: a secondary analysis of the global cancer statistics 2020. Chin Med J 134(7):783–791
Article PubMed PubMed Central Google Scholar
Arnold M, Abnet CC, Neale RE et al (2020) Global burden of 5 major types of gastrointestinal cancer. Gastroenterology 159(1):335–349
Article PubMed Google Scholar
Comprehensive molecular characterization of gastric adenocarcinoma (2014) Nature 513(7517):202–209
Article Google Scholar
He Y, Zhang L, Zhou R, Wang Y, Chen H (2022) The role of DNA mismatch repair in immunotherapy of human cancer. Int J Biol Sci 18(7):2821–2832
Article CAS PubMed PubMed Central Google Scholar
Baretti M, Le DT (2018) DNA mismatch repair in cancer. Pharmacol Ther 189:45–62
Article CAS PubMed Google Scholar
Li K, Zhang A, Li X, Zhang H, Zhao L (2021) Advances in clinical immunotherapy for gastric cancer. Biochim et Biophysi Acta BBA Rev Cancer. 1876(2):188615
Article CAS Google Scholar
Chao J, Fuchs CS, Shitara K et al (2021) Assessment of pembrolizumab therapy for the treatment of microsatellite instability–high gastric or gastroesophageal junction cancer among patients in the KEYNOTE-059, KEYNOTE-061, and KEYNOTE-062 clinical trials. JAMA Oncol 7(6):895–902
Article PubMed PubMed Central Google Scholar
Marabelle A, Le DT, Ascierto PA et al (2020) Efficacy of pembrolizumab in patients with noncolorectal high microsatellite instability/mismatch repair–deficient cancer: results from the phase II KEYNOTE-158 study. J Clin Oncol 38(1):1–10
Article CAS PubMed Google Scholar
Zhao P, Li L, Jiang X, Li Q (2019) Mismatch repair deficiency/microsatellite instability-high as a predictor for anti-PD-1/PD-L1 immunotherapy efficacy. J Hematol Oncol 12(1):54
Article PubMed PubMed Central Google Scholar
Hegde PS, Chen DS (2020) Top 10 challenges in cancer immunotherapy. Immunity 52(1):17–35
Article CAS PubMed Google Scholar
Zito Marino F, Amato M, Ronchi A (2022) Microsatellite status detection in gastrointestinal cancers: PCR/NGS Is mandatory in negative/patchy MMR immunohistochemistry. Cancers 14(9):2204
Article CAS PubMed PubMed Central Google Scholar
Luchini C, Bibeau F, Ligtenberg MJ et al (2019) ESMO recommendations on microsatellite instability testing for immunotherapy in cancer, and its relationship with PD-1/PD-L1 expression and tumour mutational burden: a systematic review-based approach. Ann Oncol 30(8):1232–1243
Article CAS PubMed Google Scholar
Lambert R (1999) Diagnosis of esophagogastric tumors: a trend toward virtual biopsy. Endoscopy 31(1):38–46
Article CAS PubMed Google Scholar
Ajani JA, D’Amico TA, Bentrem DJ et al (2022) Gastric cancer, version 22022, NCCN clinical practice guidelines in oncology. J Natl Compr Cancer Netw 20(2):167–192
Article CAS Google Scholar
Gillies RJ, Kinahan PE, Hricak H (2016) Radiomics: images are more than pictures. They Are Data Radiol 278(2):563–577
Google Scholar
Napel S, Mu W, Jardim-Perassi BV, Aerts HJ, Gillies RJ (2018) Quantitative imaging of cancer in the postgenomic era: radio (geno) mics, deep learning, and habitats. Cancer 124(24):4633–4649
Article PubMed Google Scholar
Murray JM, Wiegand B, Hadaschik B, Herrmann K, Kleesiek J (2022) Virtual biopsy: just an AI software or a medical procedure? J Nucl Med 63(4):511–513
Article PubMed PubMed Central Google Scholar
Dong D, Fang MJ, Tang L et al (2020) Deep learning radiomic nomogram can predict the number of lymph node metastasis in locally advanced gastric cancer: an international multicenter study. Ann Oncol 31(7):912–920
Article CAS PubMed Google Scholar
Cui Y, Zhang J, Li Z et al (2022) A CT-based deep learning radiomics nomogram for predicting the response to neoadjuvant chemotherapy in patients with locally advanced gastric cancer: a multicenter cohort study. EClinicalMedicine. 46:101348
Article PubMed PubMed Central Google Scholar
Zhang J, Cui Y, Wei K et al (2022) Deep learning predicts resistance to neoadjuvant chemotherapy for locally advanced gastric cancer: a multicenter study. Gastric Cancer 25(6):1050–1059
Article CAS PubMed Google Scholar
Van Griethuysen JJ, Fedorov A, Parmar C et al (2017) Computational radiomics system to decode the radiographic phenotype. Cancer Res 77(21):e104–e107
Article PubMed PubMed Central Google Scholar
Kumar V, Gu Y, Basu S et al (2012) Radiomics: the process and the challenges. Magn Reson Imaging 30(9):1234–1248
Article PubMed PubMed Central Google Scholar
Aerts HJ, Velazquez ER, Leijenaar RT et al (2014) Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach. Nat Commun 5(1):4006
Article CAS PubMed Google Scholar
Joshi SS, Badgwell BD (2021) Current treatment and recent progress in gastric cancer. CA Cancer J Clin 71(3):264–279
Article PubMed PubMed Central Google Scholar
Vrána D, Matzenauer M, Neoral Č et al (2018) From tumor immunology to immunotherapy in gastric and esophageal cancer. Int J Mol Sci 20(1):13
Article PubMed PubMed Central Google Scholar
Buonsanti G, Calistri D, Padovan L et al (1997) Microsatellite instability in intestinal-and diffuse-type gastric carcinoma. J Pathol 182(2):167–173
Article CAS PubMed Google Scholar
Pietrantonio F, Miceli R, Raimondi A et al (2019) Individual patient data meta-analysis of the value of microsatellite instability as a biomarker in gastric cancer. J Clin Oncol 37(35):3392–3400
Article PubMed Google Scholar
Smyth EC, Wotherspoon A, Peckitt C et al (2017) Mismatch repair deficiency, microsatellite instability, and survival: an exploratory analysis of the medical research council adjuvant gastric infusional chemotherapy (MAGIC) trial. JAMA Oncol 3(9):1197–1203
Article PubMed Google Scholar
Ratti M, Lampis A, Hahne JC, Passalacqua R, Valeri N (2018) Microsatellite instability in gastric cancer: molecular bases, clinical perspectives, and new treatment approaches. Cell Mol Life Sci 75:4151–4162
Article CAS PubMed PubMed Central Google Scholar
Polom K, Marano L, Marrelli D et al (2018) Meta-analysis of microsatellite instability in relation to clinicopathological characteristics and overall survival in gastric cancer. J Br Surg 105(3):159–167
Article CAS Google Scholar
Bevilacqua RA, Simpson AJ (2000) Methylation of the hMLH1 promoter but no hMLH1 mutations in sporadic gastric carcinomas with high-level microsatellite instability. Int J Cancer 87(2):200–203
Article CAS PubMed Google Scholar
Fleisher AS, Esteller M, Wang S et al (1999) Hypermethylation of the hMLH1 gene promoter in human gastric cancers with microsatellite instability. Cancer Res 59(5):1090–1095
CAS PubMed Google Scholar
Carvalho B, Pinto M, Cirnes L et al (2003) Concurrent hypermethylation of gene promoters is associated with a MSI-H phenotype and diploidy in gastric carcinomas. Eur J Cancer 39(9):1222–1227
Article CAS PubMed Google Scholar
Nakajima T, Akiyama Y, Shiraishi J et al (2001) Age-related hypermethylation of the hMLH1 promoter in gastric cancers. Int J Cancer 94(2):208–211
Article CAS PubMed Google Scholar
Kim KJ, Lee TH, Cho NY, Yang HK, Kim WH, Kang GH (2013) Differential clinicopathologic features in microsatellite-unstable gastric cancers with and without MLH1 methylation. Hum Pathol 44(6):1055–1064
Article CAS PubMed Google Scholar
Yamamoto H, Perez-Piteira J, Yoshida T et al (1999) Gastric cancers of the microsatellite mutator phenotype display characteristic genetic and clinical features. Gastroenterology 116(6):1348–1357
Article CAS PubMed Google Scholar
Sugimoto R, Sugai T, Habano W et al (2016) Clinicopathological and molecular alterations in early gastric cancers with the microsatellite instability-high phenotype. Int J Cancer 138(7):1689–1697
Article CAS PubMed Google Scholar
Liu P, Zhang XY, Shao Y, Zhang DF (2005) Microsatellite instability in gastric cancer and pre-cancerous lesions. World J Gastroenterol 11(31):4904–4907
Article CAS PubMed PubMed Central Google Scholar
Jahng J, Youn YH, Kim KH et al (2012) Endoscopic and clinicopathologic characteristics of early gastric cancer with high microsatellite instability. World J Gastroenterol 18(27):3571–3577
Article PubMed PubMed Central Google Scholar
Liu Z, Wang S, Dong D et al (2019) The applications of radiomics in precision diagnosis and treatment of oncology: opportunities and challenges. Theranostics. 9(5):1303–1322
Article PubMed PubMed Central Google Scholar
Sah BR, Owczarczyk K, Siddique M, Cook GJ, Goh V (2019) Radiomics in esophageal and gastric cancer. Abdom Radiol 44:2048–2058
Article Google Scholar
Mandolini M, Brunzini A, Facco G, Mazzoli A, Forcellese A, Gigante A (2022) Comparison of three 3D segmentation software tools for hip surgical planning. Sensors 22(14):5242
Article PubMed PubMed Central Google Scholar
Parmar C, Rios Velazquez E, Leijenaar R et al (2014) Robust radiomics feature quantification using semiautomatic volumetric segmentation. PLoS One 9(7):e102107
Article PubMed PubMed Central Google Scholar
Li Y, Cheng Z, Gevaert O et al (2020) A CT-based radiomics nomogram for prediction of human epidermal growth factor receptor 2 status in patients with gastric cancer. Chinese J Cancer Res 32(1):62–71
Article Google Scholar
Liang X, Wu Y, Liu Y, Yu D, Huang C, Li Z (2022) A multicenter study on the preoperative prediction of gastric cancer microsatellite instability status based on computed tomography radiomics. Abdom Radiol 47(6):2036–2045
Article Google Scholar
Avanzo M, Wei L, Stancanello J et al (2020) Machine and deep learning methods for radiomics. Med Phys 47(5):e185–e202
Article PubMed Google Scholar
Lee JG, Jun S, Cho YW et al (2017) Deep learning in medical imaging: general overview. Korean J Radiol 18(4):570–584
Article PubMed PubMed Central Google Scholar
Lao J, Chen Y, Li ZC et al (2017) A deep learning-based radiomics model for prediction of survival in glioblastoma multiforme. Sci Rep 7(1):10353
Article PubMed PubMed Central Google Scholar
Azodi CB, Tang J, Shiu SH (2020) Opening the black box: interpretable machine learning for geneticists. Trends Genet 36(6):442–455
Article CAS PubMed Google Scholar
The Lancet Respiratory M (2018) Opening the black box of machine learning. Lancet Respir Med 6(11):801.

Download references

Acknowledgements

The authors thank all colleagues who participated in this study for their support and help.

Critical relevance statement

Based on the extraction of radiomic features of tumor regions from pre-treatment CECT, optimized by deep learning algorithms, and combined with clinical baseline data to achieve non-invasive evaluation of GC MSI which provide support for GC personalized immunotherapy.

Funding

This study has received funding from the Beijing Bethune Charitable Foundation (Grant No. 05006).

Author information

Zinian Jiang and Wentao Xie are equal contributors as first authors

Authors and Affiliations

Qingdao Medical College, Qingdao University, Qingdao, Shandong, China
Zinian Jiang, Wenjun Pan, Sheng Jiang, Yun Lu & Dongsheng Wang
Department of Gastrointestinal Surgery, The Affiliated Hospital of Qingdao University, No. 1677, Wutaishan Road, Qingdao, 266000, Shandong, China
Wentao Xie, Xianxiang Zhang, Maoshen Zhang, Yun Lu & Dongsheng Wang
Department of Radiology, The Affiliated Hospital of Qingdao University, Qingdao, Shandong, China
Xiaoming Zhou
Department of Pathology, The Affiliated Hospital of Qingdao University, Qingdao, Shandong, China
Zhenqi Zhang
Shandong Key Laboratory of Digital Medicine and Computer Assisted Surgery, Qingdao, Shandong, China
Yun Lu

Authors

Zinian Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Wentao Xie
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoming Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Wenjun Pan
View author publications
You can also search for this author in PubMed Google Scholar
Sheng Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Xianxiang Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Maoshen Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Zhenqi Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Yun Lu
View author publications
You can also search for this author in PubMed Google Scholar
Dongsheng Wang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

DW was involved in conceptualization. ZJ, WP, SJ, and XZ were involved in data curation. WX performed the formal analysis. ZJ, ZZ, and MZ were involved in methodology. DW and YL were involved in project administration and supervision. ZJ was involved in resources, visualization, and writing—original draft. WX was involved in software. ZJ, XZ, and XZ were involved in validation. DW, YL, and XZ were involved in writing—review and editing. All authors approved for publication.

Corresponding authors

Correspondence to Yun Lu or Dongsheng Wang.

Ethics declarations

Ethics approval and consent to participate

Our institutional review board approved this retrospective study, and so, the requirement for informed consent was waived.

Consent for publication

The requirement for informed consent was waived.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1

. Clinical baseline data indicators and CT scanner parameters.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Jiang, Z., Xie, W., Zhou, X. et al. A virtual biopsy study of microsatellite instability in gastric cancer based on deep learning radiomics. Insights Imaging 14, 104 (2023). https://doi.org/10.1186/s13244-023-01438-1

Download citation

Received: 09 March 2023
Accepted: 15 April 2023
Published: 07 June 2023
DOI: https://doi.org/10.1186/s13244-023-01438-1

A virtual biopsy study of microsatellite instability in gastric cancer based on deep learning radiomics

Abstract

Objectives

Methods

Results

Conclusions

Graphical abstract

Key points

Introduction

Methods

Patient selection and clinical variables collection

MSI status definition and revaluation

Protocol of CECT image acquisition

Tumor segmentation

Radiomic feature extraction

Development of prediction models and nomogram

Statistical analysis

Results

Clinical and pathological characteristics

Radiomics feature selection

Construction of the radiomics signature

Evaluation of the models and nomogram

Discussion

Availability of data and materials

Abbreviations

References

Acknowledgements

Critical relevance statement

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher's Note

Supplementary Information

Additional file 1

Rights and permissions

About this article

Cite this article

Share this article

Keywords