Generalizable transfer learning of automated tumor segmentation from cervical cancers toward a universal model for uterine malignancies in diffusion-weighted MRI

Lin, Yu-Chun; Lin, Yenpo; Huang, Yen-Ling; Ho, Chih-Yi; Chiang, Hsin-Ju; Lu, Hsin-Ying; Wang, Chun-Chieh; Wang, Jiun-Jie; Ng, Shu-Hang; Lai, Chyong-Huey; Lin, Gigin

doi:10.1186/s13244-022-01356-8

Original Article
Open access
Published: 24 January 2023

Generalizable transfer learning of automated tumor segmentation from cervical cancers toward a universal model for uterine malignancies in diffusion-weighted MRI

Yu-Chun Lin^1,2,3,
Yenpo Lin¹,
Yen-Ling Huang¹,
Chih-Yi Ho¹,
Hsin-Ju Chiang^1,3,
Hsin-Ying Lu^1,3,
Chun-Chieh Wang^2,5,
Jiun-Jie Wang^1,2,
Shu-Hang Ng¹,
Chyong-Huey Lai⁴ &
…
Gigin Lin ORCID: orcid.org/0000-0001-7246-1058^1,3,4

Insights into Imaging volume 14, Article number: 14 (2023) Cite this article

2120 Accesses
4 Citations
1 Altmetric
Metrics details

Abstract

Purpose

To investigate the generalizability of transfer learning (TL) of automated tumor segmentation from cervical cancers toward a universal model for cervical and uterine malignancies in diffusion-weighted magnetic resonance imaging (DWI).

Methods

In this retrospective multicenter study, we analyzed pelvic DWI data from 169 and 320 patients with cervical and uterine malignancies and divided them into the training (144 and 256) and testing (25 and 64) datasets, respectively. A pretrained model was established using DeepLab V3 + from the cervical cancer dataset, followed by TL experiments adjusting the training data sizes and fine-tuning layers. The model performance was evaluated using the dice similarity coefficient (DSC).

Results

In predicting tumor segmentation for all cervical and uterine malignancies, TL models improved the DSCs from the pretrained cervical model (DSC 0.43) when adding 5, 13, 26, and 51 uterine cases for training (DSC improved from 0.57, 0.62, 0.68, 0.70, p < 0.001). Following the crossover at adding 128 cases (DSC 0.71), the model trained by combining data from adding all the 256 patients exhibited the highest DSCs for the combined cervical and uterine datasets (DSC 0.81) and cervical only dataset (DSC 0.91).

Conclusions

TL may improve the generalizability of automated tumor segmentation of DWI from a specific cancer type toward multiple types of uterine malignancies especially in limited case numbers.

Key points

1.
Transfer learning (TL) improves performance of tumor segmentation on diffusion-weighted imaging (DWI) especially in limited case numbers.
2.
Training a model by combining sufficient data of different cancers exhibited the highest performance for segmenting mixed cervical and uterine datasets and also improved the pretrained cervical model.
3.
The TL model with fine-tuning the early layers of the encoder part outperformed those by fine-tuning the other layers.

Introduction

Magnetic resonance imaging (MRI) plays crucial roles for gynecologic malignancies, in the initial or preoperative staging, response monitoring and surveillance for recurrence [1], by providing anatomical details and functional parameters [2]. MRI-derived target volumes are increasingly used for radiation treatment planning for cervical cancer, based on the accurate tumor contouring and precisely evaluating tumor extension [3]. In addition to tumor localization, MRI-based radiomics has potential to differentiate benign versus malignant uterine tumors [4] and provides useful information for the outcome prediction [5]. However, the radiomics pipeline involves extracting features from large number of images [6], related to potential discrepancies of manual contouring among human readers [7], highlighting the pressing need for a fully automatic method for tumor segmentation.

Convolutional neural networks (CNNs) have shown promise as alternatives for image segmentation [8, 9]. However, building a CNN model requires far more annotated datasets than are available in medical imaging [10]. Furthermore, training a segmentation model in a three-dimensional volume dataset, such as CT or MRI, requires manual labeling in a slice-by-slice manner, which is extremely labor intensive. Therefore, reducing the size of the training dataset is necessary to boost training efficiency. Transfer learning (TL) is an approach to address this problem, which uses features learned from a source domain as a pretrained model and transfers them to another domain [11]. Several studies have shown that TL has potential to overcome the requirement of large data in medical imaging [12,13,14]. To the best of our knowledge, no study has investigated the usefulness of TL in studying different cancers. Because the majority of tumors exhibit high intensity on diffusion-weighted (DW) imaging and low intensity on the apparent diffusion coefficient (ADC) map, we assume that various tumors have common features in certain layers of the network, and knowledge regarding one tumor type can be transferred for the study of another type. Previous study has established a model of tumor segmentation in cervical cancer (CX) with DW images [15]. In the current study, we hypothesized that the pretrained model of CX dataset can be transferred to all cervical and uterine malignancies, with a reduced sample size of labeled images.

This study investigated the generalizability of transfer learning (TL) of automated tumor segmentation from cervical cancers toward a universal model for all cervical and uterine malignancies in diffusion diffusion-weighted magnetic resonance imaging (DWI). Specifically, we investigated whether TL can reduce the data size required for the target domain, and we deciphered which parts of CNN layers should be fine-tuned to achieve adequately accurate tumor segmentation. Finally, the model performance of TL was compared with an aggregated model, which was trained using the combined cervical and uterine datasets, for the universal tumor segmentation of malignant uterine tumors on MRI.

Materials and methods

Patients

This exploratory study retrospectively analyzed the dataset of patients with cervical or uterine malignancies at a tertiary referral center between July 2010 and January 2018. The Institutional Review Board approved the study, and informed consent was waved. The experiments were performed using two cohorts of patients: (1) cervical dataset, which was used as the source domain for the pretrained model. This dataset was used for establishing the tumor segmentation model in previous report [15]; the dataset included the data of 144 patients with cervical cancer for training and the data of 25 patients for testing; (2) uterine dataset, the target domain for transfer learning experiments.

The inclusion criteria for patients in the uterine dataset were as follows: (a) female sex, (b) age of 20–80 years, and (c) clinical diagnosis of uterine malignancies. The exclusion criteria were as follows: (a) contraindicated for an MRI study due to a cardiac pacemaker or cochlear implantation; (b) post major pelvic surgery, total hip replacement, or magnetic substance implantation in the pelvis; (c) significant major systemic disease, such as renal failure, heart failure, stroke, acute myocardial infarction/unstable angina, poor controlled diabetes mellitus, and poor controlled hypertension; and (d) pregnant or breast-feeding.

Of 345 consecutive patients enrolled, we excluded 16 patients who had no visible tumors and nine patients susceptible to artifacts in DW imaging. Thus, the data of 320 patients in the uterine dataset were included in the final analysis (Fig. 2). Among them, 256 patients (80%) were randomized to the training dataset, and the remaining 64 patients (20%) were included in the testing dataset. All data were exported anonymously.

MRI data and image annotation

MRI studies were performed using two MRI scanners: Skyra (n = 248) and Trio TIM (n = 72) (Siemens Healthineers). All patients underwent the standard MR protocol from Chang Gung Memorial Hospitals following the guide of European Society of Urogenital Radiology for female pelvis imaging [16]. The imaging protocol included T1-weighted, T2-weighted, DW images and contrast-enhanced T1-weighted images acquired in sagittal and axial planes. The DW imaging utilized single-shot echo planar imaging with b-values = 0 and 1000 s/mm² to generate ADC map. (repetition time/echo time, 3700–8200 ms/65–85 ms; slice thickness/interval, 4 mm /1 mm; field of view, 200 × 200 mm; matrix, 256 × 256). The slice sections ranged 14–22 to cover the whole tumor for each patient. The sagittal DW images and the corresponded ADC maps of each slice were used as input sources for training and testing.

Regions of interest (ROIs) of tumor contours were delineated by the consensus of two gynecologic radiologist (Y.L.H. and G.L. with 7 and 13 years of experience in gynecology, respectively) using an in-house developed interface in MATLAB (Mathworks, Natick). Both readers were blinded to clinical outcomes. We avoided the ROIs contaminating the adjacent normal endometrium and myometrium and excluded the normal cervical stroma when studying the cervical invasion. The labeled ROIs were used as the ground truth for the model training.

Network and training

In an optimization study, we explored the performance of U-Net and DeepLab V3 + architectures for tumor segmentation in cervical cancer. Finally, the DeepLab V3 + architecture was adopted because it produced higher preliminary accuracies (Additional file 1). The DW MRI with b-values of 0 and 1000 s/mm² and ADC images were used as three-channel input sources for training. Xception was used as the backbone (first 356 layers) of the DeepLab V3 + network. The networks were trained with weight randomization and stochastic gradient descent Adam Optimizer method [17]. The signal intensities of all images were normalized to a mean = 0 and standard deviation = 1 [18]. We implanted data augmentation on each training image set, such that six times of image data were generated (20°, − 20°, 60°, − 60°, and horizontal flip). Finally, 10,164 images from the 256 patients in the uterine training dataset were used for training. The learning rate was 0.001, and the number of epochs until convergence was 100, with batch sizes of 2. The network was trained using Keras 2.1.4 written in Python 3.5.4 and TensorFlow 1.5.0. The code for the DeepLab V3 + model is available at https://github.com/bonlime/keras-deeplab-v3-plus.

Model experiments

The pretrained model was established using DeepLab V3 + for cervical dataset (n = 144). We performed three combinations of model training and prediction: (a) UT-only model: training from scratch using the uterine dataset without TL from cervical model; (b) TL model: using the pretrained cervical model and fine-tuning of certain layers using the uterine dataset; and (c) Aggregated model: training from scratch by using the combined cervical and uterine datasets. This model was proposed to test the generalization for both cervix and uterine cancers.

To investigate the effect of freezing/tuning layers on TL performance, we examined three levels as the cutoff layers on the TL model. The layers before the identified layer were frozen, whereas those after that were fine-tuned based on the target domain data (Fig. 1). (a) L1: the first layer following the Xception model of the encoder. This was to retain the low-level features learned in the source domain and retrain the high-level features from the target domain. (b) L2: a deep layer following the Atrous Spatial Pyramid Pooling at the end of the encoder. This was to retain the low-level features and most of the high-level features of the encoder in the source domain and retrain the last layer in the encoder. (c) L3: the layer at decoder initiation. This was to retain all the extracted features of the source domain in the encoder and retrain from the start of the decoder.

To assess the influence of data size on training performance, we examined different training data sizes of uterine dataset through splitting the training samples randomly with 2% (n = 5), 5% (n = 13), 10% (n = 26), 20% (n = 51), 50% (n = 128), and 100% (n = 256) of patients (Fig. 2). The independent dataset comprising patients with uterine cancer (n = 64) and cervical cancer (n = 25) was used for testing the performance of each group.

Evaluation of model performances

The accuracy of segmentation was estimated using a dice similarity coefficient (DSC) [19] as follows: \({\text{D}}\left( {{\text{X}},{\text{Y}}} \right) = \frac{{2\left\lfloor {X \cap Y} \right\rfloor }}{\left\lfloor X \right\rfloor + \left\lfloor Y \right\rfloor }\), where X and Y denote the segmentation of the prediction and ground truth, respectively. The trained models with the highest DSC in each group were selected as the final models for prediction in the testing dataset.

Extraction of ADC radiomics

To assess the reliability of predicted ROIs by the established models, we examined the radiomics features of ADC values of tumor ROIs extracted by manual and automatic segmentation models. The 14 first-order radiomics features of tumors were calculated using pyradiomics software [20] based on the 3D volumes of ROIs on ADC images.

Statistics

Statistical analysis was performed using GraphPad Prism software version 8.0 for Mac (GraphPad Software, San Diego, CA, USA). The differences in DSCs in various trained models were assessed using analysis of variance (ANOVA) with Tukey’s post hoc analysis. The stability of the model was assessed through k-fold cross-validations by using ANOVA on DSCs between labeled and predicted ROIs by each trained model. The reliability of radiomics features of tumor ROIs was evaluated using intraclass correlation coefficient (ICC) obtained by manual and automatic segmentation models.

Results

Patient characteristics in UT dataset

In total, 320 patients with uterine malignancies were eligible for the final analysis. Table 1 presents the clinical and demographic features of the training (n = 256) and testing datasets (n = 64). The median patient age was 53 years (range: 25–88 years). The histopathology types comprised endometrial carcinoma (EC, n = 309, 96.5%), endometrial stromal sarcoma (ESS, n = 5, 1.6%), leiomyosarcoma (LMS, n = 4, 1.3%), and carcinosarcoma (malignant mixed müllerian tumor, MMMT, n = 2, 0.6%), with tumors either well/moderately differentiated (n = 259, 80.9%) or poorly differentiated (n = 61, 19.1%). Tumor size ranged from 0.14 to 270 cm³ (median, 4.2 cm³). No statistically significant differences were found regarding the clinical or demographic characteristics between the training and testing datasets.

Table 1 Patient demographics of the uterine dataset

Full size table

Model performance

Figure 3 shows the performance of models in various training combinations and under various sample sizes from the uterine dataset. Initially applying the pretrained cervical model directly to the combined cervical and uterine datasets yielded a DSC of only 0.43 (95% confidence interval [CI], 0.38–0.49). The TL models with the fine-tuning level at L1 exhibited higher DSCs as compared with those at L2 or L3 (p < 0.05 for all data size subgroups). The TL model of L1 fine-tune level exhibited the highest DSCs when the used uterine data size was ≤ 51 (DSC = 0.57, 0.62, 0.68, 0.70 for sample sizes 5, 13, 26, and 51, respectively, p < 0.001). As more training data were added, the performances of models increased. With the data size of ≥ 128 used, the aggregated model exhibited the highest DSC among all the models (DSC = 0.73 and 0.81 for sample sizes 128 and 256, respectively, p < 0.001). Figure 4 demonstrates a patient with endometrial cancer where tumor contours were generated using various training models and sample sizes.

Subgroup analysis was performed on cervical dataset, uterine dataset, and the combined cervical and uterine datasets, respectively. The prediction accuracies of various models in predicting tumor contours using different training sample sizes are summarized in Fig. 5 and Table 2. Applying the pretrained cervical model directly to uterine dataset yielded a DSC of only 0.31. (95% CI, 0.25–0.34). On testing the uterine cancer cases, the TL model exhibited the highest DSCs when the training size of uterine data was small and medium (DSC = 0.61 and 0.70 for n = 13 and 51, respectively) among all models (p < 0.001). The UT-only model had the highest DSC when the full dataset was used (n = 256, DSC = 0.79, 95% CI, 0.75–0.83, p < 0.001). On testing the cervical cancer cases, the TL model achieved similar DSCs as the aggregated model if adding uterine cases of n = 13 and n = 51 for training (DSC = 0.67 and 0.71, respectively). Surprisingly, the aggregated model drastically improved the DSC in the cervical dataset if adding full uterine cases for training (n = 256, DSC = 0.91, 95% CI, 0.87–0.94, p < 0.001).

Table 2 Prediction accuracies of models for different testing datasets using various training sample sizes

Full size table

Reliability of radiomics features

Figure 6 shows the ICC values of ADC radiomics features in first-order obtained by manual and automatic segmentation by uterine-only and TL models with various trained sample sizes. Both the models exhibited poor to moderate reliabilities when the training data size was small (n = 13) with ICC = 0.32–0.58 for uterine-only model and 0.38–0.69 for TL model. As the training sample size increased to n = 51, the TL model exhibited higher ICCs compared with the UT-only model for all parameters (ICC = 0.73–0.89 and 0.53–0.81 for TL and UT-only models, respectively, p < 0.001). With the use of full data size, both models exhibited high reliabilities with ICC > 0.8 for all the parameters (ICC = 0.81–0.96 and 0.8–0.96 for TL and UT-only models, respectively).

Discussion

We exploited the potential of TL through domain adaption for automated tumor segmentation for gynecological cancers on diffusion MRI. Our results showed the effectiveness of the DeepLab V3 + network in tumor segmentation through the adaptation of previously acquired knowledge of cervical cancer to the new domain of uterine malignancy. When the number of training samples was limited in the target dataset, the TL approach outperformed conventional training from scratch with the same size of training data.

TL works under the assumption of a common feature space for data distribution from source and target domains shared. CNN architectures with transferable weights are particularly suited for TL [21, 22]. We hypothesized that cervical cancer and uterine malignancy might have some common features in DW imaging within the scope of gynecology, and the learned weights from the pretrained cervical model might generalize to all cervical and uterine malignancies. Our results showed that model trained from either cervical or uterine tumor failed to predict the contour of the other cancer properly without fine-tuning. Our approach underscored that the cervical cancer and uterine malignancy did share common features that can be learned at low-level layers, whereas the high-level features are specific to the target domain of uterine malignancy. The network can be adopted to the target domain rapidly with only a small sample size to fine-tune the weights.

Our results suggest that with a small sample size, the TL approach outperformed training from scratch for both the segmentation similarity measures as well as the reliability of the extracted radiomics parameters. Kurata et al. [23] demonstrated DSCs of 0.68 and 0.56 in DW imaging and ADC images, respectively, for endometrial cancer by training 180 uterine cancer patients from scratch. Our results showed that, with only 51 patients used, the TL model exhibited higher DSC of 0.70 than the UT-only model with DSC of 0.64. Although the DSC of 0.7 is not satisfactory for the tumor segmentation task, the extracted ADC radiomics is reliable with ICCs of 0.73–0.89. This is nearly comparable with the results by Kurata et al. [23], who reported ICCs of 0.75–0.93 for the first-order features based on T2-weighted image by training 180 patients with endometrial cancer.

The TL approach could be in particular useful for uncommon diseases such as uterine sarcomas demonstrated in the present study. Our finding is consistent with that of Ghafoorian et al. [24], who performed domain adaptation for segmentation on brain white matter among different MRI. They showed that the accuracy of the model using TL outperformed the model trained from scratch with a sample size of < 50. Swati et al. [25] reported that the data size can be reduced to as low as 25% (n = 58) by using TL for brain tumor classification on MRI by using the VGG19 network. Our results showed that TL model outperformed all the non-TL models with data size < 128. The potential reason may be that the pretrained model may contain some mutual features for both cervical cancer and uterine malignancy, and these features would dominate the weights of the trained model when the sample size is small.

We also demonstrated that the performance of TL is dependent on fine-tuning layers in the network. In a CNN, the convolutional layers near the input are regarded to extract general features, whereas deeper layers are specific to the target task [26]. Shirokikh et al. [27] reported that fine-tuning the first layers significantly outperforms fine-tuning the last layers in brain segmentation by using U-Net. Our results demonstrated that DeepLab V3 + exhibited higher accuracy compared with U-Net for tumor segmentation in uterine malignancy. We observed that fine-tuning the layers immediately after the Xception portion [28] exhibited the highest performance among the various levels of interest in the network. In addition, fine-tuning at the encoder (L1 and L2 levels) outperformed that at the decoder (L3) of the network. This finding implies that low-level features at the early encoder portion dominate the common features of tumors in cervical cancer and uterine malignancy.

Our study had some limitations. First, we focused on only TL between cervical cancer and uterine malignancy because these two cancers are prevalent in gynecology and the data size available for clinical use is the largest. The value of TL for ovarian cancer is yet to be investigated. Second, we used DeepLab V3 + in this study; innovative networks always exist for semantic segmentation. However, most of the segmentation networks use the encoder–decoder form with various backbones for feature extraction. Nonetheless, the current study provides a proof of concept that fine-tuning from the early part of the encoder is recommended for TL among different cancers. Third, the tumors for radiation planning are segmented on fast spin echo T2-weighted images with higher resolution and signal to noise. Thus, generalizability to other types of datasets needs to also be demonstrated in the future.

In conclusion, our results demonstrated that TL may improve the generalizability of automated tumor segmentation of DWI from a specific cancer type toward multiple types of uterine malignancies especially in limited case numbers. However, if large amounts of annotated data are available, training from scratch using the target dataset appears to be a better option for specific disease.

Availability of data and materials

The datasets used or analyzed during the current study are available from the corresponding author on reasonable request.

Abbreviations

ADC:: Apparent diffusion coefficient
CNN:: Convolutional neural network
CX:: Cervical cancer
DSC:: Dice similarity coefficient
DW:: Diffusion-weighted
EC:: Endometroid carcinoma
ESS:: Endometrial stromal sarcoma
LMS:: Leiomyosarcoma
MRI:: Magnetic resonance imaging
ROI:: Region of interest
TL:: Transfer learning
UT:: Uterine tumors
MMMT:: Malignant mixed müllerian tumor

References

Manganaro L, Lakhman Y, Bharwani N et al (2021) Staging, recurrence and follow-up of uterine cervical cancer using MRI: updated guidelines of the european society of urogenital radiology after revised FIGO staging 2018. Eur Radiol 31:7802–7816
Article Google Scholar
Lura N, Wagner-Larsen KS, Forsse D et al (2022) What MRI-based tumor size measurement is best for predicting long-term survival in uterine cervical cancer? Insights Imaging 13:105
Article Google Scholar
Batumalai V, Burke S, Roach D et al (2020) Impact of dosimetric differences between CT and MRI derived target volumes for external beam cervical cancer radiotherapy. Br J Radiol 93:20190564
Article Google Scholar
Wang T, Gong J, Li Q et al (2021) A combined radiomics and clinical variables model for prediction of malignancy in T2 hyperintense uterine mesenchymal tumors on MRI. Eur Radiol 31:6125–6135
Article Google Scholar
Lin G, Yang LY, Lin YC et al (2019) Prognostic model based on magnetic resonance imaging, whole-tumour apparent diffusion coefficient values and HPV genotyping for stage IB-IV cervical cancer patients following chemoradiotherapy. Eur Radiol 29:556–565
Article Google Scholar
Gillies RJ, Kinahan PE, Hricak H (2016) Radiomics: images are more than pictures, they are data. Radiology 278:563–577
Article Google Scholar
Min LA, Vacher YJL, Dewit L et al (2020) Gross tumour volume delineation in anal cancer on T2-weighted and diffusion-weighted MRI - Reproducibility between radiologists and radiation oncologists and impact of reader experience level and DWI image quality. Radiother Oncol 150:81–88
Article Google Scholar
Perkuhn M, Stavrinou P, Thiele F et al (2018) Clinical evaluation of a multiparametric deep learning model for glioblastoma segmentation using heterogeneous magnetic resonance imaging data from clinical routine. Invest Radiol. https://doi.org/10.1097/rli.0000000000000484
Article Google Scholar
Tian Z, Liu L, Zhang Z, Fei B (2018) PSNet: prostate segmentation on MRI based on a convolutional neural network. J Med Imaging (Bellingham) 5:021208
Google Scholar
Shen D, Wu G, Suk HI (2017) Deep learning in medical image analysis. Annu Rev Biomed Eng 19:221–248
Article CAS Google Scholar
Ghafoorian M, Mehrtash A, Kapur T et al (2017) Transfer learning for domain adaptation in mri: application in brain lesion segmentationmedical image computing and computer assisted intervention − MICCAI 2017. (Lecture Notes in Computer Science), pp 516–524
Banerjee I, Crawley A, Bhethanabotla M, Daldrup-Link HE, Rubin DL (2018) Transfer learning on fused multiparametric MR images for classifying histopathological subtypes of rhabdomyosarcoma. Comput Med Imaging Graph 65:167–175
Article Google Scholar
Shan H, Zhang Y, Yang Q et al (2018) 3-D convolutional encoder-decoder network for low-dose CT via transfer learning from a 2-D trained network. IEEE Trans Med Imaging 37:1522–1534
Article Google Scholar
Christopher M, Belghith A, Bowd C et al (2018) Performance of deep learning architectures and transfer learning for detecting glaucomatous optic neuropathy in fundus photographs. Sci Rep 8:16685
Article Google Scholar
Lin YC, Lin CH, Lu HY et al (2020) Deep learning for fully automated tumor segmentation and extraction of magnetic resonance radiomics features in cervical cancer. Eur Radiol 30:1297–1305
Article Google Scholar
Alt C, Bharwani N, Brunesch L et al (2019) ESUR quick guide to female pelvis imaging. European Society of Urogenital Radiology. ESUR Guidelines; 2019; Available online: https://www.esur.org/esur-guidelines
Arcos-Garcia A, Alvarez-Garcia JA, Soria-Morillo LM (2018) Deep neural network for traffic sign recognition systems: an analysis of spatial transformers and stochastic optimisation methods. Neural Netw 99:158–165
Article Google Scholar
Trebeschi S, van Griethuysen JJM, Lambregts DMJ et al (2017) Deep learning for fully-automated localization and segmentation of rectal cancer on multiparametric MR. Sci Rep 7:5301
Article Google Scholar
Taha AA, Hanbury A (2015) Metrics for evaluating 3D medical image segmentation: analysis, selection, and tool. BMC Med Imaging 15:29
Article Google Scholar
van Griethuysen JJM, Fedorov A, Parmar C et al (2017) Computational radiomics system to decode the radiographic phenotype. Cancer Res 77:e104–e107
Article Google Scholar
Mazo C, Bernal J, Trujillo M, Alegre E (2018) Transfer learning for classification of cardiovascular tissues in histological images. Comput Methods Programs Biomed 165:69–76
Article Google Scholar
Sevakula RK, Singh V, Verma NK, Kumar C, Cui Y (2019) Transfer learning for molecular cancer classification using deep neural networks. IEEE/ACM Trans Comput Biol Bioinform 16:2089–2100
Article Google Scholar
Kurata Y, Nishio M, Moribata Y et al (2021) Automatic segmentation of uterine endometrial cancer on multi-sequence MRI using a convolutional neural network. Sci Rep 11:14440
Article CAS Google Scholar
Ghafoorian M, Mehrtash A, Kapur T et al (2017) Transfer learning for domain adaptation in MRI: application in brain lesion segmentation. Springer International Publishing, Cham, pp 516–524
Google Scholar
Swati ZNK, Zhao Q, Kabir M et al (2019) Brain tumor classification for MR images using transfer learning and fine-tuning. Comput Med Imaging Graph 75:34–46
Article Google Scholar
Yosinski J, Clune J, Bengio Y, Lipson H (2014) How transferable are features 554 in deep neural networks? https://ui.adsabs.harvard.edu/abs/2014arXiv1411.1792Y. Accessed 1 Nov 2014
Shirokikh B, Zakazov I, Chernyavskiy A, Fedulova I, Belyaev M (2020) First U-net layers contain more domain specific information than the last ones. https://ui.adsabs.harvard.edu/abs/2020arXiv200807357S. Accessed 1 Aug 2020
Chollet F (2016) Xception: deep learning with depthwise separable convolutions. https://ui.adsabs.harvard.edu/abs/2016arXiv161002357C. Accessed 1 Oct 2016

Download references

Acknowledgements

The authors acknowledge the assistance provided by the Cancer Center and Clinical Trial Center, Chang Gung Memorial Hospital, Linkou, Taiwan, which was founded by the Ministry of Health and Welfare of Taiwan MOHW 109-TDU-B-212-114005.

Funding

This study was funded by Ministry of Science and Technology, Taiwan (grant no.: MOST 110-2628-B-182A-018, MOST 109-2314-B-182A-045, MOST 111-2628-B-182A-012 and MOST 111-2314-B-182A-041) and Chang Gung Medical Foundation grant CLRPG3K0023, CMRPG3J1631, CMRPG3I0141, CMRPG3L1601, CMRPG3M0731 and SMRPG3K0053. Chang Gung IRB 201702080A0C601, 201702204B0, 201902082B0C601 and 202002290B0C601.

Author information

Authors and Affiliations

Department of Medical Imaging and Intervention, Chang Gung Memorial Hospital at Linkou and Keelung, 5 Fuhsing St., Guishan, Taoyuan, 33382, Taiwan
Yu-Chun Lin, Yenpo Lin, Yen-Ling Huang, Chih-Yi Ho, Hsin-Ju Chiang, Hsin-Ying Lu, Jiun-Jie Wang, Shu-Hang Ng & Gigin Lin
Department of Medical Imaging and Radiological Sciences, Chang Gung University, Taoyuan, 33302, Taiwan
Yu-Chun Lin, Chun-Chieh Wang & Jiun-Jie Wang
Clinical Metabolomics Core Laboratory, Chang Gung Memorial Hospital at Linkou, 5 Fuhsing St., Guishan, Taoyuan, 33382, Taiwan
Yu-Chun Lin, Hsin-Ju Chiang, Hsin-Ying Lu & Gigin Lin
Gynecologic Cancer Research Center, Department of Obstetrics and Gynecology, Chang Gung Memorial Hospital at Linkou and Chang Gung University, 5 Fuhsing St., Guishan, Taoyuan, 33382, Taiwan
Chyong-Huey Lai & Gigin Lin
Department of Radiation Oncology, Chang Gung Memorial Hospital at Linkou and Chang Gung University, 5 Fuhsing St., Guishan, Taoyuan, 33382, Taiwan
Chun-Chieh Wang

Authors

Yu-Chun Lin
View author publications
You can also search for this author in PubMed Google Scholar
Yenpo Lin
View author publications
You can also search for this author in PubMed Google Scholar
Yen-Ling Huang
View author publications
You can also search for this author in PubMed Google Scholar
Chih-Yi Ho
View author publications
You can also search for this author in PubMed Google Scholar
Hsin-Ju Chiang
View author publications
You can also search for this author in PubMed Google Scholar
Hsin-Ying Lu
View author publications
You can also search for this author in PubMed Google Scholar
Chun-Chieh Wang
View author publications
You can also search for this author in PubMed Google Scholar
Jiun-Jie Wang
View author publications
You can also search for this author in PubMed Google Scholar
Shu-Hang Ng
View author publications
You can also search for this author in PubMed Google Scholar
Chyong-Huey Lai
View author publications
You can also search for this author in PubMed Google Scholar
Gigin Lin
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

YCL and GL contributed to the study ideation and design. YL, YLH and CC Wang contributed to the acquisition and evaluation of radiologic data. CYH, HJC, HYL, JJW, SHN and CHL contributed to the acquisition of data. YCL, CYL and GL contributed to the analysis of the data and the drafting of the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Gigin Lin.

Ethics declarations

Ethics approval and consent to participate

Institutional Review Board approval was obtained. Written informed consent was waived by the Institutional Review Board.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1:

Comparison of U-Net and DeepLab V3+ architectures for tumor segmentation in cervical cancer.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Lin, YC., Lin, Y., Huang, YL. et al. Generalizable transfer learning of automated tumor segmentation from cervical cancers toward a universal model for uterine malignancies in diffusion-weighted MRI. Insights Imaging 14, 14 (2023). https://doi.org/10.1186/s13244-022-01356-8

Download citation

Received: 22 June 2022
Accepted: 04 December 2022
Published: 24 January 2023
DOI: https://doi.org/10.1186/s13244-022-01356-8

Generalizable transfer learning of automated tumor segmentation from cervical cancers toward a universal model for uterine malignancies in diffusion-weighted MRI

Abstract

Purpose

Methods

Results

Conclusions

Key points

Introduction

Materials and methods

Patients

MRI data and image annotation

Network and training

Model experiments

Evaluation of model performances

Extraction of ADC radiomics

Statistics

Results

Patient characteristics in UT dataset

Model performance

Reliability of radiomics features

Discussion

Availability of data and materials

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher's Note

Supplementary Information

Additional file 1:

Rights and permissions

About this article

Cite this article

Share this article

Keywords