Automatic analysis framework based on 3D-CT multi-scale features for accurate prediction of Ki67 expression levels in substantial renal cell carcinoma
Insights into Imaging volume 14, Article number: 130 (2023)
To investigate the effectiveness of an automatic analysis framework based on 3D-CT multi-scale features in predicting Ki67 expression levels in substantial renal cell carcinoma (RCC).
This retrospective study was conducted using multi-center cohorts consisting of 588 participants with pathologically confirmed RCC. The participants were divided into an internal training set (n = 485) and an external testing set (n = 103) from four and one local hospitals, respectively. The proposed automatic analytic framework comprised a 3D kidney and tumor segmentation model constructed by 3D UNet, a 3D-CT multi-scale features extractor based on the renal–tumor feature, and a low or high Ki67 prediction classifier using XGBoost. The framework was validated using a fivefold cross-validation strategy. The Shapley additive explanation (SHAP) method was used to determine the contribution of each feature.
In the prediction of low or high Ki67, the combination of renal and tumor features achieved better performance than any single features. Internal validation using a fivefold cross-validation strategy yielded AUROC values of 0.75 ± 0.1, 0.75 ± 0.1, 0.83 ± 0.1, 0.77 ± 0.1, and 0.87 ± 0.1, respectively. The optimal model achieved an AUROC of 0.87 ± 0.1 and 0.82 ± 0.1 for low vs. high Ki67 prediction in the internal validation and external testing sets, respectively. Notably, the tumor first-order-10P was identified as the most influential feature in the model decision.
Our study suggests that the proposed automatic analysis framework based on 3D-CT multi-scale features has great potential for accurately predicting Ki67 expression levels in substantial RCC.
Critical relevance statement
Automatic analysis framework based on 3D-CT multi-scale features provides reliable predictions for Ki67 expression levels in substantial RCC, indicating the potential usage of clinical applications.
The multi-scale 3D-CT features can provide robust prediction of Ki67 in RCC.
The proposed analysis framework can be easily transferred to new datasets.
The contribution of extracted features was evaluated using SHAP to explore model interpretation.
Renal cell carcinoma (RCC) is the most common renal tumor in adults and the second most common urinary tract tumor, accounting for 3% of all cancers . Over the last twenty years, there has been a consistent annual rise of 2% in the incidence of RCC worldwide . At initial diagnosis, a large percentage (approximately 20–30%) of RCC patients have distant metastases, and a substantial number (approximately 20–30%) of patients with localized RCC develop metastases even after curative nephrectomy, with 10–15% of cases ultimately resulting in fatality . As a result, it is crucial to accurately identify RCC patients with poor prognoses at an early stage, which holds significant clinical significance.
Ki-67 is a nuclear antigen that reflects cell proliferation status, expressed throughout the cell cycle of proliferating cells except for G0 phase, and is closely associated with tumor proliferation and invasion. Many studies have suggested that Ki-67 is a useful prognostic marker in RCC, with high expression levels being associated with poor prognosis and advanced clinicopathological features [4,5,6]. Detection of Ki67 requires a pathological puncture, which is an invasive procedure that carries risks such as tumor needle tract metastasis or infection. Furthermore, due to the possibility of RCC patients experiencing recurrence or multiple repeated surgeries, the development of a noninvasive and dynamic predictive model for Ki67 holds significant clinical value.
CT is the predominant imaging modality for preoperative assessment and postoperative surveillance of RCC patients, and it represents a critical component of standard patient care . In recent years, machine learning algorithms have shown promise in the field of medical image analysis and have demonstrated their ability to accurately predict Ki67 at many cancers [8,9,10,11]. Despite their promising predictive performance, many of these models are limited in terms of their practical applicability in clinical settings. Two specific limitations include the reliance on manual annotation by radiologists for target regions learned by the models, which is impractical for clinical practice. Additionally, most of the models only offer classification results without providing insight into the decision-making logic behind those results, leading to distrust and hesitation among clinicians in implementing them .
To the best of our knowledge, there was currently no literature available that reports the predictive value of machine learning-based CT features for Ki67 in substantial RCC. Given the complexity and diversity of analysis, it is crucial to extract more comprehensive image features to enable accurate prediction. In this study, we propose an automatic analysis framework that includes three key modules: a 3D kidney and tumor segmentation model constructed using 3D UNet, a 3D-CT multi-scale features extractor based on the renal–tumor and a low or high Ki67 prediction classifier using XGBoost. To ensure a robust framework, we employed a fivefold cross-validation strategy. Additionally, we used a quantitative model interpretation method called SHAP to explore the contribution of each feature.
Materials and methods
This study is a retrospective analysis of multi-center datasets, encompassing 588 participants who underwent nephrectomy for substantial renal cell carcinoma from 2017 to 2022 in five medical centers. The dataset was divided into an internal group of 485 participants and an external testing group of 103 participants, with the internal group further divided into a training group of 388 participants (80%) and an internal validation group of 97 participants (20%). The protocols for collecting the data were approved by the local institutional review board (KY2022-036-01) and informed consent was waived, as the study relied on anonymous clinical data and images.
In adherence to the inclusion criteria and processing protocols outlined in Fig. 1a, we disregarded cases with inadequate clinical and pathological information and limited our analysis to corticomedullary phase images. Inclusion Criteria: 1. Underwent partial or radical nephrectomy and were pathologically confirmed to have substantial renal cell carcinoma; 2. Consecutive adults; and 3. Without chemotherapy or radiotherapy before surgery. Exclusion Criteria: 1. Incomplete semantic segmentation of kidney and tumor region; 2. Patients with cystic renal carcinoma; 3. Not-corticomedullary phase images; 4. Incomplete clinicopathological diagnostic report; and 5. Patients with low-quality images (low resolution, disordered, and blurred images).
Images preparation and segmentation
Following a thorough selection process, the images were meticulously annotated and divided into kidneys and kidney cancer segments by a team of two experienced radiologists and two adept medical students. Based on the 3D UNet network , we constructed an automated kidneys and kidney cancer segmentation model. The principle behind this was to identify a function "g" consisting of a set of heuristics that adjusts normalization and resampling techniques. Hyperparameters, including pooling operations, batch size, and patch size, were chosen based on the characteristics of the dataset. To guarantee precision, the segmentation results were reviewed and revised by a specialist with over 20 years of experience and multiple observers. This process was repeated to ensure accuracy in the delineation and prepare for further model training.
Multi-scale features extractor
The 3D-CT multi-scale features extractor comprised a radiomics feature extractor, a PCA (principal component analysis) matrix dimensionality reduction feature extractor and an SVD (singular value decomposition) matrix decomposition feature extractor. The PyPi pyradiomics  module was utilized to extract the texture, morphological and statistical features of the CT images, yielding 100 features for each of the ROI. Subsequently, a dimensional reduction was performed on the segmented voxels of the CT images, with adjustments made to image resolution, normalization using mean and standard deviation values, cropping to include only the kidneys or tumor regions, and filling of empty regions with minimal pixels (see the supplementary file for details and Additional file 1: Fig. S1). Three hundred and twenty features by dimensionality reduction, which represents the original voxel information, was performed by PCA (256 features for each of the ROI) and SVD (64 features for each of the ROI).
Model construction and explaining
Gradient boosting decision trees (XGBoost, v1.3.3)  were utilized to predict the Ki67 expression levels. A combination of Ki67 levels greater than or equal to 5% was classified as high risk and Ki67 levels less than 5% as low risk. To evaluate the performance of single renal features, single tumor features, and combined renal–tumor features, three classifiers were constructed for Ki67 (high risk vs. low risk).
The process of decision-making was explored with the aid of SHapley Additive exPlanations (SHAP)  by decomposing the model's decision into individual feature influences. A high SHAP value indicates a significant impact on the model's decision. The accuracy and area under the receiver operating characteristic curve (AUROC) were quantified with a 95% confidence interval and were deemed statistically significant if the p-value was less than 0.05. The statistical analysis was performed in Python (v3.8) and R (v3.6.3).
A cohort of 588 individuals diagnosed with substantial RCC, comprising 298 (236, 62) males and 290 (149, 41) females, participated in the study. A summary of their basic and clinical information is presented in Table 1. The internal set of 485 participants was randomly divided into a training set, consisting of 388 cases (80%), and a validation set, comprised of 97 cases (20%). The remaining 103 participants were assigned to an external testing set (Fig. 1a). The automatic analysis framework is illustrated in Fig. 1b.
Multi-scale features framework provides robust analysis capability
The 3D region of the kidney and tumor were expertly segmented from CT images through a structure-based 3D UNet. In Fig. 2, the segmentation model demonstrates outstanding performance in test cases, with the red ROI symbolizing the kidney and the green ROI representing the tumor.
The AUROC from single renal features, single tumor features, and renal–tumor features, as shown in Fig. 3a, b, was 0.79 ± 0.1, 0.84 ± 0.1, and 0.87 ± 0.1 for the classification of Ki67, with an accuracy of 0.71, 0.78 and 0.81 at a 95% confidence level, respectively. For the internal validation, the AUROC of the low vs. high Ki67 prediction was 0.75 ± 0.1, 0.75 ± 0.1, 0.83 ± 0.1, 0.77 ± 0.1 and 0.87 ± 0.1, with an accuracy of 0.67, 0.70, 0.71, 0.70 and 0.82 by the fivefold cross-validation, respectively (Fig. 3c, d). The AUROC from the optimal model was 0.87 ± 0.1 and 0.82 ± 0.1 for low vs. high Ki67 prediction at the internal validation set (Fig. 3e) and external testing set (Fig. 3f), respectively.
Feature contribution evaluation by SHAP values
The contribution of the imaging features to the model's prediction was assessed by computing the SHAP values, which decomposed the decision of the model into the influence of individual features for each sample. The top-20 driver features were visualized in bee-swarm-plots (Fig. 4a) and bar-plots (Fig. 4b) for low vs. high Ki67 prediction. The bee-swarm plot depicts the SHAP values and feature values across the original dataset, with redder dots indicating larger eigenvalues and bluer dots indicating lower eigenvalues, and positive SHAP values signifying a higher likelihood for the corresponding prediction.
In Fig. 4a, b, the tumor first-order 10P and pca 130 were found to be critical in low vs. high Ki67 prediction. To test the noise reduction capability of our model, we conducted a feature elimination experiment by removing features with SHAP values equal to zero and re-training the original model. The results indicated that the model's performance remained comparable, and the AUROC achieved 0.87 at a 95% confidence interval.
Despite significant advancements in the diagnosis and treatment of RCC, the overall prognosis still remains dismal [3, 17]. Excessive cell proliferation is a hallmark of cancer. Ki-67, a nuclear protein, serves as a critical marker for assessing tumor proliferation status, and its expression level has significant implications for tumor biology, treatment response, and patient prognosis. At present, pathology is the gold standard for determining tumor Ki-67 expression [4,5,6]. However, biopsy procedures are invasive and pose risks for dissemination, and are generally not recommended. In clinical practice, noninvasive diagnostic criteria for RCC include imaging signs of corticomedullary phase enhancement and contouring in the nephrographic phase on CT or MRI scans. If Ki-67 expression in RCC can be assessed using imaging modalities, it would offer valuable information to clinicians for making individualized treatment decisions, which is paramount for patient prognosis.
Medical imaging is commonly acknowledged as a product of genetic and molecular-level processes [18, 19]. Consequently, the implementation of artificial intelligence techniques to extract feature from medical images can shed light on the molecular and genotypic foundation of tissues to some degree [20,21,22]. Many researchers have endeavored to investigate the relationship between CT features and Ki-67 expression level in cancers. The studies of Wu et al. showed that CT texture analysis based on machine learning might be a credible quantitative strategy to predict the Ki67 expression level in hepatocellular carcinoma . Gu et al. found that a CT-based radiomics model could predict a high Ki67 expression level of non-small cell lung cancer . In gliomas, CT features have been found to exhibit a significant correlation with the Ki-67 index . However, the value of features of CT images based on machine learning for predicting the Ki67 expression level of RCC remains uncertain.
In this study, we built a comprehensive machine learning-based approach that includes image processing, semantic segmentation, multi-scale features extraction, and Ki-67 expression level prediction to provide a fully automated analysis framework. Furthermore, to enhance the decision-making ability of the visualization model, we had quantified the impact of each multi-scale features on the model decision using SHAP values.
As known, an excessive number of features may result in overfitting in machine learning. We found that only some features made a decisive impact on the decision-making process of the XGBoost model. As shown in Fig. 4a, b, the tumor first-order 10P were found to be critical in low vs. high Ki67 prediction of RCC, likely reflecting significant differences in growth and physical properties between them. As best as our knowledge, the XGBoost algorithm will assign different weights to each feature and perform the feature selection automatically. However, we are still concerned that the absence of feature selection may lead to overfitting problems. To verify that we conducted a feature elimination experiment by removing features with SHAP values equal to zero and re-training the original model. From this result that the AUROC still achieved 0.87, number of features has a small effect on overfitting when using the XGBoost algorithm for model training.
Indeed, a reliable model must not only be able to adapt to any given dataset in real-world scenarios, but also produce consistent and stable results . In our study, we utilized an automatic segmentation-based approach to delineate the kidney and tumor region. This not only significantly reduces the time and cost involved, but also ensures reproducibility for research result while possessing a certain level of generalizability.
Furthermore, our multi-scale features extraction strategy can be seamlessly applied to novel datasets for diverse tasks. Moreover, we employed a fivefold cross-validation approach and a heterogeneous dataset for model validation. Despite the potential risk of overfitting, our model's external test AUROC still achieved an impressive value of 0.84.
Although this study has demonstrated promising results in predicting Ki67 expression levels in substantial RCC patients, there are several limitations that should be addressed in future studies. Firstly, the retrospective and multi-center nature of this study may lead to data heterogeneity and overfitting, which can affect the model's performance. Secondly, while the model can predict low and high Ki67 expression levels, the impact of this prediction on patient outcomes remains unknown. Long-term follow-up studies and prospective studies are necessary to evaluate the clinical significance of Ki67 expression levels predicted by the model. Finally, further research is needed to optimize and improve the accuracy of the model, potentially through the incorporation of additional imaging features or the use of deep learning algorithms.
In conclusion, our study suggests that the proposed automatic analysis framework is capable of predicting the Ki67 expression levels in substantial RCC patients automatically, noninvasively, and dynamically. This prediction can serve as a valuable reference for clinical treatment decisions.
Availability of data and materials
The original images and data used in this study are available from the corresponding author by request.
Area under the receiver operating characteristic curve
Renal cell carcinoma
Receiver operating characteristic
Regions of interest
The Shapley additive explanations
Ljungberg B, Albiges L, Abu-Ghanem Y et al (2022) European association of urology guidelines on renal cell carcinoma: the 2022 update. Eur Urol. https://doi.org/10.1016/j.eururo.2022.03.006:399-410
Ferlay J, Colombet M, Soerjomataram I et al (2018) Cancer incidence and mortality patterns in Europe: estimates for 40 countries and 25 major cancers in 2018. Eur J Cancer 103:356–387
Dudani S, de Velasco G, Wells JC et al (2021) Evaluation of clear cell, papillary, and chromophobe renal cell carcinoma metastasis sites and association with survival. JAMA Netw Open 4:e2021869
Krabbe LM, Bagrodia A, Haddad AQ et al (2015) Multi-institutional validation of the predictive value of Ki-67 in patients with high grade urothelial carcinoma of the upper urinary tract. J Urol 193:1486–1493
Xie Y, Chen L, Ma X et al (2017) Prognostic and clinicopathological role of high Ki-67 expression in patients with renal cell carcinoma: a systematic review and meta-analysis. Sci Rep 7:44281
Virman JP, Bono P, Luukkaala TH, Sunela KL, Kujala PM, Kellokumpu-Lehtinen PL (2016) Combined angiogenesis and proliferation markers’ expressions as long-term prognostic factors in renal cell cancer. Clin Genitourin Cancer 14:e283-289
Xv Y, Lv F, Guo H et al (2021) Machine learning-based CT radiomics approach for predicting WHO/ISUP nuclear grade of clear cell renal cell carcinoma: an exploratory and comparative study. Insights Imaging 12:170
Su C, Jiang J, Zhang S et al (2019) Radiomics based on multicontrast MRI can precisely differentiate among glioma subtypes and predict tumour-proliferative behaviour. Eur Radiol 29:1986–1996
Wu H, Han X, Wang Z et al (2020) Prediction of the Ki-67 marker index in hepatocellular carcinoma based on CT radiomics features. Phys Med Biol 65:235048
Acar E, Turgut B, Yigit S, Kaya G (2019) Comparison of the volumetric and radiomics findings of 18F-FDG PET/CT images with immunohistochemical prognostic factors in local/locally advanced breast cancer. Nucl Med Commun 40:764–772
Gu Q, Feng Z, Liang Q et al (2019) Machine learning-based radiomics strategy for prediction of cell proliferation in non-small cell lung cancer. Eur J Radiol 118:32–37
Watson DS, Krutzinna J, Bruce IN et al (2019) Clinical applications of machine learning algorithms: beyond the black box. BMJ 364:12
Isensee F, Jaeger PF, Kohl SA, Petersen J, Maier-Hein KH (2021) nnU-Net: a self-configuring method for deep learning-based biomedical image segmentation. Nat Methods 18:203–211
Van Griethuysen JJ, Fedorov A, Parmar C et al (2017) Computational radiomics system to decode the radiographic phenotype. Can Res 77:104–107
Chen T, Guestrin C (2016) Xgboost: a scalable tree boosting system. In: Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining, pp 785–794
Lundberg SM, Erion G, Chen H et al (2020) From local explanations to global understanding with explainable AI for trees. Nat Mach Intell 2:56–67
Motzer RJ, Jonasch E, Michaelson MD et al (2019) NCCN Guidelines insights: kidney cancer, version 2.2020. J Natl Compr Canc Netw 17:1278–1285
Pinker K, Chin J, Melsaether AN, Morris EA, Moy L (2018) Precision medicine and radiogenomics in breast cancer: new approaches toward diagnosis and treatment. Radiology 287:732–747
Wu P, Wu K, Li Z et al (2023) Multimodal investigation of bladder cancer data based on computed tomography, whole slide imaging, and transcriptomics. Quant Imaging Med Surg 13:1023–1035
Shinagare AB, Krajewski KM, Braschi-Amirfarzan M, Ramaiya NH (2017) Advanced renal cell carcinoma: role of the radiologist in the era of precision medicine. Radiology 284:333–351
McKinney SM, Sieniek M, Godbole V et al (2020) International evaluation of an AI system for breast cancer screening. Nature 577:89–94
Considine B, Hurwitz ME (2019) Current status and future directions of immunotherapy in renal cell carcinoma. Curr Oncol Rep 21:34
Wu K, Wu P, Yang K et al (2022) A comprehensive texture feature analysis framework of renal cell carcinoma: pathological, prognostic, and genomic evaluation based on CT images. Eur Radiol 32:2255–2265
We thank the local hospital for their data to this project.
This work was supported by the National Natural Science Foundation Fund of China (No. 61931024), Shenzhen Science and Technology Program (No. JCYJ20220818100015031 and No. RCJC20200714114557005), and the Shenzhen Municipal Science and Technology Innovation Commission (No. JSGG20180712090411521).
Ethical approval and consent to participate
This study was approved by the institutional research ethics committee. All data of participate used were acquired with institutional review board-approved protocols. Informed consent documents are waived by the Institutional Review Board.
Consent for publication
No personal data or any identifiable statement beyond images are used in the manuscript.
The authors of this manuscript declare no relationships with any companies, whose products or services may be related to the subject matter of the article.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Method. Fig. S1. The multi-scale features extractor. 200 Radiomics features were integrated with 512 PCA and 128 SVD features as the 840 combined features to the classifier.
About this article
Cite this article
Yang, H., Lin, J., Liu, H. et al. Automatic analysis framework based on 3D-CT multi-scale features for accurate prediction of Ki67 expression levels in substantial renal cell carcinoma. Insights Imaging 14, 130 (2023). https://doi.org/10.1186/s13244-023-01465-y
- Renal cell carcinoma
- Multi-scale features