Skip to main content

A systematic review and meta-analysis of CT and MRI radiomics in ovarian cancer: methodological issues and clinical utility



We aimed to present the state of the art of CT- and MRI-based radiomics in the context of ovarian cancer (OC), with a focus on the methodological quality of these studies and the clinical utility of these proposed radiomics models.


Original articles investigating radiomics in OC published in PubMed, Embase, Web of Science, and the Cochrane Library between January 1, 2002, and January 6, 2023, were extracted. The methodological quality was evaluated using the radiomics quality score (RQS) and Quality Assessment of Diagnostic Accuracy Studies 2 (QUADAS-2). Pairwise correlation analyses were performed to compare the methodological quality, baseline information, and performance metrics. Additional meta-analyses of studies exploring differential diagnoses and prognostic prediction in patients with OC were performed separately.


Fifty-seven studies encompassing 11,693 patients were included. The mean RQS was 30.7% (range − 4 to 22); less than 25% of studies had a high risk of bias and applicability concerns in each domain of QUADAS-2. A high RQS was significantly associated with a low QUADAS-2 risk and recent publication year. Significantly higher performance metrics were observed in studies examining differential diagnosis; 16 such studies as well as 13 exploring prognostic prediction were included in a separate meta-analysis, which revealed diagnostic odds ratios of 25.76 (95% confidence interval (CI) 13.50–49.13) and 12.55 (95% CI 8.38–18.77), respectively.


Current evidence suggests that the methodological quality of OC-related radiomics studies is unsatisfactory. Radiomics analysis based on CT and MRI showed promising results in terms of differential diagnosis and prognostic prediction.

Critical relevance statement

Radiomics analysis has potential clinical utility; however, shortcomings persist in existing studies in terms of reproducibility. We suggest that future radiomics studies should be more standardized to better bridge the gap between concepts and clinical applications.

Graphical abstract

Key points

  • The methodological quality of current radiomics studies concerning ovarian cancer was unsatisfactory.

  • Meta-analyses showed high diagnostic odds ratios regarding differential diagnosis and prognostic prediction.

  • Radiomics analysis in ovarian cancer holds promise for clinical applications.

  • More standardization should be required for radiomics studies.


Ovarian cancer (OC) is the most lethal gynecological cancer and the fifth-leading cause of cancer-related deaths among women; there were 313,959 newly diagnosed cases and 207,252 deaths worldwide in 2020 [1, 2]. The most recent cancer statistics report indicates that approximately 19,710 new cases of OC will be diagnosed in the USA in 2023, and 13,270 women will die from the disease [2]. The World Health Organization classification of tumors divides OC into dozens of pathological types [3]; furthermore, 70% of patients with OC are diagnosed at an advanced stage, leading to worse outcomes (their 5-year overall survival rates are 20–30% versus 80–95% for those diagnosed at early stages) [4,5,6]. Treatment involves surgery and chemotherapy and depends on the pathological type and International Federation of Gynaecology and Obstetrics stage [5, 7]. Notwithstanding advances in the diagnosis and treatment of OC, however, mortality rates have not changed appreciably in the last 30 years [2, 8,9,10,11,12]. This is partly due to the difficulty of early detection and the lack of effective therapeutic options for patients in advanced stages.

Computed tomography (CT) and magnetic resonance imaging (MRI) are essential for diagnosing and staging OC [5] and are invaluable for assessing chemotherapy response [13]. However, conventional imaging interpretations rely on the skills of radiologists, and variabilities among reports inevitably exist. Fortunately, medical imaging is advancing toward more standardized, specialized, and quantitative approaches, contributing to greater consistency and communication among radiologists. With this evolution, radiologists are shifting from conventional free-text reporting to structured reporting, enabling more accurate and efficient analysis of the extensive volumes of imaging data [14, 15]. Radiomics is a rapidly emerging field that quantitatively analyzes medical images utilizing artificial intelligence; based on high-throughput mining of quantitative image features, radiomics analysis generates unique markers that may be visually indiscernible yet can support clinical decision-making and increase diagnostic and prognostic accuracy [8, 16, 17].

In recent years, extensive studies using radiomics methods based on CT and MRI have linked quantitative image features to diagnosis, response evaluation, and prognostic prediction in patients with OC [8, 17, 18]. Nevertheless, it has remained difficult to assess the clinical value of radiomics in OC owing to the complexity of the methods and varying study designs. Therefore, we performed this study with two main aims: First, we evaluated the methodological quality of existing studies using the ‘quality of diagnostic accuracy studies-2’ (QUADAS-2) tool as well as the ‘radiomics quality score’ (RQS) [16, 19]. Second, we conducted a meta-analysis to determine the diagnostic performance of radiomics in patients with OC.

Materials and methods

Evidence acquisition protocol and registry

This systematic review was conducted according to the Preferred Reporting Items for Systematic Reviews and Meta-analysis (PRISMA) statement [20] (Additional file 1: Table S1). A review protocol is available through the International Prospective Register of Systematic Reviews (PROSPERO) (CRD 42022313519).

Literature search and study selection

A structured search was performed independently by two reviewers (M.H. and J.R.) with 5 and 3 years of experience in gynecological imaging interpretation, respectively. Any disagreement was arbitrated by a third reviewer (Y.H.) with 12 years of experience, which was performed in 1.9% of the studies (8/418). The reviewers independently screened the titles, abstracts, and full texts of the extracted articles; uncertainties were discussed and resolved by consensus. Detailed search strategies and selection criteria are described in Additional file 1.

Data extraction and methodological quality assessment

After selecting the relevant studies, the two reviewers developed a data extraction instrument (described in Additional file 1: Table S2). After independently reading the full text of each eligible article, they documented: (1) bibliographical information, (2) baseline study information (including study design, imaging technique parameters, and modeling information), and (3) model performance metrics. The extracted information was recorded using the Excel software (Microsoft Corp., Redmond, WA, USA). The methodological quality of the eligible articles was independently assessed by the two reviewers using the RQS (described in Additional file 1: Table S3) [16] and Quality Assessment of QUADAS-2 tools [19]. Each of the 16 key components identified by the RQS was rated, resulting in a total score ranging from − 8 to 36 points; these were converted to RQS percentages, with − 8 to 0 points defined as 0% and 36 points defined as 100% [16]. The QUADAS-2 tool comprises seven assessment items reflecting four domains: ‘patient selection’, ‘index test’, ‘reference standard’, and ‘flow and timing’. Each item was judged as “low,” “high,” or “unclear” based on responses to signaling questions on the risk of bias and applicability concerns [19]. A summed RQS rating calculated by averaging the scores of the two reviewers was determined for each study. For QUADAS-2 assessment, any disagreement was arbitrated with a third reviewer (H.Y.) to achieve a common appraisal for each item. To facilitate additional analysis, a final risk assessment was performed for each study as follows: studies in which all seven items were rated “low” were defined as “low risk”, studies in which at least one item was rated “high” were defined as “high risk”, and the remainder were defined as “unclear risk”.

Data synthesis and analysis

Statistical analysis was conducted using the SPSS software version (IBM Corp., Armonk, NY, USA), Review Manager (RevMan) version 5.3, and R (version 4.0.5; R Foundation for Statistical Computing) incorporating the ‘tidyverse’ packages. Categorical variables are presented as numbers and percentages, while continuous variables are presented as means and standard deviations or as medians and ranges. The inter-rater agreement for the RQS and QUADAS-2 was determined using Cohen’s kappa [21] and the ratio of agreements [21, 22]. An interclass correlation coefficient (ICC) of ≤ 0.4 was considered poor, 0.4–0.75 moderate, and > 0.75 good. Only training set performance metrics of the proposed radiomics models were recorded, even if validation sets existed, to maintain consistency among studies. Pairwise correlation analyses between methodological quality, baseline information, and performance metrics were conducted. Correlations between numeric variables were evaluated using linear regression analyses, while associations between numeric and categorical variables were assessed using unpaired Student’s t-tests or Mann–Whitney tests. The significance level α was 0.05, and statistical significance was set at a two-tailed p-value of < 0.05.

Meta-analyses were performed using the STATA software version 17.0 (StataCorp LP, College Station, TX, USA) with the ‘midas’ package when a sufficient number of studies attempted to address a similar question, and two-by-two tables could be extracted or reconstructed based on published data (the details are shown in Additional file 1). The sensitivity, specificity, positive likelihood ratio (PLR), negative likelihood ratio (NLR), and diagnostic odds ratio (DOR) with 95% confidence intervals (CIs) were calculated. Summary receiver operating characteristic (SROC) analysis was performed, and the areas under the curve (AUCs) were obtained to describe the diagnostic accuracy. If a particular study involved multiple models, only the radiomics model was selected. Heterogeneity was assessed using Cochrane’s Q test and Higgin’s inconsistency index (I2) test. Any heterogeneity was considered significant if the p-value on Cochran’s Q test was < 0.05, whereupon the random effects model was used. Higgins I2 values of < 25%, 25–50%, and > 50% were associated with low, moderate, and substantial heterogeneity, respectively. In cases of the latter, meta-regression was performed to explore the possible sources of heterogeneity. Deeks’ funnel plots were constructed to illustrate the risk of publication bias.


The study selection process is illustrated in Fig. 1; 57 articles were ultimately included in the systematic review [23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73,74,75,76,77,78,79]. We divided these publications based on the studies’ main objectives into three categories: differential diagnosis (24/57, 42.1%), response evaluation (4/57, 7.0%), and prediction of prognosis (28/57, 49.1%). Additionally, one paper (1.8%) described both differential diagnosis and prognostic prediction. We found that assessments of differential diagnosis and prognostic prediction were both commonly performed; thus, 16 articles focusing on the differential diagnosis of OC [24,25,26, 28,29,30,31,32, 34, 35, 39, 40, 42, 44, 45, 47] and 13 that described studies on prognostic factors [52, 53, 5557, 64, 66,67,68, 71,72,73,74] were subjected to separate meta-analyses.

Fig. 1
figure 1

Flowchart of the study selection process for the systematic review and meta-analysis

Study characteristics

There were between 28 and 1329 patients in each study; their median or mean ages ranged from 38.5 to 75 years. The studies’ baseline information and characteristics are shown in Tables 1 and 2, respectively. Nearly half of the studies were published in imaging journals (28/57, 49.1%); the first authors were mainly specialized in radiology (36/57, 63.2%) and most came from Asia (43/57, 75.4%). Thirty-seven studies used CT and 20 used MRI; most applied the manual segmentation method and three-dimensional analysis. According to the model classification method proposed by the TRIPOD statement [80], a plurality of the studies established developed models validated with random splitting of data (27/57, 47.4%), followed by developed models validated using exactly the same data (10/57, 17.5%) and separate data (9/57, 15.8%). Furthermore, using Zhong et al. method [22], 59.6% of the studies were classified as phase 0 owing to their lack of external validation. Two phase III studies were conducted without sufficient patients or lacked external validation; however, they retained their phase categories owing to their prospective designs. Detailed characteristics of each study’s population and proposed radiomics model are presented in Additional file 1: Tables S4 and S5.

Table 1 Baseline information of studies included in the systematic review
Table 2 Characteristics of included studies

Methodological quality assessment

The 57 studies had a mean RQS of 12 (interquartile range 10–14), with RQS values ranging from − 4 to 22. The mean percentage RQS was 30.7%, with a maximum of 61.1%. The average ICC for the RQS was 0.80 (95% confidence interval (CI) 0.69–0.91), i.e., a ‘good’ rating. The average rating and inter-rater agreement per RQS component are shown in Table 3. In most studies, the lack of reproducibility and repeatability analysis of imaging features (e.g., phantom study and imaging at multiple time points), insufficient cost-effectiveness analyses, and inadequate access to the data led to a low RQS. Additionally, biological correlation and cutoff analyses were seldom performed in the aftermath, and the rated validation score (2 [2]) was suboptimal. Discrimination statistics to assess model performance were available for all studies; 21 of them also reported calibration statistics. Fifty-five studies (96.5%) were retrospective analyses, while only two (3.5%) were based on prospectively acquired data. The RQS assessments by each reviewer (M.H. and J.R.) are shown in Additional file 1: Table S6.

Table 3 Average rating and inter-rater agreement per component of RQS

The summarized and individual QUADAS-2 results are presented in Fig. 2 and Additional file 1: Table S7. A high risk of publication bias (19.3%) and applicability concerns (22.8%) were observed in terms of patient selection, mainly because of inappropriate exclusion. Twelve studies (21.1%) were regarded as having an unclear risk of bias in flow and timing, as they did not provide sufficient information regarding the interval between index tests and reference standards. Complete agreement between the two reviewers concerning the seven QUADAS-2 items ranged from 80.7 to 96.5%.

Fig. 2
figure 2

Quality assessment of included studies by QUADAS-2 tool. The authors’ judgments for each domain of each included study were reviewed. The proportion of included studies that indicated low, unclear, high risk, and applicability concerns were shown in green, yellow and gray, respectively

Pairwise correlation analysis of methodological quality, baseline information, and performance metrics

The pairwise correlation analysis results are presented in Fig. 3. One article that discussed both differential diagnosis and prognostic prediction was analyzed twice as two separate studies. The RQS values were significantly different between studies with different QUADAS-2 risk assessments (p = 0.011). Importantly, studies deemed low risk had a higher mean RQS than those with high or unclear risk; the difference between low-risk and unclear-risk studies was significant (p = 0.002). The study aim (p = 0.180) and specialty of the first author (p = 0.520) did not influence the RQS rating. Studies published more recently received significantly higher scores (adjusted R2 = 0.264, p < 0.001).

Fig. 3
figure 3

Correlations between radiomics quality score (RQS), performance metrics and baseline information. ad Correlation between RQS and QUADAS-2, study aim, first author, and publication year. The vertical dashed line corresponds to the year of publication of the RQS. eh Correlation between performance metrics and RQS, study aim, number of patients, and number of features. Each point corresponds to a study. The regression line and its 95% confidence interval are shown in gray with adjusted R2 and p-value

The performance metrics were represented by the AUC in 49 studies (86.0%) and by the c-index in three (5.3%); they were missing in the remaining five (8.8%). Two of the studies explored two radiomics models with different aims; hence, their performance metrics were documented separately. The best radiomics model used in each study had an AUC or c-index value between 0.620 and 1.000. Moreover, the performance of the radiomics model was closely related to the study aim (p < 0.001); performance was significantly better in studies exploring differential diagnosis than in those evaluating response (p = 0.012) or predicting prognosis (p < 0.001). No significant correlation was found between performance metrics and RQS (adjusted R2 = − 0.019, p = 0.892), sample size (adjusted R2 = − 0.019, p = 0.938), or number of features extracted (adjusted R2 = 0.046, p = 0.068).


Sixteen studies that focused on differential diagnosis and 13 that investigated prognostic prediction were subjected to a meta-analysis. For studies of differential diagnosis, the pooled diagnostic odds ratio (DOR) was 25.76 (95% CI 13.50–49.13) (Fig. 4a); the pooled sensitivity, specificity, positive likelihood ratio (PLR), and negative likelihood ratio (NLR) were 0.84 (95% CI 0.76–0.89), 0.83 (95% CI 0.77–0.88), 5.00 (95% CI 3.58–6.97), and 0.19 (95% CI 0.13–0.30), respectively (Additional file 1: Figs. S1a and Fig. S2a). The SROC curve suggested good diagnostic performance with an AUC of 0.90 (95% CI 0.87–0.92) (Additional file 1: Fig. S3a). For studies analyzing prognostic prediction, the pooled DOR, sensitivity, specificity, PLR, and NLR were 12.55 (95% CI 8.38–18.77), 0.78 (95% CI 0.71–0.83), 0.78 (95% CI 0.72–0.82), 3.59 (95% CI 2.80–4.59), and 0.29 (95% CI 0.22–0.37), respectively (Fig. 4b, Additional file 1: Figs. S1b, and S2b). The AUC calculated from the SROC curve was 0.85 (95% CI 0.81–0.88), indicating good performance of the prognostic prediction models (Additional file 1: Fig. S3b). Per Deeks’ funnel plots, the likelihood of publication bias was low for differential diagnosis studies (p = 0.760) but high for prognostic prediction studies (p = 0.040) (Additional file 1: Fig. S4).

Fig. 4
figure 4

Forrest plot of the effect size calculated as diagnostic odds ratio for studies discussing the diagnostic accuracy of radiomics models in the a differential diagnosis; b prognosis prediction of ovarian tumors. The numbers are pooled estimates with 95% CIs in parentheses; horizontal lines indicate 95% CIs. TP, FP, FN, and TN were defined according to the original articles' descriptions

Cochrane’s Q test (p < 0.001 and p < 0.001) and Higgin’s I2 test (I2 = 94% and I2 = 91%) indicated high heterogeneity; therefore, a meta-regression analysis was conducted to identify its source (Table 4 and Additional file 1: Fig. S5). The number of patients was significantly associated with heterogeneity in specificity (p < 0.001) for differential diagnosis studies and contributed to heterogeneity in sensitivity (p < 0.001), specificity (p < 0.001), and the AUC (p = 0.04) in prognostic prediction studies. Additionally, the imaging method (CT vs. MRI) influenced heterogeneity in both sensitivity (p < 0.001 for differential diagnosis and p < 0.01 for prognostic prediction studies) and specificity (p < 0.01 for both study types). The heterogeneity of sensitivity and specificity in the differential diagnosis subgroup was associated with the type of region of interest (p = 0.04 for sensitivity and p < 0.01 for specificity). In the prognostic prediction subgroup, however, the region of interest type influenced AUC (p = 0.03), sensitivity (p < 0.001), and specificity (p < 0.01). Meanwhile, the type of features only contributed to heterogeneity in specificity (p < 0.01 for both differential diagnosis and prognostic prediction).

Table 4 The results of meta-regression analysis of studies about differential diagnosis and prognosis prediction of OC


Our systematic review found that the methodological quality of CT and MRI radiomics for patients with OC was relatively low, while our meta-analysis revealed that radiomics has promising potential in discriminating between OC subtypes and predicting patient prognosis.

Several narrative reviews described the prospects of applying radiomics in OC [8, 17, 81, 82]. Rizzo et al.’s systematic review [83] included six studies that lacked RQS ratings and meta-analyses, whereas that by Ponsiglione et al. [18] used the RQS to evaluate the methodological quality of studies involving CT-, MRI-, ultrasonography-, or positron emission tomography-based radiomics in ovarian disorders published as of November 2021. The field of radiomics is in a period of rapid growth; 42.1% of the studies we analyzed were published in 2022 or 2023. Aside from the RQS, QUADAS-2 was applied to assess potential publication bias and applicability concerns. While previously published reviews usually describe only qualitative analyses, we performed quantitative evaluation of the effect of radiomics models by conducting a meta-analysis; our mean RQS (30.7%) was acceptable when compared to those published previously (5.6–36.1%) [18, 22, 84-87]. However, the overall scientific quality was still unsatisfactory, given that the assessment values were considerably below 100%. The main reasons for the low RQS ratings, as in previous analyses [22, 86, 88], included low levels of feature robustness, insufficient model assessment, and lack of concern for clinical applications.

The primary challenge for feature robustness was the high variability in radiomics features; few of the publications we analyzed described phantom studies [74, 77, 78], multiple time-point imaging, or automatic segmentation [49,50,51]. Regarding model assessment, radiomics studies usually included discrimination statistics; however, cutoff analyses and calibration statistics were often neglected, thus complicating the risk evaluation of overly optimistic results and accuracy of model predictions. Classification indexes including sensitivity and specificity, which are critical for quantitative analysis, were lacking in some studies. Our results suggest the need for performing additional technical validation before considering radiomics models for clinical applications.

Although most studies involved internal validation (which is indispensable for the clinical translation and broad application of radiomics models), the absence of external validation at several centers undermined the credibility and generalizability of the models. Very few prospective studies [37, 66], which are considered to have a high level of evidence, were performed. Furthermore, analysis of cost-effectiveness was absent from all our included studies, and the lack of decision curve analysis in approximately two-thirds of them also obscured the applicability of the models to clinical settings. As such, technical and clinical validations remain critical unmet requirements for incorporating radiomics analysis into clinical applications.

The importance of open science is emphasized in the RQS ratings to ensure the transparency and reproducibility of research findings, but such access was lacking in our studies. As such, we suggest that researchers should at least include the computed numerical values of any investigated features in their publications.

In terms of correlations between the RQS and original variables, there was a significant relationship between RQS and publication year. More modern studies presumably have a larger sample size, collect data from multiple centers, use a prospective design, and/or apply more strict inclusion and exclusion criteria, thereby contributing to a higher RQS. Additionally, a higher RQS was significantly correlated with low-level risk according to QUADAS-2, which may support using the former to improve research quality. We also found that studies exploring differential diagnosis had significantly higher performance metrics, whereas studies of response evaluation or prognostic prediction may be more affected by non-relevant factors, such as lifestyle and other medications, given that they have longer follow-up periods. Contrary to our expectation, we did not find a significant relationship between performance metrics and the number of features extracted, which could be explained by the heterogeneity in feature selection and modeling.

The results of our meta-analysis were promising; the pooled AUCs reached 0.90 for differential diagnosis and 0.85 for prognostic prediction, indicating a relatively high accuracy in categorizing different pathological subtypes and predicting the prognoses of patients with OC. According to Cochrane’s Q and Higgin’s I2 tests, studies included in the meta-analysis had high levels of heterogeneity, which a meta-regression analysis attributed to the influence of patient number, imaging modality, region of interest, and feature type.

Our study had some limitations. First, most included articles did not report the numbers of true/false positives or true/false negatives, and our calculation of these data from the available information might have introduced some errors. Second, studies predicting chemotherapy response were not included in the meta-analysis owing to insufficient data. Third, studies involved in the meta-analysis showed high heterogeneity; although we found significant correlations between such heterogeneity and certain factors, the latter may not have sufficiently explained the former. Fourth, the prognostic prediction studies included in our meta-analysis had a significant risk of publication bias, likely because we only included English-language articles and also because half of the studies comprised small sample sizes; reluctance to publish negative data may also have been a factor.

In conclusion, radiomics analysis showed promise in terms of overcoming some current obstacles in determining differential diagnosis, chemotherapy response, and prognosis in patients with OC. Pairwise correlation analysis revealed a significant relationship between RQS and QUADAS-2 result or publication year, as well as between performance metrics and study aims. Additionally, our meta-analysis demonstrated the suitability of radiomics analysis for discriminating between various subtypes of OC and identifying prognostic factors through quantitative analysis. Given the generally low RQS ratings of all the included studies, the methodological quality of radiomics studies involving OC is lower than desirable; hence, more high-level evidence is required to develop effective radiomics models.

Availability of data and materials

All data generated or analyzed during this study are included in this published article [and its Additional files].



Area under the curve


Confidence interval


Diagnostic odds ratio


Interclass correlation coefficient


Negative likelihood ratio


Ovarian cancer


Positive likelihood ratio


Preferred Reporting Items for Systematic Reviews and Meta-analysis


International Prospective Register of Systematic Reviews


Quality Assessment of Diagnostic Accuracy Studies 2


Radiomics quality score


Summary receiver operating characteristic


  1. Sung H, Ferlay J, Siegel RL et al (2021) Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin 71:209–249

    Article  PubMed  Google Scholar 

  2. Siegel RL, Miller KD, Wagle NS, Jemal A (2023) Cancer statistics, 2023. CA Cancer J Clin 73:17–48

    Article  PubMed  Google Scholar 

  3. WHO classification of Tumours Editorial Board (2020) WHO classification of tumours. In: Female genital tumours, 5th ed. IARC Press, Lyon

  4. Park HK, Ruterbusch JJ, Cote ML (2017) Recent trends in ovarian cancer incidence and relative survival in the United States by race/ethnicity and histologic subtypes. Cancer Epidemiol Biomark Prev 26:1511–1518

    Article  Google Scholar 

  5. National Comprehensive Cancer Network (2023) Ovarian cancer (Version 1.2023). Available via Accessed 3 Jan 2023

  6. Peres LC, Cushing-Haugen KL, Köbel M et al (2019) Invasive epithelial ovarian cancer survival by histotype and disease stage. J Natl Cancer Inst 111:60–68

    Article  PubMed  Google Scholar 

  7. Gaona-Luviano P, Medina-Gaona LA, Magaña-Pérez K (2020) Epidemiology of ovarian cancer. Chin Clin Oncol 9:47–47

    Article  PubMed  Google Scholar 

  8. Nougaret S, McCague C, Tibermacine H, Vargas HA, Rizzo S, Sala E (2021) Radiomics and radiogenomics in ovarian cancer: a literature review. Abdom Radiol (NY) 46:2308–2322

    Article  CAS  PubMed  Google Scholar 

  9. Lee JM, Minasian L, Kohn EC (2019) New strategies in ovarian cancer treatment. Cancer 125:4623–4629

    Article  CAS  PubMed  Google Scholar 

  10. Fujiwara K, Hasegawa K, Nagao S (2019) Landscape of systemic therapy for ovarian cancer in 2019: primary therapy. Cancer 125:4582–4586

    Article  CAS  PubMed  Google Scholar 

  11. Siegel RL, Miller KD, Fuchs HE, Jemal A (2022) Cancer statistics, 2022. CA Cancer J Clin 72:7–33

    Article  PubMed  Google Scholar 

  12. Arend R, Martinez A, Szul T, Birrer MJ (2019) Biomarkers in ovarian cancer: To be or not to be. Cancer 125:4563–4572

    Article  CAS  PubMed  Google Scholar 

  13. Eisenhauer EA, Therasse P, Bogaerts J et al (2009) New response evaluation criteria in solid tumours: revised RECIST guideline (version 1.1). Eur J Cancer 45:228–247

    Article  CAS  PubMed  Google Scholar 

  14. Faggioni L, Coppola F, Ferrari R, Neri E, Regge D (2017) Usage of structured reporting in radiological practice: results from an Italian online survey. Eur Radiol 27:1934–1943

    Article  PubMed  Google Scholar 

  15. Granata V, Faggioni L, Grassi R et al (2022) Structured reporting of computed tomography in the staging of colon cancer: a Delphi consensus proposal. Radiol Med 127:21–29

    Article  PubMed  Google Scholar 

  16. Lambin P, Leijenaar RTH, Deist TM et al (2017) Radiomics: the bridge between medical imaging and personalized medicine. Nat Rev Clin Oncol 14:749–762

    Article  PubMed  Google Scholar 

  17. Arezzo F, Loizzi V, La Forgia D et al (2021) Radiomics analysis in ovarian cancer: a narrative review. Appl Sci Basel 11:7833

    Article  CAS  Google Scholar 

  18. Ponsiglione A, Stanzione A, Spadarella G et al (2022) Ovarian imaging radiomics quality score assessment: an EuSoMII radiomics auditing group initiative. Eur Radiol.

    Article  PubMed  PubMed Central  Google Scholar 

  19. Whiting PF, Rutjes AW, Westwood ME et al (2011) QUADAS-2: a revised tool for the quality assessment of diagnostic accuracy studies. Ann Intern Med 155:529–536

    Article  PubMed  Google Scholar 

  20. McInnes MDF, Moher D, Thombs BD et al (2018) Preferred reporting items for a systematic review and meta-analysis of diagnostic test accuracy studies. JAMA 319:388

    Article  PubMed  Google Scholar 

  21. Landis JR, Koch GG (1977) The measurement of observer agreement for categorical data. Biometrics 33:159–174

    Article  CAS  PubMed  Google Scholar 

  22. Zhong J, Hu Y, Si L et al (2021) A systematic review of radiomics in osteosarcoma: utilizing radiomics quality score as a tool promoting clinical translation. Eur Radiol 31:1526–1535

    Article  PubMed  Google Scholar 

  23. Zheng Y, Wang H, Li Q, Sun H, Guo L (2022) Discriminating between benign and malignant solid ovarian tumors based on clinical and radiomic features of MRI. Acad Radiol.

    Article  PubMed  Google Scholar 

  24. Zhang A, Hu Q, Ma Z, Song J, Chen T (2022) Application of enhanced computed tomography-based radiomics nomogram analysis to differentiate metastatic ovarian tumors from epithelial ovarian tumors. J Xray Sci Technol 30:1185–1199

    PubMed  Google Scholar 

  25. Xu Y, Luo HJ, Ren J, Guo LM, Niu J, Song X (2022) Diffusion-weighted imaging-based radiomics in epithelial ovarian tumors: assessment of histologic subtype. Front Oncol 12:978123

    Article  PubMed  PubMed Central  Google Scholar 

  26. Wei M, Zhang Y, Bai G et al (2022) T2-weighted MRI-based radiomics for discriminating between benign and borderline epithelial ovarian tumors: a multicenter study. Insights Imaginging 13:130

    Article  Google Scholar 

  27. Wang M, Perucho JAU, Hu Y et al (2022) Computed tomographic radiomics in differentiating histologic subtypes of epithelial ovarian carcinoma. JAMA Netw Open 5:e2245141

    Article  PubMed  PubMed Central  Google Scholar 

  28. Nagawa K, Kishigami T, Yokoyama F et al (2022) Diagnostic utility of a conventional MRI-based analysis and texture analysis for discriminating between ovarian thecoma-fibroma groups and ovarian granulosa cell tumors. J Ovarian Res 15:65

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  29. Liu X, Wang T, Zhang G et al (2022) Two-dimensional and three-dimensional T2 weighted imaging-based radiomic signatures for the preoperative discrimination of ovarian borderline tumors and malignant tumors. J Ovarian Res 15:22–22

    Article  PubMed  PubMed Central  Google Scholar 

  30. Liu P, Liang X, Liao S, Lu Z (2022) Pattern classification for ovarian tumors by integration of radiomics and deep learning features. Curr Med Imaging 18:1486–1502

    Article  PubMed  Google Scholar 

  31. Li S, Liu J, Xiong Y et al (2022) Application values of 2D and 3D radiomics models based on CT plain scan in differentiating benign from malignant ovarian tumors. Biomed Res Int 2022:5952296

    PubMed  PubMed Central  Google Scholar 

  32. Li J, Zhang T, Ma J, Zhang N, Zhang Z, Ye Z (2022) Machine-learning-based contrast-enhanced computed tomography radiomic analysis for categorization of ovarian tumors. Front Oncol 12:934735

    Article  PubMed  PubMed Central  Google Scholar 

  33. Li J, Li X, Ma J, Wang F, Cui S, Ye Z (2022) Computed tomography-based radiomics machine learning classifiers to differentiate type I and type II epithelial ovarian cancers. Eur Radiol.

    Article  PubMed  PubMed Central  Google Scholar 

  34. Zhu H, Ai Y, Zhang J et al (2021) Preoperative nomogram for differentiation of histological subtypes in ovarian cancer based on computer tomography radiomics. Front Oncol 11:642892

    Article  PubMed  PubMed Central  Google Scholar 

  35. Yu X-p, Wang L, Yu H-y et al (2021) MDCT-based radiomics features for the differentiation of serous borderline ovarian tumors and serous malignant ovarian tumors. Cancer Manag Res 13:329–336

    Article  PubMed  PubMed Central  Google Scholar 

  36. Ye R, Weng S, Li Y et al (2021) Texture analysis of three-dimensional MRI images may differentiate borderline and malignant epithelial ovarian tumors. Korean J Radiol 22:106–117

    Article  PubMed  Google Scholar 

  37. Song X-l, Ren J-L, Zhao D, Wang L, Ren H, Niu J (2021) Radiomics derived from dynamic contrast-enhanced MRI pharmacokinetic protocol features: the value of precision diagnosis ovarian neoplasms. Eur Radiol 31:368–378

    Article  CAS  PubMed  Google Scholar 

  38. Park H, Qin L, Guerra P, Bay CP, Shinagare AB (2021) Decoding incidental ovarian lesions: use of texture analysis and machine learning for characterization and detection of malignancy. Abdom Radiol (NY) 46:2376–2383

    Article  PubMed  Google Scholar 

  39. Li S, Liu J, Xiong Y et al (2021) A radiomics approach for automated diagnosis of ovarian neoplasm malignancy in computed tomography. Sci Rep 11:8730

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  40. Li NY, Shi B, Chen YL et al (2021) The value of MRI findings combined with texture analysis in the differential diagnosis of primary ovarian granulosa cell tumors and ovarian thecoma-fibrothecoma. Front Oncol 11:758036

    Article  PubMed  PubMed Central  Google Scholar 

  41. Jian J, Li Y, Pickhardt PJ et al (2021) MR image-based radiomics to differentiate type Iota and type Iota Iota epithelial ovarian cancers. Eur Radiol 31:403–410

    Article  PubMed  Google Scholar 

  42. Hu Y, Weng Q, Xia H et al (2021) A radiomic nomogram based on arterial phase of CT for differential diagnosis of ovarian cancer. Abdom Radiol (NY) 46:2384–2392

    Article  PubMed  Google Scholar 

  43. An H, Wang Y, Wong EMF et al (2021) CT texture analysis in histological classification of epithelial ovarian carcinoma. Eur Radiol 31:5050–5058

    Article  PubMed  Google Scholar 

  44. Qian L, Ren J, Liu A et al (2020) MR imaging of epithelial ovarian cancer: a combined model to predict histologic subtypes. Eur Radiol 30:5815–5825

    Article  CAS  PubMed  Google Scholar 

  45. Lupean R-A, Sefan P-A, Feier DS et al (2020) Radiomic analysis of MRI images is instrumental to the stratification of ovarian cysts. J Personal Med 10:127

    Article  Google Scholar 

  46. Ya Li, Jian J, Pickhardt PJ et al (2020) MRI-based machine learning for differentiating borderline from malignant epithelial ovarian tumors: a multicenter study. J Magn Reson Imag 52:897–904

    Article  Google Scholar 

  47. Zhang H, Mao Y, Chen X et al (2019) Magnetic resonance imaging radiomics in categorizing ovarian masses and predicting clinical outcome: a preliminary study. Eur Radiol 29:3358–3371

    Article  PubMed  Google Scholar 

  48. Rundo L, Beer L, Escudero Sanchez L et al (2022) Clinically interpretable radiomics-based prediction of histopathologic response to neoadjuvant chemotherapy in high-grade serous ovarian carcinoma. Front Oncol 12:2423

    Article  Google Scholar 

  49. Zargari A, Du Y, Heidari M et al (2018) Prediction of chemotherapy response in ovarian cancer patients using a new clustered quantitative image marker. Phys Med Biol 63:155020

    Article  PubMed  PubMed Central  Google Scholar 

  50. Danala G, Thai T, Gunderson CC et al (2017) Applying quantitative CT image feature analysis to predict response of ovarian cancer patients to chemotherapy. Acad Radiol 24:1233–1239

    Article  PubMed  PubMed Central  Google Scholar 

  51. Qiu Y, Tan M, McMeekin S et al (2016) Early prediction of clinical benefit of treating ovarian cancer using quantitative CT image feature analysis. Acta Radiol 57:1149–1155

    Article  PubMed  Google Scholar 

  52. Wan S, Zhou T, Che R et al (2023) CT-based machine learning radiomics predicts CCR5 expression level and survival in ovarian cancer. J Ovarian Res 16:1

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  53. Wu Y, Jiang W, Fu L, Ren M, Ai H, Wang X (2022) Intra- and peritumoral radiomics for predicting early recurrence in patients with high-grade serous ovarian cancer. Abdom Radiol (NY).

    Article  PubMed  Google Scholar 

  54. Wang T, Wang H, Wang Y et al (2022) MR-based radiomics-clinical nomogram in epithelial ovarian tumor prognosis prediction: tumor body texture analysis across various acquisition protocols. J Ovarian Res 15:6

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  55. Lu J, Cai S, Wang F et al (2022) Development of a prediction model for gross residual in high-grade serous ovarian cancer by combining preoperative assessments of abdominal and pelvic metastases and multiparametric MRI. Acad Radiol.

    Article  PubMed  Google Scholar 

  56. Li C, Wang H, Chen Y et al (2022) A nomogram combining MRI multisequence radiomics and clinical factors for predicting recurrence of high-grade serous ovarian carcinoma. J Oncol 2022:1716268

    PubMed  PubMed Central  Google Scholar 

  57. Hu J, Wang Z, Zuo R et al (2022) Development of survival predictors for high-grade serous ovarian cancer based on stable radiomic features from computed tomography images. iScience 25:104628

    Article  PubMed  PubMed Central  Google Scholar 

  58. Hong Y, Liu Z, Lin D et al (2022) Development of a radiomic-clinical nomogram for prediction of survival in patients with serous ovarian cancer. Clin Radiol 77:352–359

    Article  CAS  PubMed  Google Scholar 

  59. Gao L, Jiang W, Yue Q et al (2022) Radiomic model to predict the expression of PD-1 and overall survival of patients with ovarian cancer. Int Immunopharmacol 113:109335

    Article  CAS  PubMed  Google Scholar 

  60. Fotopoulou C, Rockall A, Lu H et al (2022) Validation analysis of the novel imaging-based prognostic radiomic signature in patients undergoing primary surgery for advanced high-grade serous ovarian cancer (HGSOC). Br J Cancer 126:1047–1054

    Article  CAS  PubMed  Google Scholar 

  61. Feng S, Xia T, Ge Y et al (2022) Computed tomography imaging-based radiogenomics analysis reveals hypoxia patterns and immunological characteristics in ovarian cancer. Front Immunol 13:868067

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  62. Boehm KM, Aherne EA, Ellenson L et al (2022) Multimodal data integration using machine learning improves risk stratification of high-grade serous ovarian cancer. Nat Cancer 3:723–733

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  63. Avesani G, Tran HE, Cammarata G et al (2022) CT-based radiomics and deep learning for BRCA mutation and progression-free survival prediction in ovarian cancer using a multicentric dataset. Cancers 14:2739

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  64. Yu XY, Ren J, Jia Y et al (2021) Multiparameter MRI radiomics model predicts preoperative peritoneal carcinomatosis in ovarian cancer. Front Oncol 11:765652

    Article  PubMed  PubMed Central  Google Scholar 

  65. Yi X, Liu Y, Zhou B et al (2021) Incorporating SULF1 polymorphisms in a pretreatment CT-based radiomic model for predicting platinum resistance in ovarian cancer treatment. Biomed Pharmacother 133:111013

    Article  CAS  PubMed  Google Scholar 

  66. Song X-L, Ren J-L, Yao T-Y, Zhao D, Niu J (2021) Radiomics based on multisequence magnetic resonance imaging for the preoperative prediction of peritoneal metastasis in ovarian cancer. Eur Radiol 31:8438–8446

    Article  PubMed  Google Scholar 

  67. Liu M, Ge Y, Li M, Wei W (2021) Prediction of BRCA gene mutation status in epithelial ovarian cancer by radiomics models based on 2D and 3D CT images. BMC Med Imaging.

    Article  PubMed  PubMed Central  Google Scholar 

  68. Li MR, Liu MZ, Ge YQ, Zhou Y, Wei W (2021) Assistance by routine CT features combined With 3D texture analysis in the diagnosis of BRCA gene mutation status in advanced epithelial ovarian cancer. Front Oncol 11:696780

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  69. Li HM, Gong J, Li RM et al (2021) Development of MRI-based radiomics model to predict the risk of recurrence in patients with advanced high-grade serous ovarian carcinoma. AJR Am J Roentgenol 217:664–675

    Article  PubMed  Google Scholar 

  70. Li H, Zhang R, Li R et al (2021) Noninvasive prediction of residual disease for advanced high-grade serous ovarian carcinoma by MRI-based radiomic-clinical nomogram. Eur Radiol 31:7855–7864

    Article  PubMed  Google Scholar 

  71. Chen H-z, Wang X-r, Zhao F-m et al (2021) A CT-based radiomics nomogram for predicting early recurrence in patients with high-grade serous ovarian cancer. Eur J Radiol 145:110018

    Article  PubMed  Google Scholar 

  72. Chen H-z, Wang X-r, Zhao F-m et al (2021) The development and validation of a CT-based radiomics nomogram to preoperatively predict lymph node metastasis in high-grade serous ovarian cancer. Front Oncol 11:3362

    Google Scholar 

  73. Ai Y, Zhang J, Jin J, Zhang J, Zhu H, Jin X (2021) Preoperative prediction of metastasis for ovarian cancer based on computed tomography radiomics features and clinical factors. Front Oncol 11:610742

    Article  PubMed  PubMed Central  Google Scholar 

  74. Veeraraghavan H, Vargas HA, Alejandro-Jimenez S et al (2020) Integrated multi-tumor radio-genomic marker of outcomes in patients with high serous ovarian carcinoma. Cancers 12:3403

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  75. Wei W, Liu Z, Rong Y et al (2019) A computed tomography-based radiomic prognostic marker of advanced high-grade serous ovarian cancer recurrence: a multicenter study. Front Oncol 9:255

    Article  PubMed  PubMed Central  Google Scholar 

  76. Meier A, Veeraraghavan H, Nougaret S et al (2019) Association between CT-texture-derived tumor heterogeneity, outcomes, and BRCA mutation status in patients with high-grade serous ovarian cancer. Abdom Radiol (NY) 44:2040–2047

    Article  PubMed  Google Scholar 

  77. Lu H, Arshad M, Thornton A et al (2019) A mathematical-descriptor of tumor-mesoscopic-structure from computed-tomography images annotates prognostic- and molecular-phenotypes of epithelial ovarian cancer. Nat Commun 10:764

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  78. Rizzo S, Botta F, Raimondi S et al (2018) Radiomics of high-grade serous ovarian cancer: association between quantitative CT features, residual tumour and disease progression within 12 months. Eur Radiol 28:4849–4859

    Article  PubMed  Google Scholar 

  79. Vargas HA, Veeraraghavan H, Micco M et al (2017) A novel representation of inter-site tumour heterogeneity from pre-treatment computed tomography textures classifies ovarian cancers by clinical outcome. Eur Radiol 27:3991–4001

    Article  PubMed  PubMed Central  Google Scholar 

  80. Collins GS, Reitsma JB, Altman DG, Moons KG (2015) Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD statement. Ann Intern Med 162:55–63

    Article  PubMed  Google Scholar 

  81. Martin-Gonzalez P, Crispin-Ortuzar M, Rundo L et al (2020) Integrative radiogenomics for virtual biopsy and treatment monitoring in ovarian cancer. Insights Imaging.

    Article  PubMed  PubMed Central  Google Scholar 

  82. Nougaret S, Tardieu M, Vargas HA et al (2019) Ovarian cancer: an update on imaging in the era of radiomics. Diagn Interv Imaging 100:647–655

    Article  CAS  PubMed  Google Scholar 

  83. Rizzo S, Manganaro L, Dolciami M, Gasparri ML, Papadia A, Del Grande F (2021) Computed Tomography Based Radiomics as a Predictor of Survival in Ovarian Cancer Patients: A Systematic Review. Cancers 13.

  84. Staal FCR, Aalbersberg EA, Van Der Velden D et al (2022) GEP-NET radiomics: a systematic review and radiomics quality score assessment. Eur Radiol.

    Article  PubMed  Google Scholar 

  85. Li Y, Liu Y, Liang Y et al (2022) Radiomics can differentiate high-grade glioma from brain metastasis: a systematic review and meta-analysis. Eur Radiol.

    Article  PubMed  PubMed Central  Google Scholar 

  86. Gao Y, Cheng S, Zhu L et al (2022) A systematic review of prognosis predictive role of radiomics in pancreatic cancer: heterogeneity markers or statistical tricks? Eur Radiol.

    Article  PubMed  PubMed Central  Google Scholar 

  87. Brancato V, Cerrone M, Lavitrano M, Salvatore M, Cavaliere C (2022) A systematic review of the current status and quality of radiomics for glioma differential diagnosis. Cancers 14:2731

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  88. Ursprung S, Beer L, Bruining A et al (2020) Radiomics of computed tomography and magnetic resonance imaging in renal cell carcinoma: a systematic review and meta-analysis. Eur Radiol 30:3558–3566

    Article  PubMed  PubMed Central  Google Scholar 

Download references


This study has received funding by grants from Natural Science Foundation of China (Grant No. 81901829), National High Level Hospital Clinical Research Funding (Grant No. 2022-PUMCH-A-004) and Natural Science Foundation of China (Grant No. 82271886).

Author information

Authors and Affiliations



ZYJ, HDX, YL and YLH contributed to conceptualization; HDX, YLH, MLH and JR were involved in protocol finalization; JR and XYL were involved in technical contributions; JR, XYL and MLH contributed to article selection; YLH was involved in article consensus; MLH, JR and YLH contributed to manuscript writing; MLH, JR and XYL were involved in visualization; HDX, YL and YLH contributed to writing—review and editing; YLH, ZYJ and HDX were involved in supervision. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Yong-Lan He, Yuan Li or Hua-Dan Xue.

Ethics declarations

Ethics approval and consent to participate

Institutional Review Board approval was not required because our data are extracted from existing manuscripts.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1

. Electronic Supplementary Materials.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit

Reprints and Permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Huang, ML., Ren, J., Jin, ZY. et al. A systematic review and meta-analysis of CT and MRI radiomics in ovarian cancer: methodological issues and clinical utility. Insights Imaging 14, 117 (2023).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI:


  • Ovarian neoplasms
  • Differential diagnosis
  • Machine learning
  • Systematic review