A systematic review and meta-analysis of CT and MRI radiomics in ovarian cancer: methodological issues and clinical utility

Huang, Meng-Lin; Ren, Jing; Jin, Zheng-Yu; Liu, Xin-Yu; He, Yong-Lan; Li, Yuan; Xue, Hua-Dan

doi:10.1186/s13244-023-01464-z

Critical Review
Open access
Published: 03 July 2023

A systematic review and meta-analysis of CT and MRI radiomics in ovarian cancer: methodological issues and clinical utility

Meng-Lin Huang¹^na1,
Jing Ren¹^na1,
Zheng-Yu Jin¹,
Xin-Yu Liu¹,
Yong-Lan He ORCID: orcid.org/0000-0003-2567-9710¹,
Yuan Li² &
…
Hua-Dan Xue¹

Insights into Imaging volume 14, Article number: 117 (2023) Cite this article

3103 Accesses
8 Citations
2 Altmetric
Metrics details

Abstract

Objectives

We aimed to present the state of the art of CT- and MRI-based radiomics in the context of ovarian cancer (OC), with a focus on the methodological quality of these studies and the clinical utility of these proposed radiomics models.

Methods

Original articles investigating radiomics in OC published in PubMed, Embase, Web of Science, and the Cochrane Library between January 1, 2002, and January 6, 2023, were extracted. The methodological quality was evaluated using the radiomics quality score (RQS) and Quality Assessment of Diagnostic Accuracy Studies 2 (QUADAS-2). Pairwise correlation analyses were performed to compare the methodological quality, baseline information, and performance metrics. Additional meta-analyses of studies exploring differential diagnoses and prognostic prediction in patients with OC were performed separately.

Results

Fifty-seven studies encompassing 11,693 patients were included. The mean RQS was 30.7% (range − 4 to 22); less than 25% of studies had a high risk of bias and applicability concerns in each domain of QUADAS-2. A high RQS was significantly associated with a low QUADAS-2 risk and recent publication year. Significantly higher performance metrics were observed in studies examining differential diagnosis; 16 such studies as well as 13 exploring prognostic prediction were included in a separate meta-analysis, which revealed diagnostic odds ratios of 25.76 (95% confidence interval (CI) 13.50–49.13) and 12.55 (95% CI 8.38–18.77), respectively.

Conclusion

Current evidence suggests that the methodological quality of OC-related radiomics studies is unsatisfactory. Radiomics analysis based on CT and MRI showed promising results in terms of differential diagnosis and prognostic prediction.

Critical relevance statement

Radiomics analysis has potential clinical utility; however, shortcomings persist in existing studies in terms of reproducibility. We suggest that future radiomics studies should be more standardized to better bridge the gap between concepts and clinical applications.

Graphical abstract

Key points

The methodological quality of current radiomics studies concerning ovarian cancer was unsatisfactory.
Meta-analyses showed high diagnostic odds ratios regarding differential diagnosis and prognostic prediction.
Radiomics analysis in ovarian cancer holds promise for clinical applications.
More standardization should be required for radiomics studies.

Introduction

Ovarian cancer (OC) is the most lethal gynecological cancer and the fifth-leading cause of cancer-related deaths among women; there were 313,959 newly diagnosed cases and 207,252 deaths worldwide in 2020 [1, 2]. The most recent cancer statistics report indicates that approximately 19,710 new cases of OC will be diagnosed in the USA in 2023, and 13,270 women will die from the disease [2]. The World Health Organization classification of tumors divides OC into dozens of pathological types [3]; furthermore, 70% of patients with OC are diagnosed at an advanced stage, leading to worse outcomes (their 5-year overall survival rates are 20–30% versus 80–95% for those diagnosed at early stages) [4,5,6]. Treatment involves surgery and chemotherapy and depends on the pathological type and International Federation of Gynaecology and Obstetrics stage [5, 7]. Notwithstanding advances in the diagnosis and treatment of OC, however, mortality rates have not changed appreciably in the last 30 years [2, 8,9,10,11,12]. This is partly due to the difficulty of early detection and the lack of effective therapeutic options for patients in advanced stages.

Computed tomography (CT) and magnetic resonance imaging (MRI) are essential for diagnosing and staging OC [5] and are invaluable for assessing chemotherapy response [13]. However, conventional imaging interpretations rely on the skills of radiologists, and variabilities among reports inevitably exist. Fortunately, medical imaging is advancing toward more standardized, specialized, and quantitative approaches, contributing to greater consistency and communication among radiologists. With this evolution, radiologists are shifting from conventional free-text reporting to structured reporting, enabling more accurate and efficient analysis of the extensive volumes of imaging data [14, 15]. Radiomics is a rapidly emerging field that quantitatively analyzes medical images utilizing artificial intelligence; based on high-throughput mining of quantitative image features, radiomics analysis generates unique markers that may be visually indiscernible yet can support clinical decision-making and increase diagnostic and prognostic accuracy [8, 16, 17].

In recent years, extensive studies using radiomics methods based on CT and MRI have linked quantitative image features to diagnosis, response evaluation, and prognostic prediction in patients with OC [8, 17, 18]. Nevertheless, it has remained difficult to assess the clinical value of radiomics in OC owing to the complexity of the methods and varying study designs. Therefore, we performed this study with two main aims: First, we evaluated the methodological quality of existing studies using the ‘quality of diagnostic accuracy studies-2’ (QUADAS-2) tool as well as the ‘radiomics quality score’ (RQS) [16, 19]. Second, we conducted a meta-analysis to determine the diagnostic performance of radiomics in patients with OC.

Materials and methods

Evidence acquisition protocol and registry

This systematic review was conducted according to the Preferred Reporting Items for Systematic Reviews and Meta-analysis (PRISMA) statement [20] (Additional file 1: Table S1). A review protocol is available through the International Prospective Register of Systematic Reviews (PROSPERO) (CRD 42022313519).

Literature search and study selection

A structured search was performed independently by two reviewers (M.H. and J.R.) with 5 and 3 years of experience in gynecological imaging interpretation, respectively. Any disagreement was arbitrated by a third reviewer (Y.H.) with 12 years of experience, which was performed in 1.9% of the studies (8/418). The reviewers independently screened the titles, abstracts, and full texts of the extracted articles; uncertainties were discussed and resolved by consensus. Detailed search strategies and selection criteria are described in Additional file 1.

Data extraction and methodological quality assessment

After selecting the relevant studies, the two reviewers developed a data extraction instrument (described in Additional file 1: Table S2). After independently reading the full text of each eligible article, they documented: (1) bibliographical information, (2) baseline study information (including study design, imaging technique parameters, and modeling information), and (3) model performance metrics. The extracted information was recorded using the Excel software (Microsoft Corp., Redmond, WA, USA). The methodological quality of the eligible articles was independently assessed by the two reviewers using the RQS (described in Additional file 1: Table S3) [16] and Quality Assessment of QUADAS-2 tools [19]. Each of the 16 key components identified by the RQS was rated, resulting in a total score ranging from − 8 to 36 points; these were converted to RQS percentages, with − 8 to 0 points defined as 0% and 36 points defined as 100% [16]. The QUADAS-2 tool comprises seven assessment items reflecting four domains: ‘patient selection’, ‘index test’, ‘reference standard’, and ‘flow and timing’. Each item was judged as “low,” “high,” or “unclear” based on responses to signaling questions on the risk of bias and applicability concerns [19]. A summed RQS rating calculated by averaging the scores of the two reviewers was determined for each study. For QUADAS-2 assessment, any disagreement was arbitrated with a third reviewer (H.Y.) to achieve a common appraisal for each item. To facilitate additional analysis, a final risk assessment was performed for each study as follows: studies in which all seven items were rated “low” were defined as “low risk”, studies in which at least one item was rated “high” were defined as “high risk”, and the remainder were defined as “unclear risk”.

Data synthesis and analysis

Statistical analysis was conducted using the SPSS software version 25.0.0.0 (IBM Corp., Armonk, NY, USA), Review Manager (RevMan) version 5.3, and R (version 4.0.5; R Foundation for Statistical Computing) incorporating the ‘tidyverse’ packages. Categorical variables are presented as numbers and percentages, while continuous variables are presented as means and standard deviations or as medians and ranges. The inter-rater agreement for the RQS and QUADAS-2 was determined using Cohen’s kappa [21] and the ratio of agreements [21, 22]. An interclass correlation coefficient (ICC) of ≤ 0.4 was considered poor, 0.4–0.75 moderate, and > 0.75 good. Only training set performance metrics of the proposed radiomics models were recorded, even if validation sets existed, to maintain consistency among studies. Pairwise correlation analyses between methodological quality, baseline information, and performance metrics were conducted. Correlations between numeric variables were evaluated using linear regression analyses, while associations between numeric and categorical variables were assessed using unpaired Student’s t-tests or Mann–Whitney tests. The significance level α was 0.05, and statistical significance was set at a two-tailed p-value of < 0.05.

Meta-analyses were performed using the STATA software version 17.0 (StataCorp LP, College Station, TX, USA) with the ‘midas’ package when a sufficient number of studies attempted to address a similar question, and two-by-two tables could be extracted or reconstructed based on published data (the details are shown in Additional file 1). The sensitivity, specificity, positive likelihood ratio (PLR), negative likelihood ratio (NLR), and diagnostic odds ratio (DOR) with 95% confidence intervals (CIs) were calculated. Summary receiver operating characteristic (SROC) analysis was performed, and the areas under the curve (AUCs) were obtained to describe the diagnostic accuracy. If a particular study involved multiple models, only the radiomics model was selected. Heterogeneity was assessed using Cochrane’s Q test and Higgin’s inconsistency index (I²) test. Any heterogeneity was considered significant if the p-value on Cochran’s Q test was < 0.05, whereupon the random effects model was used. Higgins I² values of < 25%, 25–50%, and > 50% were associated with low, moderate, and substantial heterogeneity, respectively. In cases of the latter, meta-regression was performed to explore the possible sources of heterogeneity. Deeks’ funnel plots were constructed to illustrate the risk of publication bias.

Results

The study selection process is illustrated in Fig. 1; 57 articles were ultimately included in the systematic review [23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73,74,75,76,77,78,79]. We divided these publications based on the studies’ main objectives into three categories: differential diagnosis (24/57, 42.1%), response evaluation (4/57, 7.0%), and prediction of prognosis (28/57, 49.1%). Additionally, one paper (1.8%) described both differential diagnosis and prognostic prediction. We found that assessments of differential diagnosis and prognostic prediction were both commonly performed; thus, 16 articles focusing on the differential diagnosis of OC [24,25,26, 28,29,30,31,32, 34, 35, 39, 40, 42, 44, 45, 47] and 13 that described studies on prognostic factors [52, 53, 55–57, 64, 66,67,68, 71,72,73,74] were subjected to separate meta-analyses.

Study characteristics

There were between 28 and 1329 patients in each study; their median or mean ages ranged from 38.5 to 75 years. The studies’ baseline information and characteristics are shown in Tables 1 and 2, respectively. Nearly half of the studies were published in imaging journals (28/57, 49.1%); the first authors were mainly specialized in radiology (36/57, 63.2%) and most came from Asia (43/57, 75.4%). Thirty-seven studies used CT and 20 used MRI; most applied the manual segmentation method and three-dimensional analysis. According to the model classification method proposed by the TRIPOD statement [80], a plurality of the studies established developed models validated with random splitting of data (27/57, 47.4%), followed by developed models validated using exactly the same data (10/57, 17.5%) and separate data (9/57, 15.8%). Furthermore, using Zhong et al. method [22], 59.6% of the studies were classified as phase 0 owing to their lack of external validation. Two phase III studies were conducted without sufficient patients or lacked external validation; however, they retained their phase categories owing to their prospective designs. Detailed characteristics of each study’s population and proposed radiomics model are presented in Additional file 1: Tables S4 and S5.

Table 1 Baseline information of studies included in the systematic review

Full size table

Table 2 Characteristics of included studies

Full size table

Methodological quality assessment

The 57 studies had a mean RQS of 12 (interquartile range 10–14), with RQS values ranging from − 4 to 22. The mean percentage RQS was 30.7%, with a maximum of 61.1%. The average ICC for the RQS was 0.80 (95% confidence interval (CI) 0.69–0.91), i.e., a ‘good’ rating. The average rating and inter-rater agreement per RQS component are shown in Table 3. In most studies, the lack of reproducibility and repeatability analysis of imaging features (e.g., phantom study and imaging at multiple time points), insufficient cost-effectiveness analyses, and inadequate access to the data led to a low RQS. Additionally, biological correlation and cutoff analyses were seldom performed in the aftermath, and the rated validation score (2 [2]) was suboptimal. Discrimination statistics to assess model performance were available for all studies; 21 of them also reported calibration statistics. Fifty-five studies (96.5%) were retrospective analyses, while only two (3.5%) were based on prospectively acquired data. The RQS assessments by each reviewer (M.H. and J.R.) are shown in Additional file 1: Table S6.

Table 3 Average rating and inter-rater agreement per component of RQS

Full size table

The summarized and individual QUADAS-2 results are presented in Fig. 2 and Additional file 1: Table S7. A high risk of publication bias (19.3%) and applicability concerns (22.8%) were observed in terms of patient selection, mainly because of inappropriate exclusion. Twelve studies (21.1%) were regarded as having an unclear risk of bias in flow and timing, as they did not provide sufficient information regarding the interval between index tests and reference standards. Complete agreement between the two reviewers concerning the seven QUADAS-2 items ranged from 80.7 to 96.5%.

Pairwise correlation analysis of methodological quality, baseline information, and performance metrics

The pairwise correlation analysis results are presented in Fig. 3. One article that discussed both differential diagnosis and prognostic prediction was analyzed twice as two separate studies. The RQS values were significantly different between studies with different QUADAS-2 risk assessments (p = 0.011). Importantly, studies deemed low risk had a higher mean RQS than those with high or unclear risk; the difference between low-risk and unclear-risk studies was significant (p = 0.002). The study aim (p = 0.180) and specialty of the first author (p = 0.520) did not influence the RQS rating. Studies published more recently received significantly higher scores (adjusted R² = 0.264, p < 0.001).

The performance metrics were represented by the AUC in 49 studies (86.0%) and by the c-index in three (5.3%); they were missing in the remaining five (8.8%). Two of the studies explored two radiomics models with different aims; hence, their performance metrics were documented separately. The best radiomics model used in each study had an AUC or c-index value between 0.620 and 1.000. Moreover, the performance of the radiomics model was closely related to the study aim (p < 0.001); performance was significantly better in studies exploring differential diagnosis than in those evaluating response (p = 0.012) or predicting prognosis (p < 0.001). No significant correlation was found between performance metrics and RQS (adjusted R² = − 0.019, p = 0.892), sample size (adjusted R² = − 0.019, p = 0.938), or number of features extracted (adjusted R² = 0.046, p = 0.068).

Meta-analysis

Sixteen studies that focused on differential diagnosis and 13 that investigated prognostic prediction were subjected to a meta-analysis. For studies of differential diagnosis, the pooled diagnostic odds ratio (DOR) was 25.76 (95% CI 13.50–49.13) (Fig. 4a); the pooled sensitivity, specificity, positive likelihood ratio (PLR), and negative likelihood ratio (NLR) were 0.84 (95% CI 0.76–0.89), 0.83 (95% CI 0.77–0.88), 5.00 (95% CI 3.58–6.97), and 0.19 (95% CI 0.13–0.30), respectively (Additional file 1: Figs. S1a and Fig. S2a). The SROC curve suggested good diagnostic performance with an AUC of 0.90 (95% CI 0.87–0.92) (Additional file 1: Fig. S3a). For studies analyzing prognostic prediction, the pooled DOR, sensitivity, specificity, PLR, and NLR were 12.55 (95% CI 8.38–18.77), 0.78 (95% CI 0.71–0.83), 0.78 (95% CI 0.72–0.82), 3.59 (95% CI 2.80–4.59), and 0.29 (95% CI 0.22–0.37), respectively (Fig. 4b, Additional file 1: Figs. S1b, and S2b). The AUC calculated from the SROC curve was 0.85 (95% CI 0.81–0.88), indicating good performance of the prognostic prediction models (Additional file 1: Fig. S3b). Per Deeks’ funnel plots, the likelihood of publication bias was low for differential diagnosis studies (p = 0.760) but high for prognostic prediction studies (p = 0.040) (Additional file 1: Fig. S4).

Cochrane’s Q test (p < 0.001 and p < 0.001) and Higgin’s I² test (I² = 94% and I² = 91%) indicated high heterogeneity; therefore, a meta-regression analysis was conducted to identify its source (Table 4 and Additional file 1: Fig. S5). The number of patients was significantly associated with heterogeneity in specificity (p < 0.001) for differential diagnosis studies and contributed to heterogeneity in sensitivity (p < 0.001), specificity (p < 0.001), and the AUC (p = 0.04) in prognostic prediction studies. Additionally, the imaging method (CT vs. MRI) influenced heterogeneity in both sensitivity (p < 0.001 for differential diagnosis and p < 0.01 for prognostic prediction studies) and specificity (p < 0.01 for both study types). The heterogeneity of sensitivity and specificity in the differential diagnosis subgroup was associated with the type of region of interest (p = 0.04 for sensitivity and p < 0.01 for specificity). In the prognostic prediction subgroup, however, the region of interest type influenced AUC (p = 0.03), sensitivity (p < 0.001), and specificity (p < 0.01). Meanwhile, the type of features only contributed to heterogeneity in specificity (p < 0.01 for both differential diagnosis and prognostic prediction).

Table 4 The results of meta-regression analysis of studies about differential diagnosis and prognosis prediction of OC

Full size table

Discussion

Our systematic review found that the methodological quality of CT and MRI radiomics for patients with OC was relatively low, while our meta-analysis revealed that radiomics has promising potential in discriminating between OC subtypes and predicting patient prognosis.

Several narrative reviews described the prospects of applying radiomics in OC [8, 17, 81, 82]. Rizzo et al.’s systematic review [83] included six studies that lacked RQS ratings and meta-analyses, whereas that by Ponsiglione et al. [18] used the RQS to evaluate the methodological quality of studies involving CT-, MRI-, ultrasonography-, or positron emission tomography-based radiomics in ovarian disorders published as of November 2021. The field of radiomics is in a period of rapid growth; 42.1% of the studies we analyzed were published in 2022 or 2023. Aside from the RQS, QUADAS-2 was applied to assess potential publication bias and applicability concerns. While previously published reviews usually describe only qualitative analyses, we performed quantitative evaluation of the effect of radiomics models by conducting a meta-analysis; our mean RQS (30.7%) was acceptable when compared to those published previously (5.6–36.1%) [18, 22, 84-87]. However, the overall scientific quality was still unsatisfactory, given that the assessment values were considerably below 100%. The main reasons for the low RQS ratings, as in previous analyses [22, 86, 88], included low levels of feature robustness, insufficient model assessment, and lack of concern for clinical applications.

The primary challenge for feature robustness was the high variability in radiomics features; few of the publications we analyzed described phantom studies [74, 77, 78], multiple time-point imaging, or automatic segmentation [49,50,51]. Regarding model assessment, radiomics studies usually included discrimination statistics; however, cutoff analyses and calibration statistics were often neglected, thus complicating the risk evaluation of overly optimistic results and accuracy of model predictions. Classification indexes including sensitivity and specificity, which are critical for quantitative analysis, were lacking in some studies. Our results suggest the need for performing additional technical validation before considering radiomics models for clinical applications.

Although most studies involved internal validation (which is indispensable for the clinical translation and broad application of radiomics models), the absence of external validation at several centers undermined the credibility and generalizability of the models. Very few prospective studies [37, 66], which are considered to have a high level of evidence, were performed. Furthermore, analysis of cost-effectiveness was absent from all our included studies, and the lack of decision curve analysis in approximately two-thirds of them also obscured the applicability of the models to clinical settings. As such, technical and clinical validations remain critical unmet requirements for incorporating radiomics analysis into clinical applications.

The importance of open science is emphasized in the RQS ratings to ensure the transparency and reproducibility of research findings, but such access was lacking in our studies. As such, we suggest that researchers should at least include the computed numerical values of any investigated features in their publications.

In terms of correlations between the RQS and original variables, there was a significant relationship between RQS and publication year. More modern studies presumably have a larger sample size, collect data from multiple centers, use a prospective design, and/or apply more strict inclusion and exclusion criteria, thereby contributing to a higher RQS. Additionally, a higher RQS was significantly correlated with low-level risk according to QUADAS-2, which may support using the former to improve research quality. We also found that studies exploring differential diagnosis had significantly higher performance metrics, whereas studies of response evaluation or prognostic prediction may be more affected by non-relevant factors, such as lifestyle and other medications, given that they have longer follow-up periods. Contrary to our expectation, we did not find a significant relationship between performance metrics and the number of features extracted, which could be explained by the heterogeneity in feature selection and modeling.

The results of our meta-analysis were promising; the pooled AUCs reached 0.90 for differential diagnosis and 0.85 for prognostic prediction, indicating a relatively high accuracy in categorizing different pathological subtypes and predicting the prognoses of patients with OC. According to Cochrane’s Q and Higgin’s I² tests, studies included in the meta-analysis had high levels of heterogeneity, which a meta-regression analysis attributed to the influence of patient number, imaging modality, region of interest, and feature type.

Our study had some limitations. First, most included articles did not report the numbers of true/false positives or true/false negatives, and our calculation of these data from the available information might have introduced some errors. Second, studies predicting chemotherapy response were not included in the meta-analysis owing to insufficient data. Third, studies involved in the meta-analysis showed high heterogeneity; although we found significant correlations between such heterogeneity and certain factors, the latter may not have sufficiently explained the former. Fourth, the prognostic prediction studies included in our meta-analysis had a significant risk of publication bias, likely because we only included English-language articles and also because half of the studies comprised small sample sizes; reluctance to publish negative data may also have been a factor.

In conclusion, radiomics analysis showed promise in terms of overcoming some current obstacles in determining differential diagnosis, chemotherapy response, and prognosis in patients with OC. Pairwise correlation analysis revealed a significant relationship between RQS and QUADAS-2 result or publication year, as well as between performance metrics and study aims. Additionally, our meta-analysis demonstrated the suitability of radiomics analysis for discriminating between various subtypes of OC and identifying prognostic factors through quantitative analysis. Given the generally low RQS ratings of all the included studies, the methodological quality of radiomics studies involving OC is lower than desirable; hence, more high-level evidence is required to develop effective radiomics models.

Availability of data and materials

All data generated or analyzed during this study are included in this published article [and its Additional files].

Abbreviations

AUC:: Area under the curve
CI:: Confidence interval
DOR:: Diagnostic odds ratio
ICC:: Interclass correlation coefficient
NLR:: Negative likelihood ratio
OC:: Ovarian cancer
PLR:: Positive likelihood ratio
PRISMA:: Preferred Reporting Items for Systematic Reviews and Meta-analysis
PROSPERO:: International Prospective Register of Systematic Reviews
QUADAS-2:: Quality Assessment of Diagnostic Accuracy Studies 2
RQS:: Radiomics quality score
SROC:: Summary receiver operating characteristic

References

Sung H, Ferlay J, Siegel RL et al (2021) Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin 71:209–249
Article PubMed Google Scholar
Siegel RL, Miller KD, Wagle NS, Jemal A (2023) Cancer statistics, 2023. CA Cancer J Clin 73:17–48
Article PubMed Google Scholar
WHO classification of Tumours Editorial Board (2020) WHO classification of tumours. In: Female genital tumours, 5th ed. IARC Press, Lyon
Park HK, Ruterbusch JJ, Cote ML (2017) Recent trends in ovarian cancer incidence and relative survival in the United States by race/ethnicity and histologic subtypes. Cancer Epidemiol Biomark Prev 26:1511–1518
Article Google Scholar
National Comprehensive Cancer Network (2023) Ovarian cancer (Version 1.2023). Available via https://www.nccn.org/professionals/physician_gls/pdf/ovarian.pdf. Accessed 3 Jan 2023
Peres LC, Cushing-Haugen KL, Köbel M et al (2019) Invasive epithelial ovarian cancer survival by histotype and disease stage. J Natl Cancer Inst 111:60–68
Article PubMed Google Scholar
Gaona-Luviano P, Medina-Gaona LA, Magaña-Pérez K (2020) Epidemiology of ovarian cancer. Chin Clin Oncol 9:47–47
Article PubMed Google Scholar
Nougaret S, McCague C, Tibermacine H, Vargas HA, Rizzo S, Sala E (2021) Radiomics and radiogenomics in ovarian cancer: a literature review. Abdom Radiol (NY) 46:2308–2322
Article CAS PubMed Google Scholar
Lee JM, Minasian L, Kohn EC (2019) New strategies in ovarian cancer treatment. Cancer 125:4623–4629
Article CAS PubMed Google Scholar
Fujiwara K, Hasegawa K, Nagao S (2019) Landscape of systemic therapy for ovarian cancer in 2019: primary therapy. Cancer 125:4582–4586
Article CAS PubMed Google Scholar
Siegel RL, Miller KD, Fuchs HE, Jemal A (2022) Cancer statistics, 2022. CA Cancer J Clin 72:7–33
Article PubMed Google Scholar
Arend R, Martinez A, Szul T, Birrer MJ (2019) Biomarkers in ovarian cancer: To be or not to be. Cancer 125:4563–4572
Article CAS PubMed Google Scholar
Eisenhauer EA, Therasse P, Bogaerts J et al (2009) New response evaluation criteria in solid tumours: revised RECIST guideline (version 1.1). Eur J Cancer 45:228–247
Article CAS PubMed Google Scholar
Faggioni L, Coppola F, Ferrari R, Neri E, Regge D (2017) Usage of structured reporting in radiological practice: results from an Italian online survey. Eur Radiol 27:1934–1943
Article PubMed Google Scholar
Granata V, Faggioni L, Grassi R et al (2022) Structured reporting of computed tomography in the staging of colon cancer: a Delphi consensus proposal. Radiol Med 127:21–29
Article PubMed Google Scholar
Lambin P, Leijenaar RTH, Deist TM et al (2017) Radiomics: the bridge between medical imaging and personalized medicine. Nat Rev Clin Oncol 14:749–762
Article PubMed Google Scholar
Arezzo F, Loizzi V, La Forgia D et al (2021) Radiomics analysis in ovarian cancer: a narrative review. Appl Sci Basel 11:7833
Article CAS Google Scholar
Ponsiglione A, Stanzione A, Spadarella G et al (2022) Ovarian imaging radiomics quality score assessment: an EuSoMII radiomics auditing group initiative. Eur Radiol. https://doi.org/10.1007/s00330-022-09180-w
Article PubMed PubMed Central Google Scholar
Whiting PF, Rutjes AW, Westwood ME et al (2011) QUADAS-2: a revised tool for the quality assessment of diagnostic accuracy studies. Ann Intern Med 155:529–536
Article PubMed Google Scholar
McInnes MDF, Moher D, Thombs BD et al (2018) Preferred reporting items for a systematic review and meta-analysis of diagnostic test accuracy studies. JAMA 319:388
Article PubMed Google Scholar
Landis JR, Koch GG (1977) The measurement of observer agreement for categorical data. Biometrics 33:159–174
Article CAS PubMed Google Scholar
Zhong J, Hu Y, Si L et al (2021) A systematic review of radiomics in osteosarcoma: utilizing radiomics quality score as a tool promoting clinical translation. Eur Radiol 31:1526–1535
Article PubMed Google Scholar
Zheng Y, Wang H, Li Q, Sun H, Guo L (2022) Discriminating between benign and malignant solid ovarian tumors based on clinical and radiomic features of MRI. Acad Radiol. https://doi.org/10.1016/j.acra.2022.06.007
Article PubMed Google Scholar
Zhang A, Hu Q, Ma Z, Song J, Chen T (2022) Application of enhanced computed tomography-based radiomics nomogram analysis to differentiate metastatic ovarian tumors from epithelial ovarian tumors. J Xray Sci Technol 30:1185–1199
PubMed Google Scholar
Xu Y, Luo HJ, Ren J, Guo LM, Niu J, Song X (2022) Diffusion-weighted imaging-based radiomics in epithelial ovarian tumors: assessment of histologic subtype. Front Oncol 12:978123
Article PubMed PubMed Central Google Scholar
Wei M, Zhang Y, Bai G et al (2022) T2-weighted MRI-based radiomics for discriminating between benign and borderline epithelial ovarian tumors: a multicenter study. Insights Imaginging 13:130
Article Google Scholar
Wang M, Perucho JAU, Hu Y et al (2022) Computed tomographic radiomics in differentiating histologic subtypes of epithelial ovarian carcinoma. JAMA Netw Open 5:e2245141
Article PubMed PubMed Central Google Scholar
Nagawa K, Kishigami T, Yokoyama F et al (2022) Diagnostic utility of a conventional MRI-based analysis and texture analysis for discriminating between ovarian thecoma-fibroma groups and ovarian granulosa cell tumors. J Ovarian Res 15:65
Article CAS PubMed PubMed Central Google Scholar
Liu X, Wang T, Zhang G et al (2022) Two-dimensional and three-dimensional T2 weighted imaging-based radiomic signatures for the preoperative discrimination of ovarian borderline tumors and malignant tumors. J Ovarian Res 15:22–22
Article PubMed PubMed Central Google Scholar
Liu P, Liang X, Liao S, Lu Z (2022) Pattern classification for ovarian tumors by integration of radiomics and deep learning features. Curr Med Imaging 18:1486–1502
Article PubMed Google Scholar
Li S, Liu J, Xiong Y et al (2022) Application values of 2D and 3D radiomics models based on CT plain scan in differentiating benign from malignant ovarian tumors. Biomed Res Int 2022:5952296
PubMed PubMed Central Google Scholar
Li J, Zhang T, Ma J, Zhang N, Zhang Z, Ye Z (2022) Machine-learning-based contrast-enhanced computed tomography radiomic analysis for categorization of ovarian tumors. Front Oncol 12:934735
Article PubMed PubMed Central Google Scholar
Li J, Li X, Ma J, Wang F, Cui S, Ye Z (2022) Computed tomography-based radiomics machine learning classifiers to differentiate type I and type II epithelial ovarian cancers. Eur Radiol. https://doi.org/10.1007/s00330-022-09318-w
Article PubMed PubMed Central Google Scholar
Zhu H, Ai Y, Zhang J et al (2021) Preoperative nomogram for differentiation of histological subtypes in ovarian cancer based on computer tomography radiomics. Front Oncol 11:642892
Article PubMed PubMed Central Google Scholar
Yu X-p, Wang L, Yu H-y et al (2021) MDCT-based radiomics features for the differentiation of serous borderline ovarian tumors and serous malignant ovarian tumors. Cancer Manag Res 13:329–336
Article PubMed PubMed Central Google Scholar
Ye R, Weng S, Li Y et al (2021) Texture analysis of three-dimensional MRI images may differentiate borderline and malignant epithelial ovarian tumors. Korean J Radiol 22:106–117
Article PubMed Google Scholar
Song X-l, Ren J-L, Zhao D, Wang L, Ren H, Niu J (2021) Radiomics derived from dynamic contrast-enhanced MRI pharmacokinetic protocol features: the value of precision diagnosis ovarian neoplasms. Eur Radiol 31:368–378
Article CAS PubMed Google Scholar
Park H, Qin L, Guerra P, Bay CP, Shinagare AB (2021) Decoding incidental ovarian lesions: use of texture analysis and machine learning for characterization and detection of malignancy. Abdom Radiol (NY) 46:2376–2383
Article PubMed Google Scholar
Li S, Liu J, Xiong Y et al (2021) A radiomics approach for automated diagnosis of ovarian neoplasm malignancy in computed tomography. Sci Rep 11:8730
Article CAS PubMed PubMed Central Google Scholar
Li NY, Shi B, Chen YL et al (2021) The value of MRI findings combined with texture analysis in the differential diagnosis of primary ovarian granulosa cell tumors and ovarian thecoma-fibrothecoma. Front Oncol 11:758036
Article PubMed PubMed Central Google Scholar
Jian J, Li Y, Pickhardt PJ et al (2021) MR image-based radiomics to differentiate type Iota and type Iota Iota epithelial ovarian cancers. Eur Radiol 31:403–410
Article PubMed Google Scholar
Hu Y, Weng Q, Xia H et al (2021) A radiomic nomogram based on arterial phase of CT for differential diagnosis of ovarian cancer. Abdom Radiol (NY) 46:2384–2392
Article PubMed Google Scholar
An H, Wang Y, Wong EMF et al (2021) CT texture analysis in histological classification of epithelial ovarian carcinoma. Eur Radiol 31:5050–5058
Article PubMed Google Scholar
Qian L, Ren J, Liu A et al (2020) MR imaging of epithelial ovarian cancer: a combined model to predict histologic subtypes. Eur Radiol 30:5815–5825
Article CAS PubMed Google Scholar
Lupean R-A, Sefan P-A, Feier DS et al (2020) Radiomic analysis of MRI images is instrumental to the stratification of ovarian cysts. J Personal Med 10:127
Article Google Scholar
Ya Li, Jian J, Pickhardt PJ et al (2020) MRI-based machine learning for differentiating borderline from malignant epithelial ovarian tumors: a multicenter study. J Magn Reson Imag 52:897–904
Article Google Scholar
Zhang H, Mao Y, Chen X et al (2019) Magnetic resonance imaging radiomics in categorizing ovarian masses and predicting clinical outcome: a preliminary study. Eur Radiol 29:3358–3371
Article PubMed Google Scholar
Rundo L, Beer L, Escudero Sanchez L et al (2022) Clinically interpretable radiomics-based prediction of histopathologic response to neoadjuvant chemotherapy in high-grade serous ovarian carcinoma. Front Oncol 12:2423
Article Google Scholar
Zargari A, Du Y, Heidari M et al (2018) Prediction of chemotherapy response in ovarian cancer patients using a new clustered quantitative image marker. Phys Med Biol 63:155020
Article PubMed PubMed Central Google Scholar
Danala G, Thai T, Gunderson CC et al (2017) Applying quantitative CT image feature analysis to predict response of ovarian cancer patients to chemotherapy. Acad Radiol 24:1233–1239
Article PubMed PubMed Central Google Scholar
Qiu Y, Tan M, McMeekin S et al (2016) Early prediction of clinical benefit of treating ovarian cancer using quantitative CT image feature analysis. Acta Radiol 57:1149–1155
Article PubMed Google Scholar
Wan S, Zhou T, Che R et al (2023) CT-based machine learning radiomics predicts CCR5 expression level and survival in ovarian cancer. J Ovarian Res 16:1
Article CAS PubMed PubMed Central Google Scholar
Wu Y, Jiang W, Fu L, Ren M, Ai H, Wang X (2022) Intra- and peritumoral radiomics for predicting early recurrence in patients with high-grade serous ovarian cancer. Abdom Radiol (NY). https://doi.org/10.1007/s00261-022-03717-9
Article PubMed Google Scholar
Wang T, Wang H, Wang Y et al (2022) MR-based radiomics-clinical nomogram in epithelial ovarian tumor prognosis prediction: tumor body texture analysis across various acquisition protocols. J Ovarian Res 15:6
Article CAS PubMed PubMed Central Google Scholar
Lu J, Cai S, Wang F et al (2022) Development of a prediction model for gross residual in high-grade serous ovarian cancer by combining preoperative assessments of abdominal and pelvic metastases and multiparametric MRI. Acad Radiol. https://doi.org/10.1016/j.acra.2022.12.019
Article PubMed Google Scholar
Li C, Wang H, Chen Y et al (2022) A nomogram combining MRI multisequence radiomics and clinical factors for predicting recurrence of high-grade serous ovarian carcinoma. J Oncol 2022:1716268
PubMed PubMed Central Google Scholar
Hu J, Wang Z, Zuo R et al (2022) Development of survival predictors for high-grade serous ovarian cancer based on stable radiomic features from computed tomography images. iScience 25:104628
Article PubMed PubMed Central Google Scholar
Hong Y, Liu Z, Lin D et al (2022) Development of a radiomic-clinical nomogram for prediction of survival in patients with serous ovarian cancer. Clin Radiol 77:352–359
Article CAS PubMed Google Scholar
Gao L, Jiang W, Yue Q et al (2022) Radiomic model to predict the expression of PD-1 and overall survival of patients with ovarian cancer. Int Immunopharmacol 113:109335
Article CAS PubMed Google Scholar
Fotopoulou C, Rockall A, Lu H et al (2022) Validation analysis of the novel imaging-based prognostic radiomic signature in patients undergoing primary surgery for advanced high-grade serous ovarian cancer (HGSOC). Br J Cancer 126:1047–1054
Article CAS PubMed Google Scholar
Feng S, Xia T, Ge Y et al (2022) Computed tomography imaging-based radiogenomics analysis reveals hypoxia patterns and immunological characteristics in ovarian cancer. Front Immunol 13:868067
Article CAS PubMed PubMed Central Google Scholar
Boehm KM, Aherne EA, Ellenson L et al (2022) Multimodal data integration using machine learning improves risk stratification of high-grade serous ovarian cancer. Nat Cancer 3:723–733
Article CAS PubMed PubMed Central Google Scholar
Avesani G, Tran HE, Cammarata G et al (2022) CT-based radiomics and deep learning for BRCA mutation and progression-free survival prediction in ovarian cancer using a multicentric dataset. Cancers 14:2739
Article CAS PubMed PubMed Central Google Scholar
Yu XY, Ren J, Jia Y et al (2021) Multiparameter MRI radiomics model predicts preoperative peritoneal carcinomatosis in ovarian cancer. Front Oncol 11:765652
Article PubMed PubMed Central Google Scholar
Yi X, Liu Y, Zhou B et al (2021) Incorporating SULF1 polymorphisms in a pretreatment CT-based radiomic model for predicting platinum resistance in ovarian cancer treatment. Biomed Pharmacother 133:111013
Article CAS PubMed Google Scholar
Song X-L, Ren J-L, Yao T-Y, Zhao D, Niu J (2021) Radiomics based on multisequence magnetic resonance imaging for the preoperative prediction of peritoneal metastasis in ovarian cancer. Eur Radiol 31:8438–8446
Article PubMed Google Scholar
Liu M, Ge Y, Li M, Wei W (2021) Prediction of BRCA gene mutation status in epithelial ovarian cancer by radiomics models based on 2D and 3D CT images. BMC Med Imaging. https://doi.org/10.1186/s12880-021-00711-3
Article PubMed PubMed Central Google Scholar
Li MR, Liu MZ, Ge YQ, Zhou Y, Wei W (2021) Assistance by routine CT features combined With 3D texture analysis in the diagnosis of BRCA gene mutation status in advanced epithelial ovarian cancer. Front Oncol 11:696780
Article CAS PubMed PubMed Central Google Scholar
Li HM, Gong J, Li RM et al (2021) Development of MRI-based radiomics model to predict the risk of recurrence in patients with advanced high-grade serous ovarian carcinoma. AJR Am J Roentgenol 217:664–675
Article PubMed Google Scholar
Li H, Zhang R, Li R et al (2021) Noninvasive prediction of residual disease for advanced high-grade serous ovarian carcinoma by MRI-based radiomic-clinical nomogram. Eur Radiol 31:7855–7864
Article PubMed Google Scholar
Chen H-z, Wang X-r, Zhao F-m et al (2021) A CT-based radiomics nomogram for predicting early recurrence in patients with high-grade serous ovarian cancer. Eur J Radiol 145:110018
Article PubMed Google Scholar
Chen H-z, Wang X-r, Zhao F-m et al (2021) The development and validation of a CT-based radiomics nomogram to preoperatively predict lymph node metastasis in high-grade serous ovarian cancer. Front Oncol 11:3362
Google Scholar
Ai Y, Zhang J, Jin J, Zhang J, Zhu H, Jin X (2021) Preoperative prediction of metastasis for ovarian cancer based on computed tomography radiomics features and clinical factors. Front Oncol 11:610742
Article PubMed PubMed Central Google Scholar
Veeraraghavan H, Vargas HA, Alejandro-Jimenez S et al (2020) Integrated multi-tumor radio-genomic marker of outcomes in patients with high serous ovarian carcinoma. Cancers 12:3403
Article CAS PubMed PubMed Central Google Scholar
Wei W, Liu Z, Rong Y et al (2019) A computed tomography-based radiomic prognostic marker of advanced high-grade serous ovarian cancer recurrence: a multicenter study. Front Oncol 9:255
Article PubMed PubMed Central Google Scholar
Meier A, Veeraraghavan H, Nougaret S et al (2019) Association between CT-texture-derived tumor heterogeneity, outcomes, and BRCA mutation status in patients with high-grade serous ovarian cancer. Abdom Radiol (NY) 44:2040–2047
Article PubMed Google Scholar
Lu H, Arshad M, Thornton A et al (2019) A mathematical-descriptor of tumor-mesoscopic-structure from computed-tomography images annotates prognostic- and molecular-phenotypes of epithelial ovarian cancer. Nat Commun 10:764
Article CAS PubMed PubMed Central Google Scholar
Rizzo S, Botta F, Raimondi S et al (2018) Radiomics of high-grade serous ovarian cancer: association between quantitative CT features, residual tumour and disease progression within 12 months. Eur Radiol 28:4849–4859
Article PubMed Google Scholar
Vargas HA, Veeraraghavan H, Micco M et al (2017) A novel representation of inter-site tumour heterogeneity from pre-treatment computed tomography textures classifies ovarian cancers by clinical outcome. Eur Radiol 27:3991–4001
Article PubMed PubMed Central Google Scholar
Collins GS, Reitsma JB, Altman DG, Moons KG (2015) Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD statement. Ann Intern Med 162:55–63
Article PubMed Google Scholar
Martin-Gonzalez P, Crispin-Ortuzar M, Rundo L et al (2020) Integrative radiogenomics for virtual biopsy and treatment monitoring in ovarian cancer. Insights Imaging. https://doi.org/10.1186/s13244-020-00895-2
Article PubMed PubMed Central Google Scholar
Nougaret S, Tardieu M, Vargas HA et al (2019) Ovarian cancer: an update on imaging in the era of radiomics. Diagn Interv Imaging 100:647–655
Article CAS PubMed Google Scholar
Rizzo S, Manganaro L, Dolciami M, Gasparri ML, Papadia A, Del Grande F (2021) Computed Tomography Based Radiomics as a Predictor of Survival in Ovarian Cancer Patients: A Systematic Review. Cancers 13.
Staal FCR, Aalbersberg EA, Van Der Velden D et al (2022) GEP-NET radiomics: a systematic review and radiomics quality score assessment. Eur Radiol. https://doi.org/10.1007/s00330-022-08996-w
Article PubMed Google Scholar
Li Y, Liu Y, Liang Y et al (2022) Radiomics can differentiate high-grade glioma from brain metastasis: a systematic review and meta-analysis. Eur Radiol. https://doi.org/10.1007/s00330-022-08828-x
Article PubMed PubMed Central Google Scholar
Gao Y, Cheng S, Zhu L et al (2022) A systematic review of prognosis predictive role of radiomics in pancreatic cancer: heterogeneity markers or statistical tricks? Eur Radiol. https://doi.org/10.1007/s00330-022-08922-0
Article PubMed PubMed Central Google Scholar
Brancato V, Cerrone M, Lavitrano M, Salvatore M, Cavaliere C (2022) A systematic review of the current status and quality of radiomics for glioma differential diagnosis. Cancers 14:2731
Article CAS PubMed PubMed Central Google Scholar
Ursprung S, Beer L, Bruining A et al (2020) Radiomics of computed tomography and magnetic resonance imaging in renal cell carcinoma: a systematic review and meta-analysis. Eur Radiol 30:3558–3566
Article PubMed PubMed Central Google Scholar

Download references

Funding

This study has received funding by grants from Natural Science Foundation of China (Grant No. 81901829), National High Level Hospital Clinical Research Funding (Grant No. 2022-PUMCH-A-004) and Natural Science Foundation of China (Grant No. 82271886).

Author information

Meng-Lin Huang and Jing Ren contributed equally to this work.

Authors and Affiliations

Department of Radiology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, People’s Republic of China
Meng-Lin Huang, Jing Ren, Zheng-Yu Jin, Xin-Yu Liu, Yong-Lan He & Hua-Dan Xue
Department of Obstetrics and Gynecology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, National Clinical Research Center for Obstetric & Gynecologic Diseases, Beijing, People’s Republic of China
Yuan Li

Authors

Meng-Lin Huang
View author publications
You can also search for this author in PubMed Google Scholar
Jing Ren
View author publications
You can also search for this author in PubMed Google Scholar
Zheng-Yu Jin
View author publications
You can also search for this author in PubMed Google Scholar
Xin-Yu Liu
View author publications
You can also search for this author in PubMed Google Scholar
Yong-Lan He
View author publications
You can also search for this author in PubMed Google Scholar
Yuan Li
View author publications
You can also search for this author in PubMed Google Scholar
Hua-Dan Xue
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

ZYJ, HDX, YL and YLH contributed to conceptualization; HDX, YLH, MLH and JR were involved in protocol finalization; JR and XYL were involved in technical contributions; JR, XYL and MLH contributed to article selection; YLH was involved in article consensus; MLH, JR and YLH contributed to manuscript writing; MLH, JR and XYL were involved in visualization; HDX, YL and YLH contributed to writing—review and editing; YLH, ZYJ and HDX were involved in supervision. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Yong-Lan He, Yuan Li or Hua-Dan Xue.

Ethics declarations

Ethics approval and consent to participate

Institutional Review Board approval was not required because our data are extracted from existing manuscripts.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1

. Electronic Supplementary Materials.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Huang, ML., Ren, J., Jin, ZY. et al. A systematic review and meta-analysis of CT and MRI radiomics in ovarian cancer: methodological issues and clinical utility. Insights Imaging 14, 117 (2023). https://doi.org/10.1186/s13244-023-01464-z

Download citation

Received: 12 April 2023
Accepted: 11 June 2023
Published: 03 July 2023
DOI: https://doi.org/10.1186/s13244-023-01464-z

A systematic review and meta-analysis of CT and MRI radiomics in ovarian cancer: methodological issues and clinical utility

Abstract

Objectives

Methods

Results

Conclusion

Critical relevance statement

Graphical abstract

Key points

Introduction

Materials and methods

Evidence acquisition protocol and registry

Literature search and study selection

Data extraction and methodological quality assessment

Data synthesis and analysis

Results

Study characteristics

Methodological quality assessment

Pairwise correlation analysis of methodological quality, baseline information, and performance metrics

Meta-analysis

Discussion

Availability of data and materials

Abbreviations

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher's Note

Supplementary Information

Additional file 1

Rights and permissions

About this article

Cite this article

Share this article

Keywords