Skip to main content
  • Original Article
  • Open access
  • Published:

Enhancing prediction of supraspinatus/infraspinatus tendon complex injuries through integration of deep visual features and clinical information: a multicenter two-round assessment study

Abstract

Objective

Develop and evaluate an ensemble clinical machine learning–deep learning (CML-DL) model integrating deep visual features and clinical data to improve the prediction of supraspinatus/infraspinatus tendon complex (SITC) injuries.

Methods

Patients with suspected SITC injuries were retrospectively recruited from two hospitals, with clinical data and shoulder x-ray radiographs collected. An ensemble CML-DL model was developed for diagnosing normal or insignificant rotator cuff abnormality (NIRCA) and significant rotator cuff tear (SRCT). All patients suspected with SRCT were confirmed by arthroscopy examination. The model’s performance was evaluated using sensitivity, specificity, accuracy, and area under the curve (AUC) metrics, and a two-round assessment was conducted to authenticate its clinical applicability.

Results

A total of 974 patients were divided into three cohorts: the training cohort (n = 828), the internal validation cohort (n = 89), and the external validation cohort (n = 57). The CML-DL model, which integrates clinical and deep visual features, demonstrated superior performance compared to individual models of either type. The model’s sensitivity, specificity, accuracy, and area under curve (95% confidence interval) were 0.880, 0.812, 0.836, and 0.902 (0.858–0.947), respectively. The CML-DL model exhibited higher sensitivity and specificity compared to or on par with the physicians in all validation cohorts. Furthermore, the assistance of the ensemble CML-DL model resulted in a significant improvement in sensitivity for junior physicians in all validation cohorts, without any reduction in specificity.

Conclusions

The ensembled CML-DL model provides a solution to help physicians improve the diagnosis performance of SITC injury, especially for junior physicians with limited expertise.

Critical relevance statement

The ensembled clinical machine learning–deep learning (CML-DL) model integrating deep visual features and clinical data provides a superior performance in the diagnosis of supraspinatus/infraspinatus tendon complex (SITC) injuries, particularly for junior physicians with limited expertise.

Key points

1. Integrating clinical and deep visual features improves diagnosing SITC injuries.

2. Ensemble CML-DL model validated for clinical use in two-round assessment.

3. Ensemble model boosts sensitivity in SITC injury diagnosis for junior physicians.

Graphical Abstract

Introduction

Rotator cuff tears (RCTs) are a prevalent and debilitating condition that affect millions of people worldwide [1]. RCTs are a common cause of shoulder pain and dysfunction and are often associated with significant reductions in quality of life and functional disability. The prevalence of RCTs increases with age, with some studies suggesting that up to 50% of individuals over the age of 60 may have asymptomatic RCTs [2].

Despite the high prevalence of RCTs, the accurate, rapid diagnosis and evaluation of the severity of this condition can be challenging [3]. When evaluating RCTs, we generally examine the supraspinatus/infraspinatus tendon complex (SITC) and subscapularis tendon separately. The SITC is the most frequently torn tendon and can cause significant pain and disability for patients. Clinical information such as medical history, physical examination, and clinical scoring can assist in detecting SITC injury with a sensitivity and specificity range of 40 to 80% [4]. Radiographs such as shoulder x-ray are convenient and allow for the visualization of bone changes such as humeral head migration and subacromial spurs that are associated with SITC injuries [5]. And the detection of sclerosis and cortical irregularity at the greater tuberosity through shoulder x-rays could suggest the presence of a SITC injuries. However, it is important to note that x-rays exhibit limited sensitivity when it comes to accurately detecting SITC injuries [6]. Advanced imaging techniques such as ultrasound, magnetic resonance imaging (MRI), or arthrography may be necessary for accurate identification of SITC injuries. However, the accuracy of ultrasound for SITC injuries heavily relies on the clinician’s expertise and experience, with reported sensitivities and specificities varying within the range of 60–100%. On the other hand, MRI or arthrography can be costly and may not be necessary for all patients.

Machine learning and deep learning (DL) neural networks have recently emerged as promising tools for diagnosing and evaluating SITC injuries [7,8,9,10]. However, these models have some limitations in clinical application. Firstly, most of these models are developed based on a single modality, either clinical information or MRI, leading to lower diagnostic performance [9, 10]. Secondly, these models lack the ability to comprehensively evaluate RCTs, as clinicians do, by assimilating patients’ clinical information, physical examination, scoring, and radiography findings. Lastly, the potential benefits of utilizing artificial intelligence in actual diagnostic scenarios for medical professionals have not been analyzed. Therefore, accurate and reliable diagnostic tools that can effectively combine clinical information with radiographic findings for early detection and accurate assessment of the severity of RCTs are urgently needed. Such tools align better with the diagnostic thinking habits of clinicians.

The study was devised with these limitations in mind (1) to develop the ensemble CML-DL model, a deep learning model that incorporates clinical information and radiographic findings to accurately assess the severity of SITC injury, and (2) to validate the clinical benefits of using this deep learning model to assist clinical decision-making. We believe that the development and implementation of such tools can improve patient outcomes, reduce the burden of disease, and optimize treatment strategies.

Methods

Patients

This retrospective study was conducted in accordance with the Helsinki Declaration and was approved by the ethics committees of all hospitals involved (SYSEC-KY-KS-2021–184). The requirement for informed consent was waived. This study adhered to the Standards for Reporting Diagnostic Accuracy Studies guidelines to ensure accurate and transparent reporting. Additionally, the Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis Guidelines were followed to provide a comprehensive framework for reporting prediction models in this study.

We selected patients from the Sun Yat-sen Memorial Hospital of Sun Yat-sen University (center 1) as our primary cohort due to its larger sample size. The study included patients enrolled between January 2018 and April 2023. For internal validation, we included patients admitted after January 2021, while others were used for the training cohort. To ensure independent external validation, we also included data from the Shenshan Medical Center, Sun Yat-sen Memorial Hospital of Sun Yat-sen University (center 2), as external validation cohorts (Fig. 1).

Fig. 1
figure 1

Flowchart of patient enrollment pathway

This study included patients who met predefined inclusion criteria: (1) patients over 18 year old with a chief complaint of shoulder pain, who underwent a standard medical history inquire, physical examination, and clinical score assessment; (2) patients who received standard anteroposterior view of shoulder x-ray radiograph and MRI when suspected with rotator cuff injury; (3) all patients suspected with SRCT were confirmed by arthroscopy examination; The exclusion criteria were as follows: (1) patients who had undergone previous shoulder surgery; (2) patients with shoulder fractures or tumors; (3) the AP view of radiography did not meet the Suter-Henninger (SH) scapular classification system [11] criteria for A1 or C1 type; and (4) patients with a time interval greater than 4 weeks between clinical assessment and either MRI or shoulder arthroscopy examination.

To analyze the SITC abnormalities, patients were classified into two groups: (1) normal or insignificant rotator cuff abnormalities (NIRCA), encompassing normal rotator cuff, tendinosis, and low-grade partial tears (tears involving ≤ 50% of the tendon thickness), and (2) significant rotator cuff tears (SRCT), which include high-grade partial tears (tears involving > 50% of the tendon thickness) and full-thickness tears, the latter potentially necessitating surgical intervention.

Clinical information

The clinical information was gathered from a multi-center database as part of a multi-center database of shoulder clinical research program (Table 1). This data included patient demographics (age, gender), trauma history, hand dominance, physical examination outcomes (such as painful arc, pseudoparalysis, Jobe tests, external rotation lag sign, lift-off test, belly press test, bear hug test, internal rotation lag sign, Neer sign, Hawkins-Kennedy test, coracoid impingement test, tenderness, Yergason’s test, and Speed’s test), pain levels assessed by the visual analog scale (VAS) score, and clinical scores including the American Shoulder and Elbow Surgeons (ASES) score, Constant-Murley score, Quick Disabilities of the Arm, Shoulder, and Hand (Quick-DASH) score, Simple Shoulder Test (SST), and University of California, Los Angeles (UCLA) score. During pain level assessments, we recorded the VAS score for the most intense pain, regardless of the patient’s posture or mobility. For missing data, such as history of trauma or dominant hand, we generated a separate “unknown” category.

Table 1 Clinical information collected from a multi-center database of shoulder clinical research program

Image acquisition and preprocessing

We retrospectively reviewed shoulder pain patients who visited the orthopedic clinic and underwent standard shoulder anteroposterior radiographs. All radiographs were downloaded in anonymized digital imaging and communications in medicine (DICOM) format. Two orthopedic physicians (J.Y.H. and Q.Z., with 5 and 7 years of experience, respectively) reviewed the images. To eliminate irrelevant information from non-lesion areas, we defined a region of interest (ROI) on the radiographs, with a 512 × 512-pixel rectangular area centered on the humeral head, and the ROI rectangle was then cropped.

Clinical features obtained from CML model

To obtain features from clinical information, we trained four benchmark models—random forest, support vector machine (SVM), lasso, and decision tree—on a cumulative set of 35 clinical variables. The Student T test or U test method was employed to choose significant features that could differentiate between patients with SRCT or NIRCA. Only features with p-values less than 0.05 were kept. The maximum relevance minimal redundancy (mRMR) method was used to assess the relevance and redundancy of each attribute. Maximum relevance sought to identify the attribute with the highest correlation with muscle status. The minimum redundancy criterion was used to ensure that features with the least redundancy were chosen. Using the mRMR technique, the relevance-redundancy index was utilized to order the features. To build the prediction models, a set of significant features with good correlation and low redundancy was chosen.

Deep visual features obtained from DL model

This study applied different deep learning models to predict SITC injury based on shoulder radiographs. The research employed three benchmark deep learning models, namely Resnet-101, Visual Geometry Group (VGG)-19, and Inception-V3. Prior to the training process, these benchmark models underwent pre-training with the ImageNet repository, which contains over one million images of natural origin and a thousand categories of objects. To customize the deep learning models for our specific task, we executed fine-tuning using transfer learning. This process involved freezing the weights of convolution layers that were initially optimized for identifying structures in images. We then replaced the deep layers with innovative, fully connected task-specific layers that were retrained using the backpropagation algorithm. After fine-tuning the three benchmark DL models, we extracted the deep visual features of shoulder radiographs from the fully connected layers. Finally, we trained four DL models on each subset independently and recorded the predicted results. To maximize the deep visual features and enhance the generalization ability of the model, we generated an ensemble DL model by integrating four DL models into a single ensemble model using advanced feature fusion techniques.

Ensemble CML-DL scheme

The CML scheme with the highest AUC was selected to generate the CML-DL scheme by combining the ensemble DL model. The above-mentioned methods utilized integrated features that combined 297 deep visual features, extracted from the ensemble deep learning model, with clinical variables as inputs to predict the outcome for SITC injuries. Our approach effectively leverages the strengths of multiple DL models and demonstrates their potential for accurate and efficient prediction of SITC injuries from shoulder radiographs. Figure 2 illustrates the detailed architecture of the ensemble CML-DL scheme.

Fig. 2
figure 2

Demonstration of construction of the ensemble CML-DL scheme. Four CML models including SVM, lasso, decision tress, and random forest were trained to obtain features from clinical characteristics. An ensemble DL scheme and three benchmark DL models were used to merge the image features extracted from shoulder radiographs. Finally, an ensemble CML-DL model was used to integrate the features obtained from images and digital data to predict NIRAC and SRCT. SRCT, significant rotator cuff tear; NIRCA, normal or insignificant rotator cuff abnormality; SVM, support vector machine, DL, deep learning; CML, clinical machine learning; VGG, Visual Geometry Group

Two-round assessment

To examine the clinical advantages physicians could gain from using the CML-DL model, a two-round assessment was conducted. The study included five physicians affiliated with center 1: three junior physicians (physicians 1–3) with an average clinical experience of 2.3 years (range: 2–3 years) and two experts (physicians 4–5) with an average of 6.5 years of experience in shoulder joint surgeries (range: 5–8 years). An additional cohort of five physicians affiliated with center 2 were also included in the study. The study involved two groups of physicians: physicians 6–8 who were classified as junior physicians with an average of 1.8 years of clinical experience ranging from 1 to 2 years and physicians 9–10 who were classified as experts with an average of 7 years of experience in shoulder joint surgeries ranging from 5 to 9 years. The study examined a total cohort of 146 patients, including 63 individuals who underwent SCRT and were randomly presented from both internal and external validation cohorts. Physicians should make informed decisions based on clinical data and radiographic findings on x-ray films. These may include evidence of a humeral head migration, which suggests a possible SITC injury. Additionally, the presence of subacromial spurs could indicate a potential impingement syndrome, and a critical shoulder angle (CSA) greater than 35° can serve as a risk factor for SITC injury. Throughout the investigation, the medical practitioners were blinded to both the outcomes of the MRI or arthroscopy assessments and each other’s observations.

To assess the diagnostic accuracy of the ensemble CML-DL model, we calculated a total score based on the opinions of five physicians. If a patient was identified as having a significant rotator cuff tear (SRCT) by a physician, one point was awarded. Therefore, the maximum score achievable was 5, while the minimum was 0. A higher total score indicated that a greater proportion of physicians believed the patient had a SRCT.

Statistical analysis

We evaluated the predictive accuracy of the CML, DL, and CML-DL models for SITC injury by calculating their sensitivity and specificity, plotting ROC curves, and calculating the corresponding AUC values. Statistical analyses were conducted using SPSS (version 22.0) and Python 3.8. Continuous variables were presented as mean ± standard deviation (SD), and categorical variables were presented as numbers and percentages. Between-group comparisons were conducted using Student’s t-test or Mann–Whitney U test for quantitative variables and the chi-squared test for qualitative variables. The 95% confidence interval (CI) was computed using bootstrapping with 2000 resamples. All statistical analyses were two-tailed, and a p-value < 0.05 was considered statistically significant.

Results

Clinical information

Based on the inclusion and exclusion criteria, 974 patients from two medical centers were included in the research. Patients from center 1 were split into 828 patients for the training cohort and 89 patients for the internal validation cohort to construct and verify the model. The external validation cohort comprised of 57 patients from center 2. There were no statistically significant differences observed in any characteristics between the NIRCA and SRCT groups in both the internal and external validation cohorts (p > 0.05). Table 2 presents the demographic characteristics of these patients.

Table 2 Demographic characteristics of patients suspected with rotator cuff tears

Model performance of CML, DL, and ensemble CML-DL scheme

Table 3 presents a comparison of diagnostic performance among different models. The differences in performance between different networks in the CML and DL models were small in the internal validation cohort (p < 0.05). Among all the DL models, the ensemble DL model demonstrated the highest performance, with a sensitivity, specificity, accuracy, and AUC (95% CI) of 0.800, 0.653, 0.712, and 0.797 (0.734–0.861), respectively.

Table 3 Diagnostic performance of the deep learning algorithm

Based on the evaluation of various models, we selected random forest as the basis for the CML-DL scheme, as it achieved the highest AUC. As expected, the results indicate that the CML-DL model outperformed all other CML models, exhibiting a sensitivity, specificity, accuracy, and AUC (95% CI) of 0.880, 0.812, 0.836, and 0.902 (0.858–0.947), respectively.

The CML-DL model, as an ensemble, has identified 24 features that are significant and have consistently demonstrated prognostic efficacy in predicting SITC injury. Figure 3 displays the top 15 features, including 6 clinical and 9 deep visual features, listed in descending order of significance. These features were found to be highly informative and played important roles in the model’s ability to accurately predict the SITC injury.

Fig. 3
figure 3

The variance importance plot lists the most significant variables in descending order

Comparison of the deep learning radiomics model to physicians

In the first-round assessment, we contrasted the diagnostic decisions of five physicians with the ensemble CML-DL model. Figure 4 illustrates the ROC curve of the ensemble CML-DL model, the diagnoses of each physician, and the average diagnostic results of all physicians in the various cohorts. Our result showed the ensemble CML-DL model achieved high AUC values of 0.950 (95% CI 0.936–0.963), 0.902 (95% CI 0.858–0.947), and 0.894 (95% CI 0.872–0.915) in the training, internal validation, and external validation cohorts, respectively. The sensitivity in the internal and external validation cohorts were 88.0% and 73.6%, respectively, while the specificity was 81.2% and 87.5%, respectively.

Fig. 4
figure 4

Comparison of the performance of the ensemble CML-DL model with that of physicians. The figure displays the identification of SRCT and NIRCA in the training cohort, internal validation cohort, and external validation cohort using the ensemble CML-DL model and by individual physicians. The performance of our ensemble CML-DL model is compared with each of the five readers and the average reader

Our results indicate that the diagnoses provided by the five physicians were either inferior or equivalent to those generated by the CML-DL model ensemble. The ROC curve analysis indicates that the model’s diagnostic accuracy surpassed that of the physicians, as indicated by the conspicuous lack of green points in the upper left region. Furthermore, the average of all five physicians’ diagnoses in all validation cohorts were located below the ROC curve of the ensemble CML-DL model (Fig. 4, green crosses), indicating that our model was superior to the physicians in general.

Moreover, the mean value of the diagnoses provided by the five physicians across all validation cohorts was positioned beneath the ROC curve of the ensemble CML-DL model (Fig. 4). This suggests that our model outperformed the physicians in a general sense.

Enhanced diagnosis with AI assistance

We analyzed the alterations in diagnoses provided by five physicians before and after AI assistance. The detailed changes in their decision, sensitivity, and specificity are presented in Table 4. The implementation of AI assistance resulted in a significant improvement in the performance of junior physicians (1, 2, and 6–8) across all validation cohorts, without compromising the specificity of the diagnostic process (p < 0.05). The results indicate that the ensemble CML-DL model has a favorable impact on the average accuracy of physicians, as evidenced by the blue points and crosses depicted in Fig. 4. Notably, in the second round of evaluation, physicians 5 and 10 (junior) had significantly higher specificity compared to their performance in the first round (p < 0.05).

Table 4 Summary of the changes in the decision-making of radiologists before and after AI assistance

Figure 5 illustrates the total scores of all cases in the external validation cohort, as assessed by five physicians, to demonstrate the clinical value of our ensemble CML-DL model. Although the ensemble CML-DL model’s predictions may have resulted in incorrect decisions by physicians in some cases, the cumulative scores of the five physicians across all cases in the validation cohorts showed a noticeable improvement in diagnostic efficacy after the implementation of ensemble CML-DL assistance.

Fig. 5
figure 5

A summary of the total scores from five physicians before and after CML-DL model assistance for each case in the external validation cohort. The green and blue circles represent the total score without and with CML-DL model assistance, respectively. The orange circles indicate that the lesion received the same score before and after AI assistance. The arrows indicate the trend of the total score after AI assistance. The total score is calculated as the sum of the scores of the five physicians individually. If an expert believes that a SRCT is present, it is scored as one point, leading to a maximum score of 5. The higher the score, the more physicians believe that the case is a SRCT. SRCT, significant rotator cuff tear; NIRCA, normal or insignificant rotator cuff abnormality; AI, artificial intelligence

Discussion

In this study, our aim was to develop and evaluate the performance of an ensemble CML-DL model for the diagnosis of SITC injuries. Our model demonstrated superior diagnostic performance compared to human physicians in both internal and external validation cohorts. Moreover, we demonstrated that the implementation of the ensemble CML-DL model could improve the diagnostic accuracy of human physicians, highlighting its potential clinical utility in real-world settings.

Our study demonstrated that the ensemble CML-DL model, integrating clinical information and radiography findings, exhibited superior or comparable diagnostic performance to previous studies. This can be attributed to several factors. First, previous studies have shown that medical history, physical examination, and clinical scores are useful for assessing rotator cuff injuries. ML models can improve diagnostic accuracy by automatically learning adaptive features from clinical information [7, 9, 10, 12, 13]. Secondly, radiographic findings are highly correlated with rotator cuff injuries, with specific features such as humeral head migration, supraspinatus calcification, and CSA > 35° being particularly indicative of RCTs. DL models can identify high-level abstract features that human clinicians may not recognize, resulting in higher precision and accuracy [14, 15]. Lastly, ensemble models can achieve better accuracy and generalizability than single deep learning models by combining predictions from multiple models, reducing individual biases and errors, and improving model robustness to overfitting [16].

One of the main strengths of our study was incorporating a real-world assessment with ten physicians from two different centers. This assertion holds significant importance as it is anticipated that CML-DL models will serve as a supplementary component in the coming times. Despite the advantages of DL and radiomics models, ultimate decision-making authority will remain with human physicians. A primary contributing factor to this phenomenon is the nascent stage of interpretability pertaining to deep learning features, coupled with the underexplored nature of the biological mechanism underlying radiomics features. Nevertheless, physicians should not refrain from utilizing deep learning techniques to enhance their diagnostic abilities. The prediction score of the ensemble CML-DL model functioned as a reliable signal for the physicians involved in this study. The model’s outlier score, which indicated high confidence in classifying lesions as either NIRCA or SRCT, played a crucial role in notifying medical practitioners regarding patients with diverse diagnoses determined by both human interpretation and quantitative computational analysis. When this assistance approach was used during the second round of image interpretation, physicians had a remarkable enhancement in their capacity to precisely identify and evaluate SRCT while still maintaining specificity. The ensemble CML-DL model has the potential to benefit clinical practice by supporting junior physicians. Although all physicians received valuable assistance from the model, junior physicians experienced a more significant benefit. Consequently, this methodology can enhance the learning rate of physicians with limited experience.

Despite the promising results of our study, there were several limitations that should be acknowledged. Firstly, the retrospective nature of our study resulted in missing patient information and the exclusion of many patients who did not meet our inclusion and exclusion criteria. Future prospective clinical studies could be conducted to further validate the performance of our ensemble CML-DL scheme. Secondly, although our study was conducted across multiple centers, the dataset used in this study was small. The use of strict inclusion and exclusion criteria, which required specific shoulder radiographs to be eligible for inclusion, resulted in smaller sample sizes at each center and restricted the number of centers that met our eligibility criteria. Lastly, only standard AP view of shoulder radiographs were included in this study. Future studies could consider including additional radiographic views, such as the Y-view and Stryker notch view, which may enhance the model’s predictive accuracy. Nevertheless, the strong performance of our model demonstrated its effectiveness in assisting clinicians to improve the diagnosis of rotator cuff injuries. Future studies with larger, more diverse datasets, multiple imaging views, and prospective designs are needed to further validate and extend our findings.

Conclusions

Our study successfully established an ensemble CML-DL model by combining clinical and deep visual features. The ensemble CML-DL model provides a valuable solution to help physicians improve the diagnostic performance of SITC injury, particularly for junior physicians with limited expertise.

Availability of data and materials

All data used or analyzed in this study are available from the corresponding author upon reasonable request.

Abbreviations

CML:

Clinical machine learning

DL:

Deep learning

MRI:

Magnetic resonance imaging

NIRCA:

Normal or insignificant rotator cuff abnormality

RCTs:

Rotator cuff tears

SITC:

Supraspinatus/infraspinatus tendon complex

SRCT:

Significant rotator cuff tear

VAS:

Visual analog scale

References

  1. Micallef J, Pandya J, Low AK (2019) Management of rotator cuff tears in the elderly population. Maturitas 123:9–14

    Article  PubMed  Google Scholar 

  2. Lawrence RL, Moutzouros V, Bey MJ (2019) Asymptomatic rotator cuff tears. JBJS Rev 7:e9. https://doi.org/10.2106/JBJS.RVW.18.00149

    Article  PubMed  PubMed Central  Google Scholar 

  3. Zadro JR, O’Keeffe M, Ferreira GE et al (2021) Diagnostic labels for rotator cuff disease can increase people’s perceived need for shoulder surgery: an online randomized controlled trial. J Orthop Sport Phys Ther 51:401–411

    Article  Google Scholar 

  4. Lädermann A, Meynard T, Denard PJ et al (2021) Reliable diagnosis of posterosuperior rotator cuff tears requires a combination of clinical tests. Knee Surgery, Sport Traumatol Arthrosc 29:2118–2133

    Article  Google Scholar 

  5. Song JG, Yun SJ, Song YW, Lee SH (2019) High performance of critical shoulder angle for diagnosing rotator cuff tears on radiographs. Knee Surg Sports Traumatol Arthrosc 27:289–298. https://doi.org/10.1007/s00167-018-5247-1

    Article  PubMed  Google Scholar 

  6. Tang Y, Hou J, Li Q et al (2019) The effectiveness of using the critical shoulder angle and acromion index for predicting rotator cuff tears: accurate diagnosis based on standard and nonstandard anteroposterior radiographs. Arthroscopy 35:2553–2561. https://doi.org/10.1016/j.arthro.2019.03.050

    Article  PubMed  Google Scholar 

  7. Lin DJ, Schwier M, Geiger B et al (2023) Deep learning diagnosis and classification of rotator cuff tears on shoulder MRI. Invest Radiol 58(6):405–412

  8. Yin L, Kong Y, Guo M et al (2022) A preliminary attempt to use radiomic features in the diagnosis of extra-articular long head biceps tendinitis. MAGMA 36:651–658

  9. Zhan J, Liu S, Dong C et al (2023) Shoulder MRI-based radiomics for diagnosis and severity staging assessment of surgically treated supraspinatus tendon tears. Eur Radiol 33:5587–5593

  10. Fritz B, Yi PH, Kijowski R, Fritz J (2023) Radiomics and deep learning for disease detection in musculoskeletal radiology: an overview of novel MRI-and CT-based approaches. Invest Radiol 58:3–13

    Article  PubMed  Google Scholar 

  11. Suter T, Gerber Popp A, Zhang Y et al (2015) The influence of radiographic viewing perspective and demographics on the critical shoulder angle. J Shoulder Elbow Surg 24:e149–e158. https://doi.org/10.1016/j.jse.2014.10.021

    Article  PubMed  PubMed Central  Google Scholar 

  12. Wang T-F, Chen D-S, Zhu J et al (2021) Unsupervised machine learning-based analysis of clinical features, bone mineral density features and medical care costs of rotator cuff tears. Risk Manag Healthc Policy 14:3977–3986

  13. Wainberg M, Merico D, Delong A, Frey BJ (2018) Deep learning in biomedicine. Nat Biotechnol 36:829–838

    Article  CAS  PubMed  Google Scholar 

  14. Docter S, Khan M, Ekhtiari S et al (2019) The relationship between the critical shoulder angle and the incidence of chronic, full-thickness rotator cuff tears and outcomes after rotator cuff repair: a systematic review. Arthroscopy 35:3135–3143

    Article  PubMed  Google Scholar 

  15. Spross C, Behrens G, Dietrich TJ et al (2019) Early arthroscopic repair of acute traumatic massive rotator cuff tears leads to reliable reversal of pseudoparesis: clinical and radiographic outcome. Arthroscopy 35:343–350

    Article  PubMed  Google Scholar 

  16. Cao Y, Geddes TA, Yang JYH, Yang P (2020) Ensemble deep learning in bioinformatics. Nat Mach Intell 2:500–508

    Article  Google Scholar 

Download references

Acknowledgements

We would like to express our appreciation to the OnekeyAI platform and its developers for their contributions to this study. We are also grateful to all the individuals who participated in these studies as well as each of the researchers and technicians who made this work possible.

Funding

This study was supported by the National Natural Science Foundation of China (no.81972067, 82002342) and the Fundamental Research Funds for the Central Universities, Sun Yat-sen University (2020004).

Author information

Authors and Affiliations

Authors

Contributions

YA conceived and designed the study. YA, ZZ, SC, JZ, YH Z, and CL obtained, organized, and cleaned the dataset. YA, JH, YL, and CZ performed the data analysis. YA wrote the manuscript. YA, RY, and Wang D.M revised the manuscript. TL and SC supervised the whole process. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Rui Yang.

Ethics declarations

Ethics approval and consent to participate

This retrospective study was conducted in compliance with the Helsinki Declaration and was approved by the ethics committees of all participating hospitals (SYSEC-KY-KS-2021–184). Given the retrospective study design, informed consent was waived.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Alike, Y., Li, C., Hou, J. et al. Enhancing prediction of supraspinatus/infraspinatus tendon complex injuries through integration of deep visual features and clinical information: a multicenter two-round assessment study. Insights Imaging 14, 200 (2023). https://doi.org/10.1186/s13244-023-01551-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s13244-023-01551-1

Keywords