Automatic and standardized quality assurance of digital mammography and tomosynthesis with deep convolutional neural networks

Hejduk, Patryk; Sexauer, Raphael; Ruppert, Carlotta; Borkowski, Karol; Unkelbach, Jan; Schmidt, Noemi

doi:10.1186/s13244-023-01396-8

Original Article
Open access
Published: 18 May 2023

Automatic and standardized quality assurance of digital mammography and tomosynthesis with deep convolutional neural networks

Patryk Hejduk ORCID: orcid.org/0000-0002-3839-4370¹,
Raphael Sexauer²,
Carlotta Ruppert¹,
Karol Borkowski¹,
Jan Unkelbach³ &
…
Noemi Schmidt²

Insights into Imaging volume 14, Article number: 90 (2023) Cite this article

2545 Accesses
1 Altmetric
Metrics details

Abstract

Objectives

The aim of this study was to develop and validate a commercially available AI platform for the automatic determination of image quality in mammography and tomosynthesis considering a standardized set of features.

Materials and methods

In this retrospective study, 11,733 mammograms and synthetic 2D reconstructions from tomosynthesis of 4200 patients from two institutions were analyzed by assessing the presence of seven features which impact image quality in regard to breast positioning. Deep learning was applied to train five dCNN models on features detecting the presence of anatomical landmarks and three dCNN models for localization features. The validity of models was assessed by the calculation of the mean squared error in a test dataset and was compared to the reading by experienced radiologists.

Results

Accuracies of the dCNN models ranged between 93.0% for the nipple visualization and 98.5% for the depiction of the pectoralis muscle in the CC view. Calculations based on regression models allow for precise measurements of distances and angles of breast positioning on mammograms and synthetic 2D reconstructions from tomosynthesis. All models showed almost perfect agreement compared to human reading with Cohen’s kappa scores above 0.9.

Conclusions

An AI-based quality assessment system using a dCNN allows for precise, consistent and observer-independent rating of digital mammography and synthetic 2D reconstructions from tomosynthesis. Automation and standardization of quality assessment enable real-time feedback to technicians and radiologists that shall reduce a number of inadequate examinations according to PGMI (Perfect, Good, Moderate, Inadequate) criteria, reduce a number of recalls and provide a dependable training platform for inexperienced technicians.

Key points

1
Deep convolutional neural network (dCNN) models have been trained for classification of mammography imaging quality features.
2
AI can reliably classify diagnostic image quality of mammography and tomosynthesis.
3
Quality control of mammography and tomosynthesis can be automated.

Introduction

Every year, 2.3 million women are diagnosed with breast cancer [1] representing 25% of all cancer cases in women [2]. Due to a relative lack of symptoms in the initial phases, early detection of breast cancer is important for mortality reduction [3]. Mammography breast cancer screening is the most common measure for mortality reduction, proved to lower death rates by 24–48% in controlled trials [4,5,6]. Although providing tangible benefits, screening is not a perfect solution. Risk of over-diagnosis, lack of automated tools for comprehensive evaluation of diagnostic quality, moderate accuracy in patients with dense breasts as well as high healthcare costs of breast cancer screening programmes are serious threats to the legitimacy of screening programmes [7].

For successful radiological evaluation of mammograms and tomosynthesis images, sufficient diagnostic information depicting as much breast parenchyma as possible has to be present on the two projection images. In screening programmes, quality assurance (QA) of physical and technical aspects is implemented, providing repeatable and standardized evaluations of image quality that are comparable between diagnostic units. Over the years, several QA protocols have been developed, although there is incomplete harmonization between them [8, 9]. The diagnostic image quality of an examination has a significant impact on cancer detectability [10]: inadequate positioning, image artefacts or insufficient breast compression may reduce the sensitivity for the detection of breast cancer from 84.0 to 66.3% [11]. In Europe and the USA, the most common diagnostic image quality control of mammographies are the Perfect–Good–Moderate–Inadequate (PGMI) criteria created by the United Kingdom Mammography Trainers group with the support of the Society and College of Radiographers [12], which recommend a qualitative evaluation of the diagnostic image quality to be performed by the radiologists or by a specially trained radiographer. Typically, in quality-controlled screening programmes, 70% perfect or good mammographies are requested and less than 3% inadequate mammographies are acceptable [13], in some screening programmes even stricter rules are applied with more than 75% perfect or good mammographies [14].

Even though the PGMI criteria are well defined, subjectivity of the human assessment may result in low reliability of the rating because of inter-reader variability, and it has been reported that the inter-reader agreement ranges only between slight (k = 0.02) and fair (k = 0.40) [15]. One study suggests that even 49.7% of mammograms do not satisfy the requested quality criteria of screening programmes resulting in additional and unnecessary risks for patients [15]. On top of that, each of those tasks is time-consuming, tedious and prone to errors from external factors for human readers [16].

Standardization of methods and automation of processes may have a strong impact on the image quality of mammography examinations. Thereby, standardization and automation potentially result in an increase of capacities of organizations, improved cancer detection rates, reduced diagnostic errors and a substantial decrease in healthcare costs [17]. QA automation with artificial intelligence constitutes a viable solution to those problems, minimizing errors in quality ratings and increasing general productivity in the mammography unit [18], although there is very limited commercial availability of products that are able to perform analysis of single-quality features providing explainable feedback to the end-user.

Artificial intelligence has been proven to accurately classify the breast density in mammography according to the ACR BI-RADS standard [19, 20]. Moreover, dCNNs have been used to mimic human decision-making process in the detection and classification of lesions in ultrasound imaging [21] as well as recognizing subtle features in mammographies like microcalcifications [22]. Although institutions performing screening programmes are adopting AI-based solutions, automated image quality control is not routinely available.

In this study, we trained dCNN models to determine the presence and the location of quality features in mammographies and tomosynthesis examinations, and we compared the accuracies of the dedicated dCNN models to human expert reading.

Materials and methods

Patient data

This retrospective study has been approved by the local ethics committee (“Kantonale Ethikkommission Zurich”; Approval Number: 2016–00,064 and Ethikkommission Nordwest-und Zentralschweiz, Project ID: 2021–01,472).

The included images were acquired between 2012 and 2020, the whole time span included in our ethics committee approval. The images originated from five different devices (manufactured by Siemens, Hologic, GE, IMS Giotto and Fujifilm) located in two institutions. All devices used in the study were running according to quality control tolerances and standards. Only female patients were included between 20 and 97 years old (median 57 years old).

We excluded all images with the presence of foreign objects like silicone implants, pacemakers and other foreign bodies like surgical clips, 662 images in total (478 DM, 184 DT). Postoperative patients were excluded as well.

For model training and validation, 90% of the included images were used (70% for training, 20% for validation), in detail 11,733 digital mammography (DM) images and synthetic 2D reconstructions of digital breast tomosynthesis (DBT) of 4200 female patients (CC orientation: 3565 DM, 2873 DT; MLO orientation: 2,710 DM, 2585 DT). For testing of classification models, 10% of the images were used; a separate dataset from training and validation datasets, a dataset consisting of 1,944 images, was used (CC orientation: 369 DM, 602 DT; MLO orientation: 371 DM, 602 DT). The testing of the regression models has been performed on 388 images (10% of images with position labels; CC orientation: 68 DM, 110 DT; MLO orientation: 82 DM, 128 DT).

Data preparation

Images for training and validation of dCNN models have been retrieved from the institutions’ PACS systems. Due to differences in image properties of the various manufacturers, each image was pre-processed. To avoid distortions, images from left orientation were flipped horizontally to keep breast projection on one side of an image for further post-processing. Differences in image sizes and aspect ratios were addressed by adding a stripe of black background colour to the right side of the image to create an image with square dimensions. Differences in image brightness and contrast between devices were controlled by applying windowing based on the manufacturer’s look-up tables. All images have been saved in lossless image format to avoid image quality degradation due to compression or reduction of bit depth. Images for nipple classification were cropped around the position of the nipple, which was marked by radiologists containing 10% of the original image’s width and height.

Image selection and data preparation

For determination of the image quality regarding the breast positioning, two methods were used: classification to determine feature presence and regression for precise localization of a feature on the image. In the classification task, four radiologists, with at least 3–8 years of experience in mammography imaging, independently labelled images by determining whether the respective feature is visible on an image or missing. An example of a classification of an IMF (inframammary fold) feature is presented in Fig. 1. For the regression tasks, the radiologists marked the location of a feature on the image. An example of the labelling of a position by a radiologist and the corresponding prediction of the AI model are presented in Fig. 2. All tasks related to image labelling were performed by two radiologists with at least 3 and 8 years of experience in breast imaging after the board certification, respectively.

Model selection

Eight dCNN models were trained to assess key image quality impacting the overall diagnostic image quality assessment: five classification models for detecting the presence of anatomical landmarks or features and three regression models for feature localization.

Classification models

(1)
Parenchyma depiction in CC and MLO orientation determination if the whole breast parenchyma is present on an image in CC view;
(2)
Parenchyma depiction in MLO orientation determination if the whole breast parenchyma is present on an image in MLO view;
(3)
Inframammary fold (IMF) in MLO orientation: presence of inframammary fold on the image. If fold is missing, there is also no certainty that the chest is adequately shown and corresponding tissue cannot be assessed;
(4)
Pectoralis muscle in CC orientation presence of the pectoralis muscle on a CC view to ensure that tissue near the chest wall is correctly recorded;
(5)
Nipple depiction in both CC and MLO views determination if the nipple is in profile of images of both projections and if the nipple is centered on the CC view.

Regression models

(6)
Position of the nipple in both CC and MLO views;
(7)
Pectoralis cranial in MLO view position of the pectoralis muscle at the cranial edge of the image (cranial point);
(8)
Pectoralis caudal in MLO view position of the pectoralis muscle at the thoracic wall at the edge of the image (caudal point).

An example of a qualitative evaluation of an examination is presented in Fig. 3. Anatomical landmarks found by regression models allow for the determination of the angle, in which the pectoral muscle is depicted and the calculation of the posterior nipple line in CC compared to MLO. The pectoralis angle was assessed with the use of two object detection models: detection cranial and thoracic wall edges position of the pectoral muscle compared to the evaluation performed by both experienced radiologists. The posterior nipple line length was calculated using results from the three object detection models: nipple detection in both projections and cranial and caudal points in MLO projection to assure that parts of the parenchyma near the pectoral muscle may are adequately represented. Examples of detected points and calculations are presented in Fig. 4.

Images have been randomly split into training and validation datasets in proportions of 3:1. The exact numbers of images which were used for training of each model are presented in Table 1. All images were rescaled from the original resolution to squares 224 × 224 pixels; to improve the model generalization, training and validation datasets were randomly augmented using the TensorFlow (Google LLC) data augmentation tool for rotation, horizontal, vertical shifts and zooming. The class weighting method was used to counteract class imbalances.

Table 1 Results of models training and validation

Full size table

dCNN architecture

Classification models have been trained with a network containing 13 convolutional layers with 3 × 3 kernel size with five max-pooling layers in between for dimension reduction and two dense layers with rectified linear unit activation function. Batch normalization and 50% dropout have been used for overfitting reduction. For final weights, softmax layer was used. Stochastic gradient descent was used for optimization and cross-entropy as loss function.

The regression models used similar architecture, but the loss function was replaced with a mean squared error (MSE), and Sigmoid was used as an activation function. Input for the regression model was a tuple containing an image and coordinates of a marked point presenting the position of a desired feature.

Training has been done on batches of 32 images with a learning rate of 1.0 × 10⁻⁵, and dCNN models were trained for 160 epochs, with a new model being saved after each epoch. The dCNN model with the highest validation accuracy or lowest validation RMSE has been chosen for testing. dCNN models were trained on a machine equipped with an Nvidia 1080 GTX GPU running a TensorFlow 2.0 software library on Ubuntu 16.04 OS.

Image analysis

All image analyses were done with a commercial platform (“b-box”, v 1.1, b-rayZ AG), a framework for automatic analyses of mammography images and synthetic 2D reconstructions from tomosynthesis. Test images were uploaded to the DICOM server of the b-box and automatically evaluated with the above-described dCNN models. The results of dCNN models’ predictions have been compared with classifications and regressions performed by radiologists as ground truth. The results evaluation for the classification tasks has been performed using the accuracy metric comparing class outcome of the model with the class outcome labelled by the radiologist. Regression models predicted coordinates of the feature position in comparison with the feature position marked by the radiologist, and the RMSE metric was used for error calculation. Images were analysed on a diagnostic monitor (EIZO RX350, resolution: 1536 × 2048, EIZO).

Statistical analysis

Confidence intervals (CIs) for prediction accuracies were calculated using the Monte Carlo method based on the distribution of prediction probabilities or errors for the test datasets. The statistical analysis was performed using the Scikit-learn 0.22.1 package for the Python programming language. Predictions of both accuracies and RMSE were evaluated with a confidence interval (CI) = 0.95.

Results

Training, validation and test results

Training of classification dCNN models resulted in accuracies between 84.6 and 89.4% for training images and 84.9–92.7% on the validation datasets. Regression models showed RMSEs ranging from 8.2 to 10.5 mm for training data and from 8.4 to 12.9 mm for the validation data. The detailed results of the training and validation accuracies and errors are presented in Table 1. The evaluation of results on test datasets is presented in Table 2.

Table 2 Results of model accuracies and errors on test datasets

Full size table

Parenchyma classification

Depiction of parenchyma by AI models was correct in 97.1% [95% CI 96.2–97.7%] of test cases in CC view and in 87.3% [95% CI 86.3–88.2%] of test images in MLO view. In CC view precision was 0.93, recall 0.99 and F1-score 0.96. In MLO view precision was 0.94, recall 0.89 and F1-score 0.91. Almost perfect agreement with radiologists was achieved with a Cohen’s kappa score of 0.94 [95% CI 0.93–0.96].

Inframammary fold classification

Depiction of Inframammary fold in MLO projection was correct on 96.3% [95% CI 94.9–97.4%] of images. Precision of the model was 0.94, recall 0.99 and F1-score 0.96. The model showed almost perfect agreement with radiologists with a Cohen’s kappa score of 0.93 [95% CI 0.90–0.95].

Pectoralis muscle classification

Depiction of the pectoral muscle in CC projection was correct on 98.5% [95% CI 98.2–99.5%] of images. Precision of the model was 0.98; recall and F1-score were 0.99. The model showed almost perfect agreement with radiologists with a Cohen’s kappa score of 0.96 [95% CI 0.94–0.98].

Nipple depiction

Depiction and classification of nipple in profile in MLO and nipple in profile and centered in CC images was correct in 93.0% [95% CI 91.7–94.0%] of the evaluated cases. Precision of the model was 0.99, recall 0.94 and F1-score 0.96. The model showed moderate agreement with radiologists was achieved with a Cohen’s kappa score of 0.42 [95% CI 0.34–0.50].

Regression models

The regression model for nipple position detection was able to predict the position with an error of 8.4 mm [95% CI 7.3–9.5 mm].

In MLO view, the cranial point of a pectoralis muscle was detected with an error of 17.9 mm [95% CI 13.2–21.0 mm] and the caudal point with an error of 13.2 mm [95% CI 9.9–16.0 mm].

Quality features calculations

The accuracy of the calculation of pectoralis angle was 99.1% [95% CI 98.2–99.5%] and for posterior nipple line 99.0% [95% CI 98.2–99.5%]. Precision, recall and F1-score were 0.99 in both cases. The calculations have been compared to the evaluation performed by our experienced radiologists. For the radiologists, accuracy of the calculation was 99.0% and precision, recall and F1-score were 0.99. Agreement with radiologists was almost perfect, and Cohen’s kappa score was 0.97 [95% CI 0.95–0.99].

Discussion

In this study, we were able to show that deep convolutional neural networks can be used to assess quality features on mammograms and synthetic 2D reconstructions from tomosynthesis. Classification models achieved over 85% accuracy for determination of key image quality features in breast imaging. Regression models allowed for precise localization of desired features to indicate possible positioning errors. When comparing results of the test dataset evaluation, all dCNN models were able to achieve almost perfect agreement with the radiologists.

In recent years, AI-based computer-aided detection solutions have attracted the attention of radiologists worldwide. A dCNN is an excellent method for image classification tasks. AI performance in breast cancer detection in mammography is comparable to experienced radiologists [23, 24], despite the struggle to find a way into the clinical workflow. Part of the reason for the lacking translation into the clinical world could be an error of analyzing images that do not contain sufficient information. Errors in positioning leading to insufficient image quality are common, and correct positioning is crucial for assessment [25]. The sensitivity of the assessment of mammograms drops by 21% among cases with failed positioning compared to cases with correct positioning [11]. Audits performed in the USA showed insufficient positioning as the leading cause for failed unit accreditations in ACR-accredited faculties, being responsible for 79% of all errors [26].

Current studies of quality control in breast cancer diagnosis focus on quality assurance. CNNs potentially allow for automated image quality assessment according to European Reference Organization for Quality Assured Breast Cancer Screening and Diagnostic Services (EUREF) [27]. However, the number of studies focusing on specific features visible on images, impact image quality is rather limited. Subjectivity of visual assessment is limiting the consensus within radiographers and radiologists while assessing image quality [8]. Lack of full harmonization and low adherence to ACR and EU guidelines result in a wide variability of results and openness to interpretation [28]. Lack of evidence-based practice and insufficient equipment for assessment lead to a situation where 44% of radiographers are not fully aware of the guidelines to be followed in their practice [29]. This lack of standardization is visible when analyzing tomosynthesis images. Due to lack of widely accepted guidelines for modern imaging techniques, synthetic 2D reconstructions from tomosynthesis were used for evaluation and visualization. Initial studies have been launched to apply AI to other techniques showing promising preliminary results in automated breast ultrasound [30] and breast CT [31]. AI solutions show promise of a significant reduction of workload for technicians while allowing standardization of procedures [32]. Image quality assessment could also be beneficially applied to diagnostic mammographies; however, in symptomatic patients with, e.g. mastodynia, signs of inflammation or post-operative scars it might prove difficult to apply the same standards as used in a screening setting. Therefore, dedicated guidelines for diagnostic mammographies might be necessary.

Our study has some limitations: (1) Datasets chosen for training, validation and testing may potentially be biased due to the retrospective nature of this study. (2) Images in this study came from a limited number of manufacturers, which may have an effect on a number of possible variants of images used for analysis. However, the purpose of this study was to provide a proof-of-principle of automated quality assessment and to obtain initial experiences on achievable accuracies. (3) Not all ethnicities were included in this study; however, representative data were taken from the general cohort of female patients in our institutions. (4) In this study, no other architectures of AI algorithms have been evaluated. Therefore, it might be possible to achieve even higher accuracies with other dCNN configurations. (5) This is a retrospective study focusing on evaluation of single-quality features. To evaluate an impact of real-time feedback on the performance of diagnostic units, an additional prospective study shall be conducted.

In conclusion, this study has proven the feasibility of automated assessment of quality features with the trained AI models that allow for standardization of quality control. The clinical implementation of our solution may allow for fast, observer-independent feedback to radiographers and radiologists in the assessment of the overall image quality for breast positioning. This may improve the sensitivity and specificity of examinations and reduce errors, recalls and workload for radiologists and technicians.

Availability of data and materials

Not applicable.

Abbreviations

ACR:: American College of Radiology
AI:: Artificial intelligence
CC:: Cranio-caudal
dCNN:: Deep convolutional neural network
DM:: Digital mammography
IMF:: Inframammary fold
MLO:: Medio-lateral oblique
PGMI:: Perfect–Good–Moderate–Inadequate quality assessment scoring system
QA:: Quality assurance
QC:: Quality control
RMSE:: Root mean squared error

References

World Health Organization (2021) Breast cancer. World Health Organization, Geneva. https://www.who.int/news-room/fact-sheets/detail/breast-cancer.
DeSantis CE, Bray F, Ferlay J, Lortet-Tieulent J, Anderson BO, Jemal A (2015) International variation in female breast cancer incidence and mortality rates. Cancer Epidemiol Biomark Prev 24(10):1495–1506
Article Google Scholar
Milosevic M, Jankovic D, Milenkovic A, Stojanov D (2018) Early diagnosis and detection of breast cancer. Technol Health Care 26(4):729–759
Article PubMed Google Scholar
Schopper D, de Wolf C (2019) How effective are breast cancer screening programmes by mammography? Review of the current evidence. Eur J Cancer 45(11):1916–1923
Article Google Scholar
Broeders M, Moss S, Nyström L et al (2012) The impact of mammographic screening on breast cancer mortality in Europe: a review of observational studies. J Med Screen 19:14–25
Article PubMed Google Scholar
Njor S, Nyström L, Moss S et al (2012) Breast cancer mortality in mammographic screening in Europe: a review of incidence-based mortality studies. J Med Screen 19:33–41
Article PubMed Google Scholar
Shah TA, Guraya SS (2017) Breast cancer screening programs: Review of merits, demerits, and recent recommendations practiced across the world. J Microsc Ultrastruct 5(2):59–69
Article PubMed Google Scholar
Reis C, Pascoal A, Sakellaris T et al (2013) Quality assurance and quality control in mammography: a review of available guidance worldwide. Insights Imaging 4:539–553
Article PubMed PubMed Central Google Scholar
Hill C, Robinson L (2015) Mammography image assessment; validity and reliability of current scheme. Radiography 21(4):304–307
Article Google Scholar
Saunders RS Jr, Baker JA, Delong DM, Johnson JP, Samei E (2007) Does image quality matter? Impact of resolution and noise on mammographic task performance. Med Phys 34:3971–3981
Article PubMed Google Scholar
Taplin SH, Rutter CM, Finder C, Mandelson MT, Houn F, White E (2002) Screening mammography: clinical image quality and the risk of interval breast cancer. AJR Am J Roentgenol 178(4):797–803
Article PubMed Google Scholar
NHSBSP (2021) NHS Breast screening programme screening standards valid for data collected from 1 April 2017, National Health Service, https://www.gov.uk/government/publications/breast-screening-consolidated-programme-standards/nhs-breast-screening-programme-screening-standards-valid-for-data-collected-from-1-april-2017
NHSBSP (2006) National Health Service Breast Cancer Screening Programme. National quality assurance coordinating group for radiography. Quality assurance guidelines for mammography including radiographic quality control, Sheffield
National Screening Service (2015) Guidelines for quality assurance in mammography screening, Dublin
Guertin MH, Théberge I, Dufresne MP et al (2014) Clinical image quality in daily practice of breast cancer mammography screening. Can Assoc Radiol J 65(4):199–206
Article PubMed Google Scholar
Bernstein MH, Baird GL, Lourenco AP (2022) Digital breast tomosynthesis and digital mammography recall and false-positive rates by time of day and reader experience. Radiology 303(1):63–68
Article PubMed Google Scholar
Akhigbe AO, Igbinedion BO (2013) Mammographic screening and reporting: a need for standardisation. Rev Niger Postgrad Med J 20(4):346–351
Article CAS Google Scholar
Sundell VM, Mäkelä T, Meaney A, Kaasalainen T, Savolainen S (2019) Automated daily quality control analysis for mammography in a multi-unit imaging center. Acta Radiol 60(2):140–148
Article PubMed Google Scholar
Ciritsis A, Rossi C, Vittoria De Martini I et al (2019) Determination of mammographic breast density using a deep convolutional neural network. Br J Radiol 92:1093
Google Scholar
Magni V, Interlenghi M, Cozzi A et al (2022) Development and validation of an AI-driven mammographic breast density classification tool based on radiologist consensus. Radiol Artif Intell 4(2):e210199
Article PubMed PubMed Central Google Scholar
Ciritsis A, Rossi C, Eberhard M, Marcon M, Becker AS, Boss A (2019) Automatic classification of ultrasound breast lesions using a deep convolutional neural network mimicking human decision-making. Eur Radiol 29(10):5458–5468
Article PubMed Google Scholar
Schönenberger C, Hejduk P, Ciritsis A, Marcon M, Rossi C, Boss A (2021) Classification of mammographic breast microcalcifications using a deep convolutional neural network: a BI-RADS-based approach. Invest Radiol 56(4):224–231
Article PubMed Google Scholar
Stower H (2020) AI for breast-cancer screening. Nat Med 26(2):163
PubMed Google Scholar
Pisano ED (2020) AI shows promise for breast cancer screening. Nature 577(7788):35–36
Article CAS PubMed Google Scholar
Mohd Norsuddin N, Ko ZX (2021) Common mammographic positioning error in Digital Er. 3rd International Conference for innovation in biomedical engineering and life sciences, Cham
FDA (2019) Poor Positioning responsible for most clinical image deficiencies, failures. Food and drug administration. https://www.fda.gov/radiation-emitting-products/mqsa-insights/poor-positioning-responsible-most-clinical-image-deficiencies-failures
Kretz T, Mueller KR, Schaeffter T, Elster C (2020) Mammography image quality assurance using deep learning. IEEE Trans Biomed Eng 67(12):3317–3326
Article PubMed Google Scholar
Yanpeng L, Poulos A, McLean D, Rickard M (2010) A review of methods of clinical image quality evaluation in mammography. Eur J Radiol 74(3):122–131
Article Google Scholar
Richli Meystre N, Henner A, Sà dos Reis C et al (2019) Characterization of radiographers’ mammography practice in five European countries: a pilot study. Insights Imaging 10:1–11
Article Google Scholar
Hejduk P, Marcon M, Unkelbach J et al (2022) Fully automatic classification of automated breast ultrasound (ABUS) imaging according to BI-RADS using a deep convolutional neural network. Eur Radiol 32(7):4868–4878
Article PubMed PubMed Central Google Scholar
Landsmann A, Wieler J, Hejduk P et al (2022) Applied machine learning in spiral breast-CT: can we train a deep convolutional neural network for automatic, standardized and observer independent classification of breast density? Diagnostics (Basel) 12(1):181
Article PubMed Google Scholar
Rodríguez-Ruiz A, Krupinski E, Mordang JJ, et al. (2019) Detection of breast cancer with mammography: effect of an artificial intelligence support system. Radiology 290(2):305–314
Article PubMed Google Scholar

Download references

Acknowledgements

This study was supported by the Clinical Research Priority Program Artificial Intelligence in Oncological Imaging of the University Zurich.

Funding

This study was supported by the Clinical Research Priority Program Artificial Intelligence in Oncological Imaging of the University Zurich. The authors state that this work has not received any other funding.

Author information

Authors and Affiliations

Institute of Diagnostic and Interventional Radiology, University Hospital Zurich, Rämistr. 100, 8091, Zurich, Switzerland
Patryk Hejduk, Carlotta Ruppert & Karol Borkowski
Breast Imaging, Radiology and Nuclear Medicine, University Hospital Basel, Basel, Switzerland
Raphael Sexauer & Noemi Schmidt
Department of Radiation Oncology, University Hospital Zurich, Zurich, Switzerland
Jan Unkelbach

Authors

Patryk Hejduk
View author publications
You can also search for this author in PubMed Google Scholar
Raphael Sexauer
View author publications
You can also search for this author in PubMed Google Scholar
Carlotta Ruppert
View author publications
You can also search for this author in PubMed Google Scholar
Karol Borkowski
View author publications
You can also search for this author in PubMed Google Scholar
Jan Unkelbach
View author publications
You can also search for this author in PubMed Google Scholar
Noemi Schmidt
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

NS is the scientific guarantor of this publication. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Patryk Hejduk.

Ethics declarations

Ethics approval and consent to participate

This retrospective study was approved by the local ethics committee “Kantonale Ethikkommission Zurich”, Approval Number: 2016-00064, and the Ethikkommission Nordwest-und Zentralschweiz (EKNZ), Project ID:2021-01472. Written informed consent was obtained from all subjects (patients) in this study.

Consent for publication

Not applicable.

Competing interests

The authors of this manuscript declare no relationships with any companies, whose products or services may be related to the subject matter of the article.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Hejduk, P., Sexauer, R., Ruppert, C. et al. Automatic and standardized quality assurance of digital mammography and tomosynthesis with deep convolutional neural networks. Insights Imaging 14, 90 (2023). https://doi.org/10.1186/s13244-023-01396-8

Download citation

Received: 03 November 2022
Accepted: 06 March 2023
Published: 18 May 2023
DOI: https://doi.org/10.1186/s13244-023-01396-8

Automatic and standardized quality assurance of digital mammography and tomosynthesis with deep convolutional neural networks

Abstract

Objectives

Materials and methods

Results

Conclusions

Key points

Introduction

Materials and methods

Patient data

Data preparation

Image selection and data preparation

Model selection

Classification models

Regression models

dCNN architecture

Image analysis

Statistical analysis

Results

Training, validation and test results

Parenchyma classification

Inframammary fold classification

Pectoralis muscle classification

Nipple depiction

Regression models

Quality features calculations

Discussion

Availability of data and materials

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords