16 items according to 6 key domains | Range | Median (range) | Percentage of ideal score, n (%) | Adherence rate, n (%) |
---|---|---|---|---|
Total 16 items | −8 to 36 | 10 (3–18) | 305/1044 (29.2) | 207/464 (44.6) |
Domain 1: protocol quality and stability in image and segmentation | 0–5 | 2 (0–3) | 50/145 (34.5) | 50/116 (43.1) |
Protocol quality | 0–2 | 1 (0–1) | 22/58 (37.9) | 22/29 (75.9) |
Multiple segmentations | 0–1 | 1 (0–1) | 20/29 (69.0) | 20/29 (69.0) |
Test–retest | 0–1 | 0 (0–1) | 8/29 (27.6) | 8/29 (27.6) |
Phantom study | 0–1 | 0 (0–0) | 0/29 (0.0) | 0/29 (0.0) |
Domain 2: feature selection and validation | −8 to 8 | 5 (−8 to 8) | 94/232 (40.5) | 49/58 (84.5) |
Feature reduction or adjustment of multiple testing | −3 to 3 | 3 (3–3) | 69/87 (79.3) | 26/29 (89.7) |
Validation | −5 to 5 | 2 (−5 to 5) | 25/145 (17.2) | 23/29 (79.3) |
Domain 3: biologic/clinical validation and utility | 0–6 | 2 (0–5) | 69/174 (39.7) | 61/116 (52.6) |
Non-radiomics features | 0–1 | 1 (0–1) | 18/29 (62.1) | 18/29 (62.1) |
Biologic correlations | 0–1 | 1 (0–1) | 27/29 (93.1) | 27/29 (93.1) |
Comparison to “gold standard” | 0–2 | 0 (0–2) | 16/58 (27.6) | 8/29 (27.6) |
Potential clinical utility | 0–2 | 0 (0–1) | 8/58 (13.8) | 8/29 (27.6) |
Domain 4: model performance index | 0–5 | 2 (1–4) | 61/145 (42.1) | 35/87 (40.2) |
Cut-off analysis | 0–1 | 0 (0–0) | 0/29 (0.0) | 0/29 (0.0) |
Discrimination statistics | 0–2 | 2 (1–2) | 49/58 (84.5) | 29/29 (100.0) |
Calibration statistics | 0–2 | 0 (0–2) | 12/58 (20.7) | 6/29 (20.7) |
Domain 5: high level of evidence | 0–8 | 0 (0–7) | 21/232 (9.1) | 3/58 (5.2) |
Prospective study | 0–7 | 0 (0–7) | 21/203 (10.3) | 3/29 (10.3) |
Cost-effectiveness analysis | 0–1 | 0 (0–0) | 0/29 (0.0) | 0/29 (0.0) |
Domain 6: open science and data | 0–4 | 0 (0–2) | 10/116 (8.6) | 9/29 (31.0) |
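
The two ratio columns can be reproduced with simple arithmetic. The sketch below is illustrative only and assumes 29 included studies, a count inferred from the per-item denominators in the table rather than stated in it. It also assumes that the ideal score is the maximum points for a row multiplied by the number of studies, and that the adherence denominator is the number of items in the row multiplied by the number of studies. The function names are hypothetical.

```python
N_STUDIES = 29  # assumption: inferred from the per-item denominators (e.g. 20/29)


def pct_of_ideal(total_points, max_points, n_studies=N_STUDIES):
    """Return (ideal score, percentage of ideal score rounded to 1 decimal)."""
    ideal = max_points * n_studies
    return ideal, round(100 * total_points / ideal, 1)


def adherence_rate(n_adherent, n_items=1, n_studies=N_STUDIES):
    """Return (denominator, adherence percentage rounded to 1 decimal)."""
    denom = n_items * n_studies
    return denom, round(100 * n_adherent / denom, 1)


# Total row: maximum of 36 points, 16 items
print(pct_of_ideal(305, 36))    # (1044, 29.2)
print(adherence_rate(207, 16))  # (464, 44.6)
```

Under these assumptions, the domain rows follow the same pattern: Domain 1 has a maximum of 5 points over 4 items, giving denominators of 145 and 116.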