A4 Vertaisarvioitu artikkeli konferenssijulkaisussa
Can Radiomics Based Models Survive Across MRI Scanners?
Tekijät: Chaudhary, Jatin; Jambor, Ivan; Aronen, Hannu; Ettala, Otto; Saunavaara, Jani; Boström, Peter; Heikkonen, Jukka; Kanth, Rajeev; Merisaari, Harri
Toimittaja: Choudrie, Jyoti; Mahalle, Parikshit N.; Perumal, Thinagaran; Joshi, Amit
Konferenssin vakiintunut nimi: International Conference on Information and Communication Technology for Intelligent Systems
Kustantaja: Springer Nature Singapore
Julkaisuvuosi: 2026
Lehti: Lecture Notes in Networks and Systems
Kokoomateoksen nimi: ICT for Intelligent Systems : Proceedings of ICTIS 2025, Volume 12
Vuosikerta: 1493
Aloitussivu: 579
Lopetussivu: 592
ISBN: 978-981-96-8398-7
eISBN: 978-981-96-8399-4
ISSN: 2367-3370
eISSN: 2367-3389
DOI: https://doi.org/10.1007/978-981-96-8399-4_52
Julkaisun avoimuus kirjaamishetkellä: Ei avoimesti saatavilla
Julkaisukanavan avoimuus : Ei avoin julkaisukanava
Verkko-osoite: https://doi.org/10.1007/978-981-96-8399-4_52
Background: Prostate cancer (PCa) is the most common malignancy among men in the Western world and a leading cause of cancer-related mortality. Machine learning (ML) models leveraging radiomic features from Magnetic Resonance Imaging (MRI) have shown promise in improving diagnostic accuracy. However, a crucial question remains: Can radiomics-based models maintain their performance across different MRI scanners, or do vendor-specific variations undermine their reliability? This study systematically evaluates the reproducibility and robustness of ML models trained on radiomic features extracted using Pyradiomics and MRCradiomics across MRI scanners from different manufacturers.
Methods: We analyzed imaging data from 637 men with clinical suspicion of PCa, obtained from multiple clinical trials. Axial T2-weighted MRI scans were acquired using Siemens MAGNETOM Verio 3T and Philips Ingenia 3T scanners. Radiomic features were extracted using Pyradiomics and MRCradiomics, resulting in 2693 features. Feature selection via Maximum Relevance Minimum Redundancy (MRMR) reduced this set to 14 highly predictive features. Two ML models “Support Vector Machine (SVM) and Random Forest (RF)” were trained and evaluated on distinct training, validation, and test datasets, with performance assessed using Area Under the Curve (AUC) metrics.
Results: The SVM model, trained on combined Pyradiomics and MRCradiomics features, achieved an AUC of 0.74 on the Multi-Improd dataset, yet its performance dropped drastically to 0.35 on the Philips test set, indicating poor cross-vendor reproducibility. Similarly, the Random Forest model performed well on the Multi-Improd dataset (AUC = 0.73) but declined to 0.60 on the Philips set. Interestingly, models trained solely on Pyradiomics features demonstrated greater robustness, with the Random Forest model achieving an AUC of 0.78 on the Philips test set.
Conclusions: While radiomics-based ML models show promise for PCa detection, their generalizability across MRI scanners is far from guaranteed. Performance disparities across vendors highlight the critical need for standardized radiomic feature extraction pipelines to ensure model reliability in real-world clinical applications. Our findings suggest that some radiomics-based models can survive across MRI scanners, but only under carefully controlled conditions—reinforcing the importance of cross-vendor validation in AI-driven diagnostic tools.