A4 Vertaisarvioitu artikkeli konferenssijulkaisussa

Can Radiomics Based Models Survive Across MRI Scanners?




TekijätChaudhary, Jatin; Jambor, Ivan; Aronen, Hannu; Ettala, Otto; Saunavaara, Jani; Boström, Peter; Heikkonen, Jukka; Kanth, Rajeev; Merisaari, Harri

ToimittajaChoudrie, Jyoti; Mahalle, Parikshit N.; Perumal, Thinagaran; Joshi, Amit

Konferenssin vakiintunut nimiInternational Conference on Information and Communication Technology for Intelligent Systems

KustantajaSpringer Nature Singapore

Julkaisuvuosi2026

Lehti: Lecture Notes in Networks and Systems

Kokoomateoksen nimiICT for Intelligent Systems : Proceedings of ICTIS 2025, Volume 12

Vuosikerta1493

Aloitussivu579

Lopetussivu592

ISBN978-981-96-8398-7

eISBN978-981-96-8399-4

ISSN2367-3370

eISSN2367-3389

DOIhttps://doi.org/10.1007/978-981-96-8399-4_52

Julkaisun avoimuus kirjaamishetkelläEi avoimesti saatavilla

Julkaisukanavan avoimuus Ei avoin julkaisukanava

Verkko-osoitehttps://doi.org/10.1007/978-981-96-8399-4_52


Tiivistelmä

Background: Prostate cancer (PCa) is the most common malignancy among men in the Western world and a leading cause of cancer-related mortality. Machine learning (ML) models leveraging radiomic features from Magnetic Resonance Imaging (MRI) have shown promise in improving diagnostic accuracy. However, a crucial question remains: Can radiomics-based models maintain their performance across different MRI scanners, or do vendor-specific variations undermine their reliability? This study systematically evaluates the reproducibility and robustness of ML models trained on radiomic features extracted using Pyradiomics and MRCradiomics across MRI scanners from different manufacturers. 

Methods: We analyzed imaging data from 637 men with clinical suspicion of PCa, obtained from multiple clinical trials. Axial T2-weighted MRI scans were acquired using Siemens MAGNETOM Verio 3T and Philips Ingenia 3T scanners. Radiomic features were extracted using Pyradiomics and MRCradiomics, resulting in 2693 features. Feature selection via Maximum Relevance Minimum Redundancy (MRMR) reduced this set to 14 highly predictive features. Two ML models “Support Vector Machine (SVM) and Random Forest (RF)” were trained and evaluated on distinct training, validation, and test datasets, with performance assessed using Area Under the Curve (AUC) metrics. 

Results: The SVM model, trained on combined Pyradiomics and MRCradiomics features, achieved an AUC of 0.74 on the Multi-Improd dataset, yet its performance dropped drastically to 0.35 on the Philips test set, indicating poor cross-vendor reproducibility. Similarly, the Random Forest model performed well on the Multi-Improd dataset (AUC = 0.73) but declined to 0.60 on the Philips set. Interestingly, models trained solely on Pyradiomics features demonstrated greater robustness, with the Random Forest model achieving an AUC of 0.78 on the Philips test set. 

Conclusions: While radiomics-based ML models show promise for PCa detection, their generalizability across MRI scanners is far from guaranteed. Performance disparities across vendors highlight the critical need for standardized radiomic feature extraction pipelines to ensure model reliability in real-world clinical applications. Our findings suggest that some radiomics-based models can survive across MRI scanners, but only under carefully controlled conditions—reinforcing the importance of cross-vendor validation in AI-driven diagnostic tools.



Last updated on