A1 Refereed original research article in a scientific journal
Deep Learning Enables Automatic Detection of Joint Damage Progression in Rheumatoid Arthritis-Model Development and External Validation
Authors: Venäläinen, Mikko S.; Biehl, Alexander; Holstila, Milja; Kuusalo, Laura; Elo, Laura L.
Publisher: Oxford University Press
Publication year: 2024
Journal: Rheumatology
Journal name in source: Rheumatology (Oxford, England)
Journal acronym: Rheumatology (Oxford)
ISSN: 1462-0324
eISSN: 1462-0332
DOI: https://doi.org/10.1093/rheumatology/keae215
Web address : https://academic.oup.com/rheumatology/advance-article/doi/10.1093/rheumatology/keae215/7643527
Self-archived copy’s web address: https://research.utu.fi/converis/portal/detail/Publication/387649339
Objectives: Although deep learning has demonstrated substantial potential in automatic quantification of joint damage in rheumatoid arthritis (RA), evidence for detecting longitudinal changes at an individual patient level is lacking. Here, we introduce and externally validate our automated RA scoring algorithm (AuRA), and demonstrate its utility for monitoring radiographic progression in a real-world setting.
Methods: The algorithm, originally developed during the Rheumatoid Arthritis 2-Dialogue for Reverse Engineering Assessment and Methods (RA2-DREAM) challenge, was trained to predict expert-curated Sharp-van der Heijde total scores in hand and foot radiographs from two previous clinical studies (n = 367). We externally validated AuRA against data (n = 205) from Turku University Hospital and compared the performance against two top-performing RA2-DREAM solutions. Finally, for 54 patients, we extracted additional radiograph sets from another control visit to the clinic (average time interval of 4.6 years).
Results: In the external validation cohort, with a root-mean-square-error (RMSE) of 23.6, AuRA outperformed both top-performing RA2-DREAM algorithms (RMSEs 35.0 and 35.6). The improved performance was explained mostly by lower errors at higher expert-assessed scores. The longitudinal changes predicted by our algorithm were significantly correlated with changes in expert-assessed scores (Pearson's R = 0.74, p< 0.001).
Conclusion: AuRA had the best external validation performance and demonstrated potential for detecting longitudinal changes in joint damage. Available in https://hub.docker.com/r/elolab/aura, our algorithm can easily be applied for automatic detection of radiographic progression in the future, reducing the need for laborious manual scoring.
Downloadable publication This is an electronic reprint of the original article. |
Funding information in the publication:
M.S.V. reports funding from the Academy of Finland (grant number 322123) and the state research funding of well-being services county of Southwest Finland. L.L.E. reports grants from the European Research Council ERC (677943), European Union’s Horizon 2020 research and innovation programme (955321), Academy of Finland (310561, 314443, 329278, 335434, 335611 and 341342) and Sigrid Juselius Foundation during the conduct of the study. L.K. reports funding from the state research funding of well-being services county of Southwest Finland, Urpo Huunonen Fund and Aino Taberman Fund of University of Turku. Our research is also supported by University of Turku Graduate School (UTUGS), Biocenter Finland, and ELIXIR Finland. Disclosure statement: All authors declare that they have no conflicts of interest.