A1 Refereed original research article in a scientific journal
The Interplay Between PROM Score Distributions and Treatment Effect Detection Likelihood in Randomized Controlled Trials – A Meta-epidemiologic study
Authors: Panula, Valtteri; Saarinen, Antti; Vaajala, Matias; Liukkonen, Rasmus; Pakarinen, Oskari; Laaksonen, Juho; Ponkilainen, Ville; Kuitunen, Ilari; Uimonen, Mikko
Publisher: Elsevier
Publication year: 2026
Journal: Journal of Clinical Epidemiology
Article number: 112114
Volume: 191
ISSN: 0895-4356
eISSN: 1878-5921
DOI: https://doi.org/10.1016/j.jclinepi.2025.112114
Publication's open availability at the time of reporting: Open Access
Publication channel's open availability : Partially Open Access publication channel
Web address : https://doi.org/10.1016/j.jclinepi.2025.112114
Self-archived copy’s web address: https://research.utu.fi/converis/portal/detail/Publication/506255969
Self-archived copy's licence: CC BY
Self-archived copy's version: Publisher`s PDF
Objectives
We hypothesized that, in musculoskeletal RCTs using PROMs, higher baseline scores and the clustering of follow-up scores near the upper bound (i.e., ceiling effect) compress variability and attenuate measurable between-group differences, thereby lowering the likelihood of observing a statistically significant effect. We therefore examined how score distributions at pre-treatment and follow-up influence the likelihood of detecting between-group differences.
Study Design and SettingWe conducted a meta-epidemiologic study of RCTs, published between 2015 and 2024, that compared treatment effects on musculoskeletal disorders between two study groups using PROMs. The observed distributions of the PROM scores at baseline and follow-up were collected from the included studies. All PROM scores were rescaled to 0-100 with higher scores indicating better health. The likelihood of observing a statistically significant difference in PROM scores between the study groups was examined by calculating the score difference required to achieve a p-value < 0.05.
ResultsA total of 255 RCTs were included. PROM scores improved from baseline to follow-up in most studies (98%), with a mean change of +28 points. The correlation coefficient between the mean baseline score and mean score change was -0.66 (95% CI -0.72 - -0.59) indicating that higher baseline scores were associated with lower score change. Additionally, there was a moderate correlation between the mean and SD of PROM scores at follow-up (-0.39; 95% CI -0.48 - -0.28). The mean likelihood of detecting a difference was 65% (SD 11%) at baseline and 65% (SD 11%) at follow-up. The likelihood reached the 80% benchmark in only 8.5% and 8.1% of the studies at baseline and follow-up, respectively.
ConclusionThe concentration of PROM score distributions towards the high end of the scale, especially when higher baseline scores are present, diminishes the likelihood of detecting significant differences between study groups, particularly at follow-up assessments in studies analyzing musculoskeletal complaints. This underscores the importance of critically evaluating the conclusions drawn from these studies.
Downloadable publication This is an electronic reprint of the original article. |
Funding information in the publication:
This research did not receive any specific grant from funding agency in the public, commercial, or not-for-profit sectors.