A1 Refereed original research article in a scientific journal

The Interplay Between PROM Score Distributions and Treatment Effect Detection Likelihood in Randomized Controlled Trials – A Meta-epidemiologic study




AuthorsPanula, Valtteri; Saarinen, Antti; Vaajala, Matias; Liukkonen, Rasmus; Pakarinen, Oskari; Laaksonen, Juho; Ponkilainen, Ville; Kuitunen, Ilari; Uimonen, Mikko

PublisherElsevier

Publication year2026

Journal: Journal of Clinical Epidemiology

Article number112114

Volume191

ISSN0895-4356

eISSN1878-5921

DOIhttps://doi.org/10.1016/j.jclinepi.2025.112114

Publication's open availability at the time of reportingOpen Access

Publication channel's open availability Partially Open Access publication channel

Web address https://doi.org/10.1016/j.jclinepi.2025.112114

Self-archived copy’s web addresshttps://research.utu.fi/converis/portal/detail/Publication/506255969

Self-archived copy's licenceCC BY

Self-archived copy's versionPublisher`s PDF


Abstract
Objectives

We hypothesized that, in musculoskeletal RCTs using PROMs, higher baseline scores and the clustering of follow-up scores near the upper bound (i.e., ceiling effect) compress variability and attenuate measurable between-group differences, thereby lowering the likelihood of observing a statistically significant effect. We therefore examined how score distributions at pre-treatment and follow-up influence the likelihood of detecting between-group differences.

Study Design and Setting

We conducted a meta-epidemiologic study of RCTs, published between 2015 and 2024, that compared treatment effects on musculoskeletal disorders between two study groups using PROMs. The observed distributions of the PROM scores at baseline and follow-up were collected from the included studies. All PROM scores were rescaled to 0-100 with higher scores indicating better health. The likelihood of observing a statistically significant difference in PROM scores between the study groups was examined by calculating the score difference required to achieve a p-value < 0.05.

Results

A total of 255 RCTs were included. PROM scores improved from baseline to follow-up in most studies (98%), with a mean change of +28 points. The correlation coefficient between the mean baseline score and mean score change was -0.66 (95% CI -0.72 - -0.59) indicating that higher baseline scores were associated with lower score change. Additionally, there was a moderate correlation between the mean and SD of PROM scores at follow-up (-0.39; 95% CI -0.48 - -0.28). The mean likelihood of detecting a difference was 65% (SD 11%) at baseline and 65% (SD 11%) at follow-up. The likelihood reached the 80% benchmark in only 8.5% and 8.1% of the studies at baseline and follow-up, respectively.

Conclusion

The concentration of PROM score distributions towards the high end of the scale, especially when higher baseline scores are present, diminishes the likelihood of detecting significant differences between study groups, particularly at follow-up assessments in studies analyzing musculoskeletal complaints. This underscores the importance of critically evaluating the conclusions drawn from these studies.


Downloadable publication

This is an electronic reprint of the original article.
This reprint may differ from the original in pagination and typographic detail. Please cite the original version.




Funding information in the publication
This research did not receive any specific grant from funding agency in the public, commercial, or not-for-profit sectors.


Last updated on 14/01/2026 03:43:43 PM