Refereed journal article or data article (A1)
Cost-effective survival prediction for patients with advanced prostate cancer using clinical trial and real-world hospital registry datasets
List of Authors: Mika Murtojärvi, Anni S. Halkola, Antti Airola, Teemu D. Laajala, Tuomas Mirtti, Tero Aittokallio, Tapio Pahikkala
Publisher: Elsevier Ireland Ltd
Publication year: 2020
Journal: International Journal of Medical Informatics
Journal acronym: INT J MED INFORM
Volume number: 133
Number of pages: 8
ISSN: 1386-5056
eISSN: 1872-8243
DOI: http://dx.doi.org/10.1016/j.ijmedinf.2019.104014
URL: http://www.sciencedirect.com/science/article/pii/S1386505618311857
Self-archived copy’s web address: https://research.utu.fi/converis/portal/detail/Publication/43869141
Introduction
Predictive survival modeling offers systematic tools for clinical decision-making and individualized tailoring of treatment strategies to improve patient outcomes while reducing overall healthcare costs. In 2015, a number of machine learning and statistical models were benchmarked in the DREAM 9.5 Prostate Cancer Challenge, based on open clinical trial data for metastatic castration resistant prostate cancer (mCRPC). However, applying these models into clinical practice poses a practical challenge due to the inclusion of a large number of model variables, some of which are not routinely monitored or are expensive to measure.
ObjectivesTo develop cost-specified variable selection algorithms for constructing cost-effective prognostic models of overall survival that still preserve sufficient model performance for clinical decision making.
MethodsPenalized Cox regression models were used for the survival prediction. For the variable selection, we implemented two algorithms: (i) LASSO regularization approach; and (ii) a greedy cost-specified variable selection algorithm. The models were compared in three cohorts of mCRPC patients from randomized clinical trials (RCT), as well as in a real-world cohort (RWC) of advanced prostate cancer patients treated at the Turku University Hospital. Hospital laboratory expenses were utilized as a reference for computing the costs of introducing new variables into the models.
ResultsCompared to measuring the full set of clinical variables, economic costs could be reduced by half without a significant loss of model performance. The greedy algorithm outperformed the LASSO-based variable selection with the lowest tested budgets. The overall top performance was higher with the LASSO algorithm.
ConclusionThe cost-specified variable selection offers significant budget optimization capability for the real-world survival prediction without compromising the predictive power of the model.
Downloadable publication This is an electronic reprint of the original article. |