A1 Refereed original research article in a scientific journal

Combining supervised and unsupervised named entity recognition to detect psychosocial risk factors in occupational health checks




Authors: Uronen Leena, Salanterä Sanna, Hakala Kai, Hartiala Jaakko, Moen Hans

Publisher: Elsevier

Publication year: 2022

Journal: International Journal of Medical Informatics

Article number: 104695

Volume: 160

eISSN: 1872-8243

DOI: https://doi.org/10.1016/j.ijmedinf.2022.104695

Web address: https://doi.org/10.1016/j.ijmedinf.2022.104695

Self-archived copy’s web address: https://research.utu.fi/converis/portal/detail/Publication/175069684


Abstract

Introduction: In occupational health checks, information about psychosocial risk factors, which influence work ability, is documented in free text. Early detection of psychosocial risk factors helps occupational health care choose appropriate, targeted interventions to maintain work capacity. The aim of this study was to evaluate whether the recognition of these psychosocial risk factors in electronic occupational health check records can be automated with natural language processing (NLP).

Materials and methods: We compared supervised and unsupervised named entity recognition (NER) for detecting psychosocial risk factors in health check documentation. The records were written by occupational health nurses.

Results: Both methods found over 60% of the psychosocial risk factors in the records. However, the combination of BERT-NER (supervised NER) and QExp (query expansion/paraphrasing) seems to be more suitable. With both methods, the largest number of correctly identified risk factors was in the work environment and equipment category.

Conclusion: This study showed that risk factors can be detected automatically from the free-text documentation of health checks. It is possible to develop a text mining tool to automate the detection of psychosocial risk factors at an early stage.


Downloadable publication

This is an electronic reprint of the original article.
This reprint may differ from the original in pagination and typographic detail. Please cite the original version.





Last updated on 2024-11-26 at 12:35