A4 Refereed article in a conference publication

Improving layman readability of clinical narratives with unsupervised synonym replacement




AuthorsMoen H., Peltonen L., Koivumäki M., Suhonen H., Salakoski T., Ginter F., Salanterä S.

EditorsAdrien Ugon, Daniel Karlsson, Gunnar O. Klein, Anne Moen

Conference nameConference on Medical Informatics Europe

PublisherIOS Press

Publication year2018

JournalMedical informatics Europe

Book title Building Continents of Knowledge in Oceans of Data: The Future of Co-Created eHealth

Journal name in sourceStudies in Health Technology and Informatics

Series titleStudies in Health Technology and Informatics

Volume247

First page 725

Last page729

Number of pages5

ISBN978-1-61499-851-8

eISBN978-1-61499-852-5

ISSN0926-9630

DOIhttps://doi.org/10.3233/978-1-61499-852-5-725

Web address http://ebooks.iospress.nl/publication/48887

Self-archived copy’s web addresshttps://research.utu.fi/converis/portal/detail/Publication/32055606


Abstract

We report on the development and evaluation of a prototype tool aimed to assist laymen/patients in understanding the content of clinical narratives. The tool relies largely on unsupervised machine learning applied to two large corpora of unlabeled text – a clinical corpus and a general domain corpus. A joint semantic word-space model is created for the purpose of extracting easier to understand alternatives for words considered difficult to understand by laymen. Two domain experts evaluate the tool and inter-rater agreement is calculated. When having the tool suggest ten alternatives to each difficult word, it suggests acceptable lay words for 55.51% of them. This and future manual evaluation will serve to further improve performance, where also supervised machine learning will be used.


Downloadable publication

This is an electronic reprint of the original article.
This reprint may differ from the original in pagination and typographic detail. Please cite the original version.





Last updated on 2024-26-11 at 20:35