A4 Refereed article in a conference publication
Improving layman readability of clinical narratives with unsupervised synonym replacement
Authors: Moen H., Peltonen L., Koivumäki M., Suhonen H., Salakoski T., Ginter F., Salanterä S.
Editors: Adrien Ugon, Daniel Karlsson, Gunnar O. Klein, Anne Moen
Conference name: Conference on Medical Informatics Europe
Publisher: IOS Press
Publication year: 2018
Journal: Medical informatics Europe
Book title : Building Continents of Knowledge in Oceans of Data: The Future of Co-Created eHealth
Journal name in source: Studies in Health Technology and Informatics
Series title: Studies in Health Technology and Informatics
Volume: 247
First page : 725
Last page: 729
Number of pages: 5
ISBN: 978-1-61499-851-8
eISBN: 978-1-61499-852-5
ISSN: 0926-9630
DOI: https://doi.org/10.3233/978-1-61499-852-5-725
Web address : http://ebooks.iospress.nl/publication/48887
Self-archived copy’s web address: https://research.utu.fi/converis/portal/detail/Publication/32055606
We report on the development and evaluation of a prototype tool aimed to assist laymen/patients in understanding the content of clinical narratives. The tool relies largely on unsupervised machine learning applied to two large corpora of unlabeled text – a clinical corpus and a general domain corpus. A joint semantic word-space model is created for the purpose of extracting easier to understand alternatives for words considered difficult to understand by laymen. Two domain experts evaluate the tool and inter-rater agreement is calculated. When having the tool suggest ten alternatives to each difficult word, it suggests acceptable lay words for 55.51% of them. This and future manual evaluation will serve to further improve performance, where also supervised machine learning will be used.
Downloadable publication This is an electronic reprint of the original article. |