Delirium Identification from Nursing Reports Using Large Language Models - UTU Research Portal

A4 Refereed article in a conference publication

Delirium Identification from Nursing Reports Using Large Language Models

Authors: Graf, Lisa; Ritzi, Alexander; Schöler, Lili M.

Editors: Andrikopoulou, Elisavet; Gallos, Parisis; Arvanitis, Theodoros N.; Austin, Rosalynn; Benis, Arriel; Cornet, Ronald; Chatzistergos, Panagiotis; Dejaco, Alexander; Dusseljee-Peute, Linda; Mohasseb, Alaa; Natsiavas, Pantelis; Nakkas, Haythem; Scott, Philip

Conference name: Medical Informatics Europe Conference

Publisher: IOS Press

Publication year: 2025

Journal: Studies in Health Technology and Informatics

Book title : Intelligent Health Systems – From Technology to Data and Knowledge: Proceedings of MIE 2025

Volume: 327

First page : 886

Last page: 887

eISBN: 978-1-64368-596-0

ISSN: 0926-9630

eISSN: 1879-8365

DOI: https://doi.org/10.3233/SHTI250492

Web address : https://doi.org/10.3233/shti250492

Self-archived copy’s web address: https://research.utu.fi/converis/portal/detail/Publication/499069125

Abstract

This study investigates large language models for delirium detection from nursing reports, comparing keyword matching, prompting, and finetuning. Using a manually labelled dataset from the University Hospital Freiburg, Germany, we tested Llama3 and Phi3 models. Both prompting and finetuning were effective, with finetuning Phi3 (3.8B) achieving the highest accuracy (90.24%) and AUROC (96.07%), significantly outperforming other methods.

Downloadable publication

This is an electronic reprint of the original article.
This reprint may differ from the original in pagination and typographic detail. Please cite the original version.

SHTI-327-SHTI250492(1).pdf