A4 Refereed article in a conference publication

Delirium Identification from Nursing Reports Using Large Language Models




AuthorsGraf, Lisa; Ritzi, Alexander; Schöler, Lili M.

EditorsAndrikopoulou, Elisavet; Gallos, Parisis; Arvanitis, Theodoros N.; Austin, Rosalynn; Benis, Arriel; Cornet, Ronald; Chatzistergos, Panagiotis; Dejaco, Alexander; Dusseljee-Peute, Linda; Mohasseb, Alaa; Natsiavas, Pantelis; Nakkas, Haythem; Scott, Philip

Conference nameMedical Informatics Europe Conference

PublisherIOS Press

Publication year2025

JournalStudies in Health Technology and Informatics

Book title Intelligent Health Systems – From Technology to Data and Knowledge: Proceedings of MIE 2025

Volume327

First page 886

Last page887

eISBN978-1-64368-596-0

ISSN0926-9630

eISSN1879-8365

DOIhttps://doi.org/10.3233/SHTI250492

Web address https://doi.org/10.3233/shti250492

Self-archived copy’s web addresshttps://research.utu.fi/converis/portal/detail/Publication/499069125


Abstract

This study investigates large language models for delirium detection from nursing reports, comparing keyword matching, prompting, and finetuning. Using a manually labelled dataset from the University Hospital Freiburg, Germany, we tested Llama3 and Phi3 models. Both prompting and finetuning were effective, with finetuning Phi3 (3.8B) achieving the highest accuracy (90.24%) and AUROC (96.07%), significantly outperforming other methods.


Downloadable publication

This is an electronic reprint of the original article.
This reprint may differ from the original in pagination and typographic detail. Please cite the original version.





Last updated on 2025-07-08 at 08:16