Generative AI - Signal to Noise - UTU Tutkimustietojärjestelmä

A1 Vertaisarvioitu alkuperäisartikkeli tieteellisessä lehdessä

Generative AI - Signal to Noise

Tekijät: Kane, Adam; Correia, Ricardo; Healy, Kevin; Jackson, Andrew

Kustantaja: Springer Science and Business Media LLC

Julkaisuvuosi: 2025

Lehti: Digital society

Artikkelin numero: 51

Vuosikerta: 4

ISSN: 2731-4650

eISSN: 2731-4669

DOI: https://doi.org/10.1007/s44206-025-00209-3

Julkaisun avoimuus kirjaamishetkellä: Avoimesti saatavilla

Julkaisukanavan avoimuus : Osittain avoin julkaisukanava

Verkko-osoite: https://doi.org/10.1007/s44206-025-00209-3

Rinnakkaistallenteen osoite: https://research.utu.fi/converis/portal/detail/Publication/500103776

Tiivistelmä

The sudden deployment of large language models (LLMs) has been a seismic event for science, with professional scientists, including biologists, struggling to work out how to fit this new technology into their working lives. The benefits of LLMs are manifold but here we flag a neglected and very serious negative aspect of their use in the area of culturomics. This field depends on analysing word frequencies to pick out the prevailing zeitgeist in corpora of text that are readily available online through social media and analysable through modern software. This provides insights into human culture on a scale that was impossible 20 years ago. Culturomics has influenced many topics where understanding the human perspective is key. However, LLMs are \u2018polluting the waters\u2019 by producing AI generated text that is, by definition, not what people are talking about. We believe there\u2019s a strong case to be made for highlighting the nature of LLM pollution and give our view for how to clean the waters.

Ladattava julkaisu

This is an electronic reprint of the original article.
This reprint may differ from the original in pagination and typographic detail. Please cite the original version.

s44206-025-00209-3.pdf

Julkaisussa olevat rahoitustiedot:
Open Access funding provided by the IReL Consortium.