Ville Komulainen
ville.m.komulainen@utu.fi ORCID identifier: https://orcid.org/0009-0002-1283-7353 |
Publications
- An Expanded Massive Multilingual Dataset for High-Performance Language Technologies (HPLT) (2025)
- Annual Meeting of the Association for Computational Linguistics
(A4 Refereed article in a conference publication ) - Got Compute, but No Data: Lessons From Post-training a Finnish LLM (2025)
- NEALT proceedings series
(A4 Refereed article in a conference publication ) - Poro 34B and the Blessing of Multilinguality (2025)
- NEALT proceedings series
(A4 Refereed article in a conference publication ) - FinGPT: Large Generative Models for a Small Language (2023) Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing Luukkonen Risto, Komulainen Ville, Luoma Jouni, Eskelinen Anni, Kanerva Jenna, Kupari Hanna-Mari, Ginter Filip, Laippala Veronika, Muennighoff Niklas, Piktus Aleksandra, Wang Thomas, Tazi Nouamane, Scao Le Teven, Wolf Thomas, Suominen Osma, Sairanen Samuli, Merioksa Mikko, Heinonen Jyrki, Vahtola Aija, Antao Samuel, Pyysalo Sampo
(A4 Refereed article in a conference publication )