Veronika Laippala
mavela@utu.fi +358 29 450 3330 +358 50 328 9739 Arcanuminkuja 1 Turku
Computational linguistics; text linguistics; corpus linguistics; digital discourse analysis.
I am a linguist who likes computers. My main research topics include language variation across different communicative situations and the development of automatic tools for making better use of large, web-crawled corpora.
My ongoing projects include "A piece of news, an opinion or something else? Different texts and their detection from the multilingual Internet", funded by the Emil Aaltonen Foundation, and "Massively multilingual modeling of registers in web-scale data", funded by the Academy of Finland.
For more information, please have a look at our lab website at https://turkunlp.github.io/
- Etäyhteyksistä paluu normaaliin arkeen: yliopistovierailu kampuksella (2022)
- Leala-tutkimuskeskuksen blogi
- Explaining Classes through Stable Word Attributions (2022)
- Annual Meeting of the Association for Computational Linguistics
- Register identification from the unrestricted open Web using the Corpus of Online Registers of English (2022)
- Language Resources and Evaluation
- Selkosten Proust taipuu moneen - Iijoki-korpus ja digitaalisen tekstilouhinnan mahdollisuudet (2022) Kalle Päätalo tutkijoiden silmin Karkulehto Sanna, Laippala Veronika, Launis Kati, Märsynaho Jaana, Saviniemi Maija, Sääskilahti Minna
- Towards better structured and less noisy Web data: Oscar with Register annotations (2022)
- International Conference on Computational Linguistics
- Beyond the English web: Zero-shot cross-lingual and lightweight monolingual classification of registers (2021) Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Student Research Workshop Repo Liina, Skantsi Valtteri, Rönnqvist Samuel, Hellström Saara, Oinonen Miika, Salmela Anna, Biber Douglas, Egbert Jesse, Pyysalo Sampo, Laippala Veronika
- Exploring the role of lexis and grammar for the stable identification of register in an unrestricted corpus of web documents (2021)
- Language Resources and Evaluation
- Multilingual and Zero-Shot is Closing in on Monolingual Web Register Classification (2021)
- Linköping Electronic Conference Proceedings
- A broad-coverage corpus for Finnish named entity recognition (2020) 12th International Conference on Language Resources and Evaluation Jouni Luoma, Miika Oinonen, Maria Pyykönen, Veronika Laippala, Sampo Pyysalo
- Affectivity in the #jesuisCharlie Twitter discussion (2020)
- Pragmatics
- Commenting on poverty online: A corpus-assisted discourse study of the Suomi24 forum (2020)
- SKY Journal of Linguistics
- From Web Crawl to Clean Register-Annotated Corpora (2020) Proceedings of the 12th Web as Corpus Workshop Laippala Veronika, Rönnqvist Samuel, Hellström Saara, Luotolahti Juhani, Repo Liina, Salmela Anna, Skantsi Valtteri, Pyysalo Sampo
- Korpusaineistot (2020) Kielentutkimuksen menetelmiä I-IV Veronika Laippala, Minna Palander-Collin
- Määrällinen korpuslingvistiikka (2020) Kielentutkimuksen menetelmiä I-IV Veronika Laippala, Aki-Juhani Kyröläinen
- Digilang – Turun yliopiston digitaalisia kieliaineistoja kehittämässä (2019)
- Studia Humaniora Ouluensia
Veronika Laippala, Christophe Leblay, Jorma Luutonen, Maarit Mutta, Markku Nikulin, Elisa Reunanen
- From bits and numbers to explanations – doing research on Internet-based big data (2019)
- Studia Humaniora Ouluensia
- Toward Multilingual Identification of Online Registers (2019)
- Linköping Electronic Conference Proceedings
- A bottom-up analysis of sentence-initial DRDs in the Finnish Internet (2018) Veronika Laippala, Aki-Juhani Kyröläinen, Filip Ginter, Jenna Kanerva, Johanna Komppa, Jyrki Kalliokoski
- Dependency profiles in the large-scale analysis of discourse connectives (2018)
- Corpus Linguistics and Linguistic Theory
- Investigating the cross-lingual translatability of VerbNet-style classification (2018)
- Language Resources and Evaluation