Liina Repo
Master of Arts (Filosofian maisteri)
liina.t.repo@utu.fi
Arcanuminkuja 1, Turku
ORCID identifier: https://orcid.org/0000-0003-1868-3674
corpus linguistics; computational linguistics; Late Modern English; register studies
Digital Language Studies
I am a doctoral researcher in Digital Language Studies. My research applies computational methods to historical language.
I use automatic text classification methods to model noisy historical data. More specifically, I focus on predicting and modelling registers (text varieties) in large Late Modern English datasets with machine learning methods.
Other research projects I've participated in:
- Project researcher, Prosovar project (Digilang)
- Project researcher, Universal Parsebanks project (Digilang)
- Project assistant, Structuring Language Use Across Multilingual Web Corpora
Publications:
- Filosofian tohtoreiden urapolkupuheenvuorot: moniosaaminen voimavarana [Career-path talks by Doctors of Philosophy: versatile expertise as a resource] (2024). Hiiskuttua: Turun yliopiston humanistisen tiedekunnan verkkolehti [online magazine of the Faculty of Humanities, University of Turku]. (D1 Article in a professional journal)
- From Discrete to Continuous Classes: A Situational Analysis of Multilingual Web Registers with LLM Annotations (2024). Proceedings of the 4th International Conference on Natural Language Processing for Digital Humanities. Henriksson, Erik; Myntti, Amanda; Hellström, Saara; Erten-Johansson, Selcen; Eskelinen, Anni; Repo, Liina; Laippala, Veronika. (A4 Refereed article in a conference publication)
- Intersecting Register and Genre: Understanding the Contents of Web-Crawled Corpora (2024). Proceedings of the 4th International Conference on Natural Language Processing for Digital Humanities. Myntti, Amanda; Repo, Liina; Freyermuth, Elian; Kanner, Antti; Laippala, Veronika; Henriksson, Erik. (A4 Refereed article in a conference publication)
- Towards Automatic Register Classification in Unrestricted Databases of Historical English (2024). Linguistics across Disciplinary Borders: The March of Data. Repo, Liina; Hashimoto, Brett; Liimatta, Aatu; Saario, Lassi; Säily, Tanja; Tiihonen, Iiro; Tolonen, Mikko; Laippala, Veronika. (A3 Refereed book chapter or chapter in a compilation book)
- In search of founding era registers: automatic modeling of registers from the corpus of Founding Era American English (2023). Digital Scholarship in the Humanities. (A1 Refereed original research article in a scientific journal)
- Explainable Publication Year Prediction of Eighteenth Century Texts with the BERT Model (2022). Proceedings of the 3rd Workshop on Computational Approaches to Historical Language Change. Rastas, Iiro; Ryan, Yann; Tiihonen, Iiro; Qaraei, Mohammedreza; Repo, Liina; Babbar, Rohit; Mäkelä, Eetu; Tolonen, Mikko; Ginter, Filip. (A4 Refereed article in a conference publication)
- Beyond the English web: Zero-shot cross-lingual and lightweight monolingual classification of registers (2021). Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Student Research Workshop. Repo, Liina; Skantsi, Valtteri; Rönnqvist, Samuel; Hellström, Saara; Oinonen, Miika; Salmela, Anna; Biber, Douglas; Egbert, Jesse; Pyysalo, Sampo; Laippala, Veronika. (A4 Refereed article in a conference publication)
- From Web Crawl to Clean Register-Annotated Corpora (2020). Proceedings of the 12th Web as Corpus Workshop. Laippala, Veronika; Rönnqvist, Samuel; Hellström, Saara; Luotolahti, Juhani; Repo, Liina; Salmela, Anna; Skantsi, Valtteri; Pyysalo, Sampo. (A4 Refereed article in a conference publication)