Otto Tarkka
MA
ohitar@utu.fi Office: 451A ORCID identifier: https://orcid.org/0000-0001-8200-0319 |
natural language processing; linguistics; digital linguistics; corpus-assisted discourse analysis
turkunlp.org
I started studying English at the University of Turku in 2016 and got my Bachelor's degree three years later. My BA thesis was a corpus linguistic study on learner English. After my BA, I almost accidentally enrolled on a course called 'Automatic Text Processing' and was immediately hooked. I decided to do my MA in Digital Language Studies and wrote my MA thesis on topic modelling. During my studies I worked with the fine people at the TurkuNLP research group and have been working on my PhD with them since 2023.
I am a PhD student currently doing research as part of the GreenNLP project at TurkuNLP. I am interested in machine learning, Large Language Models and applying these emerging technologies in corpus linguistic research.
- Automated Emotion Annotation of Finnish Parliamentary Speeches Using GPT-4 (2024)
- LREC Proceedings
(A4 Refereed article in a conference publication ) - Towards diverse and contextually anchored paraphrase modeling: A dataset and baselines for Finnish (2023)
- Natural Language Engineering
(A1 Refereed original research article in a scientific journal) - Mistä koronapandemian aikana keskustellaan sosiaalisessa mediassa? (2022) Saarni Jenna, Tarkka Otto
(E1 Popularised article) - Textual Paraphrase Dataset for Deep Language Modelling (2022) European Language Grid: A Language Technology Platform for Multilingual Europe Kanerva Jenna, Ginter Filip, Chang Li-Hsin, Skantsi Valtteri, Kilpeläinen Jemina, Kupari Hanna-Mari, Piirto Aurora, Saarni Jenna, Sevón Maija, Tarkka Otto
(A3 Refereed book chapter or chapter in a compilation book) - Finnish Paraphrase Corpus (2021)
- Linköping Electronic Conference Proceedings
(A4 Refereed article in a conference publication )