Otto Tarkka
MA
ohitar@utu.fi : 451A |
natural language processing; linguistics; digital linguistics; corpus-assisted discourse analysis
turkunlp.org
I started studying English at the University of Turku in 2016 and got my Bachelor's degree three years later. My BA thesis was a corpus linguistic study on learner English. After my BA, I almost accidentally enrolled on a course called 'Automatic Text Processing' and was immediately hooked. I decided to do my MA in Digital Language Studies and wrote my MA thesis on topic modelling. During my studies I worked with the fine people at the TurkuNLP research group and have been working on my PhD with them since 2023.
I am a PhD student currently doing research as part of the GreenNLP project at TurkuNLP. I am interested in machine learning, Large Language Models and applying these emerging technologies in corpus linguistic research.
- Automated Emotion Annotation of Finnish Parliamentary Speeches Using GPT-4 (2024)
- LREC Proceedings
- Towards diverse and contextually anchored paraphrase modeling: A dataset and baselines for Finnish (2023)
- Natural Language Engineering
- Mistä koronapandemian aikana keskustellaan sosiaalisessa mediassa? (2022) Saarni Jenna, Tarkka Otto
- Textual Paraphrase Dataset for Deep Language Modelling (2022) European Language Grid: A Language Technology Platform for Multilingual Europe Kanerva Jenna, Ginter Filip, Chang Li-Hsin, Skantsi Valtteri, Kilpeläinen Jemina, Kupari Hanna-Mari, Piirto Aurora, Saarni Jenna, Sevón Maija, Tarkka Otto
- Finnish Paraphrase Corpus (2021)
- Linköping Electronic Conference Proceedings