Otto Tarkka
 MA


ohitar@utu.fi



Office: 451A


ORCID identifier: https://orcid.org/0000-0001-8200-0319

TurkuNLP




Areas of expertise
natural language processing; linguistics; digital linguistics; corpus-assisted discourse analysis

Research community or research topic
turkunlp.org

Biography

I started studying English at the University of Turku in 2016 and got my Bachelor's degree three years later. My BA thesis was a corpus linguistic study on learner English. After my BA, I almost accidentally enrolled on a course called 'Automatic Text Processing' and was immediately hooked. I decided to do my MA in Digital Language Studies and wrote my MA thesis on topic modelling. During my studies, I worked with the fine people at the TurkuNLP research group, and I have been working on my PhD with them since 2023.



Research

I am a PhD student currently doing research as part of the GreenNLP project at TurkuNLP. I am interested in machine learning, large language models, and applying these emerging technologies to corpus linguistic research.



Publications

  • Automated Emotion Annotation of Finnish Parliamentary Speeches Using GPT-4  (2024)  
    • LREC Proceedings
    Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024): ParlaCLARIN IV Workshop on Creating, Analysing, and Increasing Accessibility of Parliamentary Corpora
    Tarkka, Otto; Koljonen, Jaakko; Korhonen, Markus; Laine, Juuso; Martiskainen, Kristian; Elo, Kimmo; Laippala, Veronika
    (A4 Refereed article in a conference publication)


  • Towards diverse and contextually anchored paraphrase modeling: A dataset and baselines for Finnish  (2023)  
    • Natural Language Engineering
    Kanerva Jenna, Ginter Filip, Chang Li-Hsin, Rastas Iiro, Skantsi Valtteri, Kilpeläinen Jemina, Kupari Hanna-Mari, Piirto Aurora, Saarni Jenna, Sevón Maija, Tarkka Otto
    (A1 Refereed original research article in a scientific journal)


  • Mistä koronapandemian aikana keskustellaan sosiaalisessa mediassa?  (2022)
    Saarni Jenna, Tarkka Otto
    (E1 Popularised article)


  • Textual Paraphrase Dataset for Deep Language Modelling  (2022)
    European Language Grid: A Language Technology Platform for Multilingual Europe
    Kanerva Jenna, Ginter Filip, Chang Li-Hsin, Skantsi Valtteri, Kilpeläinen Jemina, Kupari Hanna-Mari, Piirto Aurora, Saarni Jenna, Sevón Maija, Tarkka Otto
    (A3 Refereed book chapter or chapter in a compilation book)


  • Finnish Paraphrase Corpus  (2021)  
    • Linköping Electronic Conference Proceedings
    Proceedings of the 23rd Nordic Conference on Computational Linguistics (NoDaLiDa 2021)
    Kanerva Jenna, Ginter Filip, Chang Li-Hsin, Rastas Iiro, Skantsi Valtteri, Kilpeläinen Jemina, Kupari Hanna-Mari, Saarni Jenna, Sevón Maija, Tarkka Otto
    (A4 Refereed article in a conference publication)



Last updated on 2024-09-11 at 14:44