Jenna Kanerva
 

Asiantuntijuusalueet
kieliteknologia, luonnollisen kielen prosessointi, koneoppiminen, korpukset, annotointi

Biografia

I am a doctoral researcher at the Department of Computing, University of Turku. I’m working as a part of the TurkuNLP research group focusing on language technology and natural language processing (NLP) related topics. I got my Master of Science degree in 2014 at the University of Turku (major subject computer science).



Tutkimus

My PhD research focuses on the area of language technology, especially being interested in machine learning based methods for Finnish language processing. I also greatly enjoy and respect elementary corpus work after being part of the data collection and annotation effort of several language data resources built for Finnish language at the TurkuNLP group. After building the elementary resources, these datasets are used to develop several language processing tools based on the latest machine learning methods.



Opetus

Starting from the year 2014, I have acted as a responsible/co-responsible person for the Introduction to Language Technology course lectured at the University of Turku each year. In addition to this, I have been lecturing/co-lecturing several courses/lectures related to language technology at the University of Turku, as well as being invited to give lectures as part-time teacher at the Arcada University of Applied Sciences and the University of Tampere (Pori unit). In order to advance as a teacher, I have completed a 25 ECTS study module of university pedagogy within the years 2019-2021.



Julkaisut
  
null
  
null
  
1/3
  
null
  
null
  

  • A Deep Dive into Multi-Head Attention and Multi-Aspect EmbeddingCreating a Historical Migration Dataset from Finnish Church Records, 1800–1920  (2025)  Proceedings of the 15th International Conference on Recent Advances in Natural Language Processing - Natural Language Processing in the Generative AI era Teimouri, Maryam; Kanerva, Jenna; Ginter, Filip
    (
    D3 Artikkeli ammatillisessa konferenssijulkaisussa )


  •   (2025)  
    • Journal of Open Humanities Data
     Vesalainen, Ari; Kanerva, Jenna; Nitsch, Aïda; Korsu, Kiia; Larkiola, Ilari; Ruotsalainen, Laura; Ginter, Filip
    (
    A1 Vertaisarvioitu alkuperäisartikkeli tieteellisessä lehdessä )


  • TCBLex - A lexical database of Finnish literary texts for children  (2025)  
    • Behavior Research Methods
     Nojonen, Tapio; Korsu, Kiia; Ginter, Filip; Laippala, Veronika; Kanerva, Jenna
    (
    A1 Vertaisarvioitu alkuperäisartikkeli tieteellisessä lehdessä )


  • Extracting Social Connections from Finnish Karelian Refugee Interviews Using LLMs  (2024)  
    • CEUR Workshop Proceedings
    Proceedings of the Computational Humanities Research Conference 2024 (CHR 2024), Aarhus, Denmark, December 4-6, 202 Laato, Joonatan; Kanerva, Jenna; Loehr, John; Lummaa, Virpi; Ginter, Filip
    (
    A4 Vertaisarvioitu artikkeli konferenssijulkaisussa)


  • Improving Latin Dependency Parsing by Combining Treebanks and Predictions  (2024)  Proceedings of the 4th International Conference on Natural Language Processing for Digital Humanities Kupari, Hanna-Mari Kristiina; Henriksson, Erik; Laippala, Veronika; Kanerva, Jenna
    (
    A4 Vertaisarvioitu artikkeli konferenssijulkaisussa)


  • Semantic search as extractive paraphrase span detection  (2024)  
    • Language Resources and Evaluation
     Kanerva Jenna, Kitti Hanna, Chang Li-Hsin, Vahtola Teemu, Creutz Mathias, Ginter Filip
    (
    A1 Vertaisarvioitu alkuperäisartikkeli tieteellisessä lehdessä )


  • Understanding the structure and meaning of Finnish texts: From corpus creation to deep language modelling  (2024)   Kanerva, Jenna
    (
    G5 Artikkeliväitöskirja)


  • FinGPT: Large Generative Models for a Small Language  (2023)  Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing Luukkonen Risto, Komulainen Ville, Luoma Jouni, Eskelinen Anni, Kanerva Jenna, Kupari Hanna-Mari, Ginter Filip, Laippala Veronika, Muennighoff Niklas, Piktus Aleksandra, Wang Thomas, Tazi Nouamane, Scao Le Teven, Wolf Thomas, Suominen Osma, Sairanen Samuli, Merioksa Mikko, Heinonen Jyrki, Vahtola Aija, Antao Samuel, Pyysalo Sampo
    (
    A4 Vertaisarvioitu artikkeli konferenssijulkaisussa)


  • Towards diverse and contextually anchored paraphrase modeling: A dataset and baselines for Finnish  (2023)  
    • Natural Language Engineering
     Kanerva Jenna, Ginter Filip, Chang Li-Hsin, Rastas Iiro, Skantsi Valtteri, Kilpeläinen Jemina, Kupari Hanna-Mari, Piirto Aurora, Saarni Jenna, Sevón Maija, Tarkka Otto
    (
    A1 Vertaisarvioitu alkuperäisartikkeli tieteellisessä lehdessä )


  • Deep Learning and Film History: Model Explanation Techniques in the Analysis of Temporality in Finnish Fiction Film Metadata  (2022)  
    • CEUR Workshop Proceedings
    The 6th Digital Humanities in the Nordic and Baltic Countries Conference (DHNB 2022), Uppsala, Sweden, March 15-18, 2022 Ginter Filip, Kiiskinen Harri, Kanerva Jenna, Chang Li-Hsin, Salmi Hannu
    (
    A4 Vertaisarvioitu artikkeli konferenssijulkaisussa)


  • GEMv2: Multilingual NLG Benchmarking in a Single Line of Code  (2022)  Proceedings of the The 2022 Conference on Empirical Methods in Natural Language Processing: System Demonstrations Gehrmann Sebastian, Bhattacharjee Abhik, Mahendiran Abinaya, Wang Alex, Papangelis Alexandros, Madaan Aman, McMillan-Major Angelina, Shvets Anna, Upadhyay Ashish, Bohnet Bernd, Yao Bingsheng, Wilie Bryan, Bhagavatula Chandra, You Chaobin, Thomson Craig, Garbacea Cristina, Wang, Dakuo, Deutsch Daniel, Xiong Deyi, Jin Di, Gkatzia Dimitra, Radev Dragomir, Clark Elizabeth, Durmus Esin, Ladhak Faisal, Ginter Filip, Winata Genta Indra, Strobelt, Hendrik, Hayashi, Hiroaki, Novikova Jekaterina, Kanerva Jenna, Chim Jenny, Zhou Jiawei, Clive Jordan, Maynez Joshua, Sedoc João, Juraska Juraj, Dhole Kaustubh, Chandu Khyathi Raghavi, Perez-Beltrachini Laura, Ribeiro Leonardo F.R., Tunstall Lewis, Zhang Li, Pushkarna Mahima, Creutz Mathias, White Michael, Kale Mihir Sanjay, Eddine Moussa Kamal, Daheim Nico, Subramani, Nishant, Dusek Ondrej, Liang Paul Pu, Ammanamanchi Pawan Sasanka, Zhu Qi, Puduppully Ratish, Kriz Reno, Shahriyar Rifat, Cardenas Ronald, Mahamood Saad, Osei Salomey, Cahyawijaya Samuel, Štajner Sanja, Montella Sebastien, Jolly Shailza, Mille Simon, Hasan Tahmid, Shen Tianhao, Adewumi Tosin, Raunak Vikas, Raheja Vipul, Nikolaev Vitaly, Tsai Vivian, Jernite Yacine, Xu Ying, Sang Yisi, Liu Yixin, Hou Yufang
    (
    A4 Vertaisarvioitu artikkeli konferenssijulkaisussa)


  • Out-of-Domain Evaluation of Finnish Dependency Parsing  (2022)  
    • LREC Proceedings
    Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022) Kanerva Jenna, Ginter Filip
    (
    A4 Vertaisarvioitu artikkeli konferenssijulkaisussa)


  • Paimen, piika ja emäntä. Arvot ja ammatit suomalaisessa näytelmäelokuvassa 1907–2017  (2022)  
    • Lähikuva
     Salmi Hannu, Kanerva Jenna, Kiiskinen Harri, Ginter Filip
    (
    A1 Vertaisarvioitu alkuperäisartikkeli tieteellisessä lehdessä )


  • Textual Paraphrase Dataset for Deep Language Modelling  (2022)  European Language Grid: A Language Technology Platform for Multilingual Europe Kanerva Jenna, Ginter Filip, Chang Li-Hsin, Skantsi Valtteri, Kilpeläinen Jemina, Kupari Hanna-Mari, Piirto Aurora, Saarni Jenna, Sevón Maija, Tarkka Otto
    (
    A3 Vertaisarvioitu kirjan tai muun kokoomateoksen osa)


  • Towards Automatic Short Answer Assessment for Finnish as a Paraphrase Retrieval Task  (2022)  Proceedings of the 17th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2022) Chang Li-Hsin, Kanerva Jenna, Ginter Filip
    (
    A4 Vertaisarvioitu artikkeli konferenssijulkaisussa)


  • Finnish Paraphrase Corpus  (2021)  
    • Linköping Electronic Conference Proceedings
    Proceedings of the 23rd Nordic Conference on Computational Linguistics (NoDaLiDa 2021) Kanerva Jenna, Ginter Filip, Chang Li-Hsin, Rastas Iiro, Skantsi Valtteri, Kilpeläinen Jemina, Kupari Hanna-Mari, Saarni Jenna, Sevón Maija, Tarkka Otto
    (
    A4 Vertaisarvioitu artikkeli konferenssijulkaisussa)


  • Quantitative Evaluation of Alternative Translations in a Corpus of Highly Dissimilar Finnish Paraphrases  (2021)  Proceedings for the First Workshop on Modelling Translation: Translatology in the Digital Age Chang Li-Hsin, Pyysalo Sampo, Kanerva Jenna, Ginter Filip
    (
    A4 Vertaisarvioitu artikkeli konferenssijulkaisussa)


  • Universal Lemmatizer: A sequence-to-sequence model for lemmatizing Universal Dependencies treebanks  (2021)  
    • Natural Language Engineering
     Kanerva Jenna, Ginter Filip, Salakoski Tapio
    (
    A1 Vertaisarvioitu alkuperäisartikkeli tieteellisessä lehdessä )


  • WikiBERT Models: Deep Transfer Learning for Many Languages  (2021)  
    • Linköping Electronic Conference Proceedings
    Proceedings of the 23rd Nordic Conference on Computational Linguistics (NoDaLiDa) Pyysalo Sampo, Kanerva Jenna, Virtanen Antti, Ginter Filip
    (
    A4 Vertaisarvioitu artikkeli konferenssijulkaisussa)


  • Dependency parsing of biomedical text with BERT  (2020)  
    • BMC Bioinformatics
     Kanerva Jenna, Ginter Filip, Pyysalo Sampo
    (
    A1 Vertaisarvioitu alkuperäisartikkeli tieteellisessä lehdessä )



Last updated on 2024-26-02 at 11:46