Filip Ginter
 


figint@utu.fi



Office: 4th floor, 451A


ORCID iD: https://orcid.org/0000-0002-5484-6103

Google Scholar

LinkedIn

GitHub


Areas of expertise
natural language processing; human language technology; machine learning; deep learning; resource development

Research community or research topic
human language technology, natural language processing, machine learning applied to human language, both methodological and resource creation research

Biography

I am a researcher at the Department of Computing, University of Turku. My research is in the area of natural language processing. I belong to the TurkuNLP (turkunlp.org) research group.

I was born in 1978 in Ostrava, Czech Republic (Czechoslovakia back then). In 2001, I received an M.Sc. (tech) in computer science from the computer science department of VSB - Technical University Ostrava, with artificial intelligence as my major subject. I gained a PhD in computer science in 2007. The title of my thesis is "Towards Information Extraction in the Biomedical Domain: Methods and Resources".

Since 2022, I have been a professor of language technology, and since 2021 the deputy director of the Department of Computing.



Research

My primary field of research is language technology / natural language processing. In my post-PhD career, I have focused on the development of NLP tools and resources, primarily for Finnish but later also for numerous other languages via the Universal Dependencies project. My work is heavy on resource development, both in terms of data and machine learning pipelines. Open science and resources play an important role in my research: much of it is carried out in the open on GitHub and, as a rule, all resources are openly available for unrestricted use. I work collaboratively, especially with my younger colleagues, rather than striving for deeper, primary-author inquiries.



Teaching

I have been teaching actively since early in my PhD studies. I independently prepared my first advanced-level NLP course in 2004, and since ca. 2008 I have taught at least one course every year, substantially more during my bioinformatics lecturer appointment. While a lecturer in the bioinformatics MSc degree programme, I lectured international students in two cities. In 2016, I was tasked with developing and coordinating the introduction of a new 20 ECTS study module on natural language processing. This module is, with modifications, still in use and shared between the departments of Languages and Computing, both in terms of teaching and in terms of students. In 2019-2020 and 2020-2021, I also co-lectured, upon invitation, two courses in natural language processing at the Arcada University of Applied Sciences in Helsinki.



Publications
  
  

  • Application of the Question Answering method to extract information from materials science literature  (2024)  Sipilä, Matilda; Mehryary, Farrokh; Pyysalo, Sampo; Ginter, Filip; Todorović, Milica
    (Abstract)


  • Automatic Short Answer Grading for Finnish with ChatGPT  (2024)  
    • Proceedings of the AAAI Conference on Artificial Intelligence
    Proceedings of the 38th AAAI Conference on Artificial Intelligence Chang Li-Hsin, Ginter Filip
    (A4 Peer-reviewed article in conference proceedings)


  • Breakpoints in Iterative Development and Interdisciplinary Collaboration of AI-Driven Automated Assessment  (2024)  
    • International Conference on Information Technology Based Higher Education and Training
    2024 21st International Conference on Information Technology Based Higher Education and Training (ITHET) Huang, Xiaoshan; Chang, Li-Hsin; Veermans, Koen; Ginter, Filip
    (A4 Peer-reviewed article in conference proceedings)


  • Extracting Social Connections from Finnish Karelian Refugee Interviews Using LLMs  (2024)  
    • CEUR Workshop Proceedings
    Proceedings of the Computational Humanities Research Conference 2024 (CHR 2024), Aarhus, Denmark, December 4-6, 202 Laato, Joonatan; Kanerva, Jenna; Loehr, John; Lummaa, Virpi; Ginter, Filip
    (A4 Peer-reviewed article in conference proceedings)


  • Question Answering models for information extraction from perovskite materials science literature  (2024)  2024 MRS Fall Meeting and Exhibit Sipilä, Matilda; Mehryary, Farrokh; Pyysalo, Sampo; Ginter, Filip; Todorović, Milica
    (Abstract)


  • Semantic search as extractive paraphrase span detection  (2024)  
    • Language Resources and Evaluation
     Kanerva Jenna, Kitti Hanna, Chang Li-Hsin, Vahtola Teemu, Creutz Mathias, Ginter Filip
    (A1 Peer-reviewed original article in a scientific journal)


  • FinGPT: Large Generative Models for a Small Language  (2023)  Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing Luukkonen Risto, Komulainen Ville, Luoma Jouni, Eskelinen Anni, Kanerva Jenna, Kupari Hanna-Mari, Ginter Filip, Laippala Veronika, Muennighoff Niklas, Piktus Aleksandra, Wang Thomas, Tazi Nouamane, Scao Le Teven, Wolf Thomas, Suominen Osma, Sairanen Samuli, Merioksa Mikko, Heinonen Jyrki, Vahtola Aija, Antao Samuel, Pyysalo Sampo
    (A4 Peer-reviewed article in conference proceedings)


  • Identifying gender bias in blockbuster movies through the lens of machine learning  (2023)  
    • Humanities & social sciences communications
     Haris Muhammad Junaid, Upreti Aanchal, Kurtaran Melih, Ginter Filip, Lafond Sebastien, Azimi Sepinoud
    (A1 Peer-reviewed original article in a scientific journal)


  • Infectious Media: Cholera and the Circulation of Texts in the Finnish Press, 1860–1920  (2023)  
    • Media History
     Paasikivi Sofia, Salmi Hannu, Vesanto Aleksi, Ginter Filip
    (A1 Peer-reviewed original article in a scientific journal)


  • Multi-CrossRE: A Multi-Lingual Multi-Domain Dataset for Relation Extraction  (2023)  
    • NEALT proceedings series
    Proceedings of The 24th Nordic Conference on Computational Linguistics (NoDaLiDa) Bassignana Elisa, Ginter Filip, Pyysalo Sampo, van der Goot Rob, Plank Barbara
    (A4 Peer-reviewed article in conference proceedings)


  • Silver Syntax Pre-training for Cross-Domain Relation Extraction  (2023)  Findings of the Association for Computational Linguistics: ACL 2023 Bassignana Elisa, Ginter Filip, Pyysalo Sampo, van der Goot Rob, Plank Barbara
    (A4 Peer-reviewed article in conference proceedings)


  • Towards diverse and contextually anchored paraphrase modeling: A dataset and baselines for Finnish  (2023)  
    • Natural Language Engineering
     Kanerva Jenna, Ginter Filip, Chang Li-Hsin, Rastas Iiro, Skantsi Valtteri, Kilpeläinen Jemina, Kupari Hanna-Mari, Piirto Aurora, Saarni Jenna, Sevón Maija, Tarkka Otto
    (A1 Peer-reviewed original article in a scientific journal)


  • Toxicity Detection in Finnish Using Machine Translation  (2023)  
    • NEALT proceedings series
    The 24th Nordic Conference on Computational Linguistics (NoDaLiDa 2023) Eskelinen Anni, Silvala Laura, Ginter Filip, Pyysalo Sampo, Laippala Veronika
    (A4 Peer-reviewed article in conference proceedings)


  • Deep Learning and Film History: Model Explanation Techniques in the Analysis of Temporality in Finnish Fiction Film Metadata  (2022)  
    • CEUR Workshop Proceedings
    The 6th Digital Humanities in the Nordic and Baltic Countries Conference (DHNB 2022), Uppsala, Sweden, March 15-18, 2022 Ginter Filip, Kiiskinen Harri, Kanerva Jenna, Chang Li-Hsin, Salmi Hannu
    (A4 Peer-reviewed article in conference proceedings)


  • Detecting Sequential Genre Change in Eighteenth-Century Texts  (2022)  
    • CEUR Workshop Proceedings
    Proceedings of the Computational Humanities Research Conference 2022 Zhang Jinbin, Ryan Yann Ciarán, Rastas Iiro, Ginter Filip, Tolonen Mikko, Babbar Rohit
    (A4 Peer-reviewed article in conference proceedings)


  • Explainable Publication Year Prediction of Eighteenth Century Texts with the BERT Model  (2022)  Proceedings of the 3rd Workshop on Computational Approaches to Historical Language Change Rastas Iiro, Ryan Yann, Tiihonen Iiro, Qaraei Mohammedreza, Repo Liina, Babbar Rohit, Mäkelä Eetu, Tolonen Mikko, Ginter Filip
    (A4 Peer-reviewed article in conference proceedings)


  • Explaining Classes through Stable Word Attributions  (2022)  
    • Annual Meeting of the Association for Computational Linguistics
    The 60th Annual Meeting of the Association for Computational Linguistics: Findings of ACL 2022 Rönnqvist Samuel, Myntti Amanda, Kyröläinen Aki-Juhani, Ginter Filip, Laippala Veronika
    (A4 Peer-reviewed article in conference proceedings)


  • GEMv2: Multilingual NLG Benchmarking in a Single Line of Code  (2022)  Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing: System Demonstrations Gehrmann Sebastian, Bhattacharjee Abhik, Mahendiran Abinaya, Wang Alex, Papangelis Alexandros, Madaan Aman, McMillan-Major Angelina, Shvets Anna, Upadhyay Ashish, Bohnet Bernd, Yao Bingsheng, Wilie Bryan, Bhagavatula Chandra, You Chaobin, Thomson Craig, Garbacea Cristina, Wang Dakuo, Deutsch Daniel, Xiong Deyi, Jin Di, Gkatzia Dimitra, Radev Dragomir, Clark Elizabeth, Durmus Esin, Ladhak Faisal, Ginter Filip, Winata Genta Indra, Strobelt Hendrik, Hayashi Hiroaki, Novikova Jekaterina, Kanerva Jenna, Chim Jenny, Zhou Jiawei, Clive Jordan, Maynez Joshua, Sedoc João, Juraska Juraj, Dhole Kaustubh, Chandu Khyathi Raghavi, Perez-Beltrachini Laura, Ribeiro Leonardo F.R., Tunstall Lewis, Zhang Li, Pushkarna Mahima, Creutz Mathias, White Michael, Kale Mihir Sanjay, Eddine Moussa Kamal, Daheim Nico, Subramani Nishant, Dusek Ondrej, Liang Paul Pu, Ammanamanchi Pawan Sasanka, Zhu Qi, Puduppully Ratish, Kriz Reno, Shahriyar Rifat, Cardenas Ronald, Mahamood Saad, Osei Salomey, Cahyawijaya Samuel, Štajner Sanja, Montella Sebastien, Jolly Shailza, Mille Simon, Hasan Tahmid, Shen Tianhao, Adewumi Tosin, Raunak Vikas, Raheja Vipul, Nikolaev Vitaly, Tsai Vivian, Jernite Yacine, Xu Ying, Sang Yisi, Liu Yixin, Hou Yufang
    (A4 Peer-reviewed article in conference proceedings)


  • Neural Network and Random Forest Models in Protein Function Prediction  (2022)  
    • IEEE/ACM Transactions on Computational Biology and Bioinformatics
     Hakala Kai, Kaewphan Suwisa, Björne Jari, Mehryary Farrokh, Moen Hans, Tolvanen Martti, Salakoski Tapio, Ginter Filip
    (A1 Peer-reviewed original article in a scientific journal)


  • Out-of-Domain Evaluation of Finnish Dependency Parsing  (2022)  
    • LREC Proceedings
    Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022) Kanerva Jenna, Ginter Filip
    (A4 Peer-reviewed article in conference proceedings)



Last updated on 2025-01-30 at 17:59