Filip Ginter
 


figint@utu.fi



Työhuone4th floor, 451A


ORCID-tunnistehttps://orcid.org/0000-0002-5484-6103

Google Scholar

LinkedIn

GitHub


Asiantuntijuusalueet
natural language processing; human language technology; machine learning; deep learning; resource development

Tutkimusyhteisö tai tutkimusaihe
human language technology, natural language processing, machine learning applied to human language, both methodological and resource creation research

Biografia

I am a researcher at the Department of Computing, University of Turku. My research is in the area of natural language processing. I belong to the TurkuNLP (turkunlp.org) research group.

I was born in 1978 in Ostrava, Czech Republic (Czechoslovakia back then). In 2001, I got a M.Sc. (tech) in computer science at the computer science department of VSB - Technical University Ostrava. My major subject was artificial intelligence. I gained a PhD in computer science in 2007. The title of my thesis is Towards Information Extraction in the Biomedical Domain: Methods and Resources.

As of 2022, I am a professor of language technology and as of 2021 the deputy director of the Department of Computing.



Tutkimus

My primary field of research is language technology / natural language processing. In my post-PhD career, I have focused on the development of NLP tools and resources primarily for Finnish, but later also numerous other languages via the Universal Dependencies project. My work is heavy on resource development, both in terms of data and machine learning pipelines. Open science and resources play an important role in my research, much of which is carried out in the open on GitHub and as a rule, all resources are openly available for unrestricted use. I work collaboratively, especially with my younger colleagues, rather than striving for deeper, primary author inquiries.



Opetus

I have been actively teaching since early on during my PhD studies. I independently prepared my first advanced level NLP course in 2004, and since ca. 2008 I have been teaching at least one course every year, substantially more during my bioinformatics lecturer appointment. While a lecturer in the bioinformatics MSc degree programme, I was lecturing international students in two cities. In 2016, I was tasked with developing and coordinating the introduction of a new 20 ECTS study module on natural language processing. This module is, with modifications, still in use and shared between the departments of Languages and Computing, both in terms of teaching and in terms of students. In 2019-2020 and 2020-2021 I was also co-lecturing, upon invitation, two courses in natural language processing in the Arcada University of Applied Sciences in Helsinki.



Julkaisut
  
null
  
null
  
4/8
  
null
  
null
  

  • CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies  (2018)  Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies Daniel Zeman, Jan Hajiˇc, Martin Popel, Martin Potthast, Milan Straka, Filip Ginter, Joakim Nivre, Slav Petrov
    (
    A4 Vertaisarvioitu artikkeli konferenssijulkaisussa)


  • Data and systems for medication-related text classification and concept normalization from Twitter: insights from the Social Media Mining for Health (SMM4H)-2017 shared task  (2018)  
    • Journal of the American Medical Informatics Association
     Abeed Sarker, Maksim Belousov, Jasper Friedrichs, Kai Hakala, Svetlana Kiritchenko, Farrokh Mehryary, Sifei Han, Tung Tran, Anthony Rios, Ramakanth Kavuluru, Berry de Bruijn, Filip Ginter, Debanjan Mahata, Saif M. Mohammad, Goran Nenadic, Graciela Gonzalez-Hernandez
    (
    A1 Vertaisarvioitu alkuperäisartikkeli tieteellisessä lehdessä )


  • Dependency profiles in the large-scale analysis of discourse connectivesTurkuNLP Entry for Interactive Bio-ID Assignment  (2018)  
    • Corpus Linguistics and Linguistic Theory
     Veronika Laippala, Aki-Juhani Kyröläinen, Jenna Kanerva, Filip Ginter
    (
    A1 Vertaisarvioitu alkuperäisartikkeli tieteellisessä lehdessä )


  • Wide-scope biomedical named entity recognition and normalization with CRFs, fuzzy matching and character level modeling  (2018)  Proceedings of the Second Workshop on Universal Dependencies (UDW 2018) Joakim Nivre, Paola Marongiu, Filip Ginter, Jenna Kanerva, Simonetta Montemagni, Sebastian Schuster, Maria Simi
    (
    A4 Vertaisarvioitu artikkeli konferenssijulkaisussa)


  •   (2018)  Proceedings of the 9th International Workshop on Health Text Mining and Information Analysis (LOUHI 2018) Hans Moen, Kai Hakala, Laura-Maria Peltonen, Henry Suhonen, Petri Loukasmäki, Tapio Salakoski, Filip Ginter, Sanna Salanterä
    (
    A4 Vertaisarvioitu artikkeli konferenssijulkaisussa)


  • Finding novel relationships with integrated gene-gene association network analysis of Synechocystis sp. PCC 6803 using species-independent text-mining  (2018)  
    • PeerJ
     Sanna M. Kreula, Suwisa Kaewphan, Filip Ginter, Patrik R. Jones
    (
    A1 Vertaisarvioitu alkuperäisartikkeli tieteellisessä lehdessä )


  •   (2018)  
    • Medical informatics Europe
    Building Continents of Knowledge in Oceans of Data: The Future of Co-Created eHealth Moen H., Peltonen L., Koivumäki M., Suhonen H., Salakoski T., Ginter F., Salanterä S.
    (
    A4 Vertaisarvioitu artikkeli konferenssijulkaisussa)


  • Mind the Gap: Data Enrichment in Dependency Parsing of Elliptical ConstructionsCoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies  (2018)  Proceedings of the Second Workshop on Universal Dependencies (UDW 2018) Kira Droganova, Filip Ginter, Jenna Kanerva, Daniel Zeman
    (
    A4 Vertaisarvioitu artikkeli konferenssijulkaisussa)


  •   (2018)  
    • Journal of Pragmatics
     Johansson Marjut, Kyröläinen Aki-Juhani, Ginter Filip, Lehti Lotta, Krizsán Attila, Laippala Veronika
    (
    A1 Vertaisarvioitu alkuperäisartikkeli tieteellisessä lehdessä )


  •   (2018)  
    • Database: The Journal of Biological Databases and Curation
     Farrokh Mehryary, Jari Björne, Tapio Salakoski, Filip Ginter
    (
    A1 Vertaisarvioitu alkuperäisartikkeli tieteellisessä lehdessä )


  • 2018Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies Jenna Kanerva, Filip Ginter, Niko Miekka, Akseli Leino, Tapio SalakoskiA4 Vertaisarvioitu artikkeli konferenssijulkaisussa


  • 2018Proceedings of the BioCreative VI Workshop Suwisa Kaewphan, Farrokh Mehryary, Kai Hakala, Tapio Salakoski, Filip Ginter
    (
    A4 Vertaisarvioitu artikkeli konferenssijulkaisussa)


  •   (2018)  
    • Database: The Journal of Biological Databases and Curation
     Suwisa Kaewphan, Kai Hakala, Niko Miekka, Tapio Salakoski, Filip Ginter
    (
    A1 Vertaisarvioitu alkuperäisartikkeli tieteellisessä lehdessä )


  • An autoencoder-based neural network model for selectional preference: Evidence from pseudo-disambiguation and cloze tasks  (2017)  
    • Eesti ja soome-ugri keeleteaduse ajakiri
     Kyröläinen A., Luotolahti J., Ginter F.
    (
    A1 Vertaisarvioitu alkuperäisartikkeli tieteellisessä lehdessä )


  • Applying BLAST to Text Reuse Detection in Finnish Newspapers and Journals, 1771–1910  (2017)  Proceedings of the NoDaLiDa 2017 Workshop on Processing Historical Language Aleksi Vesanto, Asko Nivala, Heli Rantala, Tapio Salakoski, Hannu Salmi, Filip Ginter
    (
    A4 Vertaisarvioitu artikkeli konferenssijulkaisussa)


  • Assessing the Annotation Consistency of the Universal Dependencies Corpora2017
    • Linköping Electronic Conference Proceedings
    Proceedings of the International Conference on Dependency Linguistics (Depling'17) de Marneffe Marie-Catherine, Grioni Matias, Kanerva Jenna, Ginter Filip
    (
    A4 Vertaisarvioitu artikkeli konferenssijulkaisussa)


  • A System for Identifying and Exploring Text Repetition in Large Historical Document Corpora  (2017)  Proceedings of the 21st Nordic Conference on Computational Linguistics, NoDaLiDa, 22-24 May 2017, Gothenburg, Sweden Aleksi Vesanto, Asko Nivala, Tapio Salakoski, Hannu Salmi, Filip Ginter
    (
    A4 Vertaisarvioitu artikkeli konferenssijulkaisussa)


  •   (2017)  Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies Zeman D, Popel M, Straka M, Hajič J, Nivre J, Ginter F, Luotolahti J, Pyysalo S, Petrov S, Potthast M, Tyers F, Badmaeva E, Gökırmak M, Nedoluzhko A, Cinková S, Hajič jr. J, Hlaváčová J, Kettnerová V, Urešová Z, Kanerva J, Ojala S, Missilä A, Manning C, Schuster S, Reddy S, Taji D, Habash N, Leung H, Marneffe M, Sanguinetti M, Simi M, Kanayama H, Paiva V, Droganova K, Martínez Alonso H, Uszkoreit H, Macketanz V, Burchardt A, Harris K, Marheinecke K, Rehm G, Kayadelen T, Attia M, Elkahky A, Yu Z, Pitler E, Lertpradit S, Mandl M, Kirchner J, Fernandez Alcalde H, Strnadova J, Banerjee E, Manurung R, Stella A, Shimada A, Kwak S, Mendonçca G, Lando T, Nitisaroj R, Li J
    (
    A4 Vertaisarvioitu artikkeli konferenssijulkaisussa)


  • Creating register sub-corpora for the Finnish Internet Parsebank.  (2017)  
    • Linköping Electronic Conference Proceedings
    Proceedings of the 21st Nordic Conference on Computational Linguistics Laippala Veronika, Luotolahti Juhani, Kyröläinen Aki-Juhani, Salakoski Tapio, Ginter Filip
    (
    A4 Vertaisarvioitu artikkeli konferenssijulkaisussa)


  • Cross-Lingual Pronoun Prediction with Deep Recurrent Neural Networks v2.0  (2017)  Proceedings of the Third Workshop on Discourse in Machine Translation Luotolahti Juhani, Kanerva Jenna, Ginter Filip
    (
    A4 Vertaisarvioitu artikkeli konferenssijulkaisussa)



Last updated on