Filip Ginter
 


figint@utu.fi



Työhuone4th floor, 451A


ORCID-tunnistehttps://orcid.org/0000-0002-5484-6103

Google Scholar

LinkedIn

GitHub


Asiantuntijuusalueet
natural language processing; human language technology; machine learning; deep learning; resource development

Tutkimusyhteisö tai tutkimusaihe
human language technology, natural language processing, machine learning applied to human language, both methodological and resource creation research

Biografia

I am a researcher at the Department of Computing, University of Turku. My research is in the area of natural language processing. I belong to the TurkuNLP (turkunlp.org) research group.

I was born in 1978 in Ostrava, Czech Republic (Czechoslovakia back then). In 2001, I got a M.Sc. (tech) in computer science at the computer science department of VSB - Technical University Ostrava. My major subject was artificial intelligence. I gained a PhD in computer science in 2007. The title of my thesis is Towards Information Extraction in the Biomedical Domain: Methods and Resources.

As of 2022, I am a professor of language technology and as of 2021 the deputy director of the Department of Computing.



Tutkimus

My primary field of research is language technology / natural language processing. In my post-PhD career, I have focused on the development of NLP tools and resources primarily for Finnish, but later also numerous other languages via the Universal Dependencies project. My work is heavy on resource development, both in terms of data and machine learning pipelines. Open science and resources play an important role in my research, much of which is carried out in the open on GitHub and as a rule, all resources are openly available for unrestricted use. I work collaboratively, especially with my younger colleagues, rather than striving for deeper, primary author inquiries.



Opetus

I have been actively teaching since early on during my PhD studies. I independently prepared my first advanced level NLP course in 2004, and since ca. 2008 I have been teaching at least one course every year, substantially more during my bioinformatics lecturer appointment. While a lecturer in the bioinformatics MSc degree programme, I was lecturing international students in two cities. In 2016, I was tasked with developing and coordinating the introduction of a new 20 ECTS study module on natural language processing. This module is, with modifications, still in use and shared between the departments of Languages and Computing, both in terms of teaching and in terms of students. In 2019-2020 and 2020-2021 I was also co-lecturing, upon invitation, two courses in natural language processing in the Arcada University of Applied Sciences in Helsinki.



Julkaisut
  
null
  
null
  
5/8
  
null
  
null
  

  • A System for Identifying and Exploring Text Repetition in Large Historical Document Corpora  (2017)  Proceedings of the 21st Nordic Conference on Computational Linguistics, NoDaLiDa, 22-24 May 2017, Gothenburg, Sweden Aleksi Vesanto, Asko Nivala, Tapio Salakoski, Hannu Salmi, Filip Ginter
    (
    A4 Vertaisarvioitu artikkeli konferenssijulkaisussa)


  • CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies  (2017)  Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies Zeman D, Popel M, Straka M, Hajič J, Nivre J, Ginter F, Luotolahti J, Pyysalo S, Petrov S, Potthast M, Tyers F, Badmaeva E, Gökırmak M, Nedoluzhko A, Cinková S, Hajič jr. J, Hlaváčová J, Kettnerová V, Urešová Z, Kanerva J, Ojala S, Missilä A, Manning C, Schuster S, Reddy S, Taji D, Habash N, Leung H, Marneffe M, Sanguinetti M, Simi M, Kanayama H, Paiva V, Droganova K, Martínez Alonso H, Uszkoreit H, Macketanz V, Burchardt A, Harris K, Marheinecke K, Rehm G, Kayadelen T, Attia M, Elkahky A, Yu Z, Pitler E, Lertpradit S, Mandl M, Kirchner J, Fernandez Alcalde H, Strnadova J, Banerjee E, Manurung R, Stella A, Shimada A, Kwak S, Mendonçca G, Lando T, Nitisaroj R, Li J
    (
    A4 Vertaisarvioitu artikkeli konferenssijulkaisussa)


  • Creating register sub-corpora for the Finnish Internet Parsebank.  (2017)  
    • Linköping Electronic Conference Proceedings
    Proceedings of the 21st Nordic Conference on Computational Linguistics Laippala Veronika, Luotolahti Juhani, Kyröläinen Aki-Juhani, Salakoski Tapio, Ginter Filip
    (
    A4 Vertaisarvioitu artikkeli konferenssijulkaisussa)


  • Cross-Lingual Pronoun Prediction with Deep Recurrent Neural Networks v2.0  (2017)  Proceedings of the Third Workshop on Discourse in Machine Translation Luotolahti Juhani, Kanerva Jenna, Ginter Filip
    (
    A4 Vertaisarvioitu artikkeli konferenssijulkaisussa)


  • Dependency profiles as a tool for big data analysis of linguistic constructions: A case study of emoticons  (2017)  
    • Eesti ja soome-ugri keeleteaduse ajakiri
     Laippala V., Kyröläinen A., Kanerva J., Luotolahti J., Ginter F.
    (
    A1 Vertaisarvioitu alkuperäisartikkeli tieteellisessä lehdessä )


  • Dep_search: Efficient Search Tool for Large Dependency Parsebanks  (2017)  
    • Linköping Electronic Conference Proceedings
    Proceedings of the 21st Nordic Conference on Computational Linguistics (NoDaLiDa) Luotolahti Juhani, Kanerva Jenna, Ginter Filip
    (
    A4 Vertaisarvioitu artikkeli konferenssijulkaisussa)


  • Detecting mentions of pain and acute confusion in Finnish clinical text  (2017)  SIGBioMed Workshop on Biomedical Natural Language: Proceedings of the 16th BioNLP Workshop Hans Moen, Kai Hakala, Farrokh Mehryary, Laura-Maria Peltonen, Tapio Salakoski, Filip Ginter, Sanna Salanterä
    (
    A4 Vertaisarvioitu artikkeli konferenssijulkaisussa)


  • Distributional Semantics of the Partitive A Argument Construction in Finnish  (2017)  Empirical Approaches to Cognitive Linguistics: Analysing Real-Life Data Huumo Tuomas, Kyröläinen Aki-Juhani, Kanerva Jenna, Luotolahti M. Juhani, Salakoski Tapio, Ginter Filip, Laippala Veronika
    (
    A3 Vertaisarvioitu kirjan tai muun kokoomateoksen osa)


  • End-to-End System for Bacteria Habitat Extraction  (2017)  SIGBioMed Workshop on Biomedical Natural Language: Proceedings of the 16th BioNLP Workshop Farrokh Mehryary, Kai Hakala, Suwisa Kaewphan, Jari Björne, Tapio Salakoski, Filip Ginter
    (
    A4 Vertaisarvioitu artikkeli konferenssijulkaisussa)


  • Ensemble of Convolutional Neural Networks for Medicine Intake Recognition in Twitter  (2017)  
    • CEUR Workshop Proceedings
    Proceedings of the 2nd Social Media Mining for Health Research and Applications Workshop (SMM4H 2017) Kai Hakala, Farrokh Mehryary, Hans Moen, Suwisa Kaewphan, Tapio Salakoski, Filip Ginter
    (
    A4 Vertaisarvioitu artikkeli konferenssijulkaisussa)


  • EPE 2017: The Biomedical Event Extraction Downstream Application  (2017)  Proceedings of the 2017 Shared Task on Extrinsic Parser Evaluation (EPE 2017) at the Fourth International Conference on Dependency Linguistics (Depling 2017) and the 15th International Conference on Parsing Technologies (IWTP 2017) Björne J, Ginter F, Salakoski T
    (
    A4 Vertaisarvioitu artikkeli konferenssijulkaisussa)


  • Fully Delexicalized Contexts for Syntax-Based Word Embeddings  (2017)  
    • Linköping Electronic Conference Proceedings
    Proceedings of the International Conference on Dependency Linguistics (Depling'17) Kanerva Jenna, Pyysalo Sampo, Ginter Filip
    (
    A4 Vertaisarvioitu artikkeli konferenssijulkaisussa)


  • The 2017 Shared Task on Extrinsic Parser Evaluation. Towards a Reusable Community Infrastructure  (2017)  Proceedings of the 2017 Shared Task on Extrinsic Parser Evaluation (EPE 2017) at the Fourth International Conference on Dependency Linguistics (Depling 2017) and the 15th International Conference on Parsing Technologies (IWTP 2017) Oepen S, Øvrelid L, Björne J, Johansson R, Lapponi E, Ginter F, Velldal E
    (
    A4 Vertaisarvioitu artikkeli konferenssijulkaisussa)


  • TurkuNLP: Delexicalized Pre-training of Word Embeddings for Dependency Parsing  (2017)  
    • Annual Meeting of the Association for Computational Linguistics
    Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies Kanerva Jenna, Luotolahti Juhani, Ginter Filip
    (
    A4 Vertaisarvioitu artikkeli konferenssijulkaisussa)


  • An expanded evaluation of protein function prediction methods shows an improvement in accuracy  (2016)  
    • Genome Biology
     Jiang YX, Oron TR, Clark WT, Bankapur AR, D'Andrea D, Lepore R, Funk CS, Kahanda I, Verspoor KM, Ben-Hur A, Koo DCE, Penfold-Brown D, Shasha D, Youngs N, Bonneau R, Lin A, Sahraeian SME, Martelli PL, Profiti G, Casadio R, Cao RZ, Zhong Z, Cheng JL, Altenhoff A, Skunca N, Dessimoz C, Dogan T, Hakala K, Kaewphan S, Mehryary F, Salakoski T, Ginter F, Fang H, Smithers B, Oates M, Gough J, Toronen P, Koskinen P, Holm L, Chen CT, Hsu WL, Bryson K, Cozzetto D, Minneci F, Jones DT, Chapman S, Dukka BKC, Khan IK, Kihara D, Ofer D, Rappoport N, Stern A, Cibrian-Uhalte E, Denny P, Foulger RE, Hieta R, Legge D, Lovering RC, Magrane M, Melidoni AN, Mutowo-Meullenet P, Pichler K, Shypitsyna A, Li B, Zakeri P, ElShal S, Tranchevent LC, Das S, Dawson NL, Lee D, Lees JG, Sillitoe I, Bhat P, Nepusz T, Romero AE, Sasidharan R, Yang HX, Paccanaro A, Gillis J, Sedeno-Cortes AE, Pavlidis P, Feng S, Cejuela JM, Goldberg T, Hamp T, Richter L, Salamov A, Gabaldon T, Marcet-Houben M, Supek F, Gong QT, Ning W, Zhou YP, Tian WD, Falda M, Fontana P, Lavezzo E, Toppo S, Ferrari C, Giollo M, Piovesan D, Tosatto SCE, del Pozo A, Fernandez JM, Maietta P, Valencia A, Tress ML, Benso A, Di Carlo S, Politano G, Savino A, Rehman HU, Re M, Mesiti M, Valentini G, Bargsten JW, van Dijk ADJ, Gemovic B, Glisic S, Perovic V, Veljkovic V, Veljkovic N, Almeida-e-Silva DC, Vencio RZN, Sharan M, Vogel J, Kansakar L, Zhang S, Vucetic S, Wang Z, Sternberg MJE, Wass MN, Huntley RP, Martin MJ, O'Donovan C, Robinson PN, Moreau Y, Tramontano A, Babbitt PC, Brenner SE, Linial M, Orengo CA, Rost B, Greene CS, Mooney SD, Friedberg I, Radivojac P
    (
    A1 Vertaisarvioitu alkuperäisartikkeli tieteellisessä lehdessä )


  • Cell line name recognition in support of the identification of synthetic lethality in cancer from text – Cell line name recognition  (2016)  
    • Bioinformatics
     Suwisa Kaewphan, Sofie Van Landeghem, Tomoko Ohta, Yves Van de Peer, Filip Ginter, Sampo Pyysalo
    (
    A1 Vertaisarvioitu alkuperäisartikkeli tieteellisessä lehdessä )


  • Cross-Lingual Pronoun Prediction with Deep Recurrent Neural Networks  (2016)  Proceedings of the First Conference on Machine Translation (WMT) Juhani Luotolahti, Jenna Kanerva, Filip Ginter
    (
    A4 Vertaisarvioitu artikkeli konferenssijulkaisussa)


  • Deep Learning With Minimal Training Data: TurkuNLP Entry in the BioNLP Shared Task 2016  (2016)  Proceedings of the 4th BioNLP Shared Task Workshop Farrokh Mehryary, Jari Bjorne, Sampo Pyysalo, Tapio Salakoski, Filip Ginter
    (
    A4 Vertaisarvioitu artikkeli konferenssijulkaisussa)


  • Filtering large-scale event collections using a combination of supervised and unsupervised learning for event trigger classification  (2016)  
    • Journal of Biomedical Semantics
     Mehryary F, Kaewphan S, Hakala K, Ginter F
    (
    B1 Vertaisarvioimaton kirjoitus tieteellisessä lehdessä )


  • Phrase-Based SMT for Finnish with More Data, Better Models and Alternative Alignment and Translation Tools  (2016)  Proceedings of the First Conference on Machine Translation Jörg Tiedemann, Fabienne Cap, Jenna Kanerva, Filip Ginter, Sara Stymne, Robert östling, Marion Di Marco
    (
    A4 Vertaisarvioitu artikkeli konferenssijulkaisussa)



Last updated on