Filip Ginter
figint@utu.fi : 4th floor, 451A |
natural language processing; human language technology; machine learning; deep learning; resource development
human language technology, natural language processing, machine learning applied to human language, both methodological and resource creation research
I am a researcher at the Department of Computing, University of Turku. My research is in the area of natural language processing. I belong to the TurkuNLP (turkunlp.org) research group.
I was born in 1978 in Ostrava, Czech Republic (Czechoslovakia back then). In 2001, I got a M.Sc. (tech) in computer science at the computer science department of VSB - Technical University Ostrava. My major subject was artificial intelligence. I gained a PhD in computer science in 2007. The title of my thesis is Towards Information Extraction in the Biomedical Domain: Methods and Resources.
As of 2022, I am a professor of language technology and as of 2021 the deputy director of the Department of Computing.
My primary field of research is language technology / natural language processing. In my post-PhD career, I have focused on the development of NLP tools and resources primarily for Finnish, but later also numerous other languages via the Universal Dependencies project. My work is heavy on resource development, both in terms of data and machine learning pipelines. Open science and resources play an important role in my research, much of which is carried out in the open on GitHub and as a rule, all resources are openly available for unrestricted use. I work collaboratively, especially with my younger colleagues, rather than striving for deeper, primary author inquiries.
I have been actively teaching since early on during my PhD studies. I independently prepared my first advanced level NLP course in 2004, and since ca. 2008 I have been teaching at least one course every year, substantially more during my bioinformatics lecturer appointment. While a lecturer in the bioinformatics MSc degree programme, I was lecturing international students in two cities. In 2016, I was tasked with developing and coordinating the introduction of a new 20 ECTS study module on natural language processing. This module is, with modifications, still in use and shared between the departments of Languages and Computing, both in terms of teaching and in terms of students. In 2019-2020 and 2020-2021 I was also co-lecturing, upon invitation, two courses in natural language processing in the Arcada University of Applied Sciences in Helsinki.
- The 2017 Shared Task on Extrinsic Parser Evaluation. Towards a Reusable Community Infrastructure (2017) Proceedings of the 2017 Shared Task on Extrinsic Parser Evaluation (EPE 2017) at the Fourth International Conference on Dependency Linguistics (Depling 2017) and the 15th International Conference on Parsing Technologies (IWTP 2017) Oepen S, Øvrelid L, Björne J, Johansson R, Lapponi E, Ginter F, Velldal E
- TurkuNLP: Delexicalized Pre-training of Word Embeddings for Dependency Parsing (2017)
- Annual Meeting of the Association for Computational Linguistics
- An expanded evaluation of protein function prediction methods shows an improvement in accuracy (2016)
- Genome Biology
- Cell line name recognition in support of the identification of synthetic lethality in cancer from text – Cell line name recognition (2016)
- Bioinformatics
- Cross-Lingual Pronoun Prediction with Deep Recurrent Neural Networks (2016) Proceedings of the First Conference on Machine Translation (WMT) Juhani Luotolahti, Jenna Kanerva, Filip Ginter
- Deep Learning With Minimal Training Data: TurkuNLP Entry in the BioNLP Shared Task 2016 (2016) Proceedings of the 4th BioNLP Shared Task Workshop Farrokh Mehryary, Jari Bjorne, Sampo Pyysalo, Tapio Salakoski, Filip Ginter
- Filtering large-scale event collections using a combination of supervised and unsupervised learning for event trigger classification (2016)
- Journal of Biomedical Semantics
- Phrase-Based SMT for Finnish with More Data, Better Models and Alternative Alignment and Translation Tools (2016) Proceedings of the First Conference on Machine Translation Jörg Tiedemann, Fabienne Cap, Jenna Kanerva, Filip Ginter, Sara Stymne, Robert östling, Marion Di Marco
- Sentence-initial discourse markers in the Finnish Internet (2016) Conference Handbook Laippala Veronika, Kyröläinen Aki-Juhani,Komppa Johanna, Vilkuna Maria, Kalliokoski Jyrki, Ginter Filip
- Syntactic analyses and named entity recognition for PubMed and. PubMed Central — up-to-the-minute (2016) Proceedings of the 15th Workshop on Biomedical Natural Language Processing (BioNLP) Kai Hakala, Suwisa Kaewphan, Tapio Salakoski, Filip Ginter
- Universal dependencies for Persian (2016) Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016) Seraji Mojgan, Ginter Filip, Nivre Joakim
- Universal Dependencies v1: A Multilingual Treebank Collection (2016) Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016) Joakim Nivre, Marie-Catherine de Marneffe, Filip Ginter, Yoav Goldberg, Jan Hajic, Christopher D. Manning, Ryan McDonald, Slav Petrov, Sampo Pyysalo, Natalia Silveira, Reut Tsarfaty, Daniel Zeman
- Application of the EVEX resource to event extraction and network construction: Shared Task entry and result analysis (2015)
- BMC Bioinformatics
- Care episode retrieval: distributional semantic models for information retrieval in the clinical domain (2015)
- BMC Medical Informatics and Decision Making
- Morphological Segmentation and OPUS for Finnish-English Machine Translation (2015) Tiedemann Jörg, Ginter Filip, Kanerva Jenna
- Sentence Compression for Automatic Subtitling (2015) Proceedings of NoDaLiDa 2015 Juhani Luotolahti, Filip Ginter
- SETS: Scalable and Efficient Tree Search in Dependency Graphs (2015) Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Demonstrations Juhani Luotolahti, Jenna Kanerva, Sampo Pyysalo, Filip Ginter
- Sharing annotations better: RESTful Open Annotation (2015) Proceedings of ACL-IJCNLP 2015 System Demonstrations Pyysalo Sampo, Campos Jorge, Cejuela Juan Miguel, Ginter Filip, Hakala Kai, Li Chen, Stenetorp Pontus, Jensen Lars Juhl
- Syntactic Ngrams as Keystructures Reflecting Typical Syntactic Patterns of Corpora in Finnish (2015)
- Procedia Social and Behavioral Sciences
- The Finnish Proposition Bank (2015)
- Language Resources and Evaluation