Filip Ginter
figint@utu.fi Office: 4th floor, 451A ORCID identifier: https://orcid.org/0000-0002-5484-6103 |
natural language processing; human language technology; machine learning; deep learning; resource development
human language technology, natural language processing, machine learning applied to human language, both methodological and resource creation research
I am a researcher at the Department of Computing, University of Turku. My research is in the area of natural language processing. I belong to the TurkuNLP (turkunlp.org) research group.
I was born in 1978 in Ostrava, Czech Republic (Czechoslovakia back then). In 2001, I got a M.Sc. (tech) in computer science at the computer science department of VSB - Technical University Ostrava. My major subject was artificial intelligence. I gained a PhD in computer science in 2007. The title of my thesis is Towards Information Extraction in the Biomedical Domain: Methods and Resources.
As of 2022, I am a professor of language technology and as of 2021 the deputy director of the Department of Computing.
My primary field of research is language technology / natural language processing. In my post-PhD career, I have focused on the development of NLP tools and resources primarily for Finnish, but later also numerous other languages via the Universal Dependencies project. My work is heavy on resource development, both in terms of data and machine learning pipelines. Open science and resources play an important role in my research, much of which is carried out in the open on GitHub and as a rule, all resources are openly available for unrestricted use. I work collaboratively, especially with my younger colleagues, rather than striving for deeper, primary author inquiries.
I have been actively teaching since early on during my PhD studies. I independently prepared my first advanced level NLP course in 2004, and since ca. 2008 I have been teaching at least one course every year, substantially more during my bioinformatics lecturer appointment. While a lecturer in the bioinformatics MSc degree programme, I was lecturing international students in two cities. In 2016, I was tasked with developing and coordinating the introduction of a new 20 ECTS study module on natural language processing. This module is, with modifications, still in use and shared between the departments of Languages and Computing, both in terms of teaching and in terms of students. In 2019-2020 and 2020-2021 I was also co-lecturing, upon invitation, two courses in natural language processing in the Arcada University of Applied Sciences in Helsinki.
- Towards the Classification of the Finnish Internet Parsebank: Detecting Translations and Informality (2015) 20th Nordic Conference of Computational Linguistics (Nodalida 2015) Laippala Veronika, Kanerva Jenna, Pyysalo Sampo, Missilä Anna, Salakoski Tapio, Ginter Filip
(A4 Refereed article in a conference publication ) - Towards Universal Web Parsebanks (2015) Proceedings of the International Conference on Dependency Linguistics (Depling'15) Juhani Luotolahti, Jenna Kanerva, Veronika Laippala, Sampo Pyysalo , Filip Ginter
(A4 Refereed article in a conference publication ) - Turku: Semantic Dependency Parsing as a Sequence Classification (2015) Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015) Kanerva Jenna, Luotolahti Juhani, Ginter Filip
(A4 Refereed article in a conference publication ) - Universal Dependencies for Finnish (2015) Proceedings of the 20th Nordic Conference of Computational Linguistics (NODALIDA 2015) Sampo Pyysalo, Jenna Kanerva, Anna Missilä, Veronika Laippala, Filip Ginter
(A4 Refereed article in a conference publication ) - Building the essential resources for Finnish: the Turku Dependency Treebank (2014)
- Language Resources and Evaluation
(A1 Refereed original research article in a scientific journal) - Care Episode Retrieval – Proceedings of the 5th International Workshop on Health Text Mining and Information Analysis (Louhi) (2014) Proceedings of the 5th International Workshop on Health Text Mining and Information Analysis (Louhi) Moen H, Marsi E, Ginter F, Murtola L-M, Salakoski T, Salanterä S
(A4 Refereed article in a conference publication ) - Eliminating Incorrect Events from Large‐Scale Event Networks by Trigger Word Clustering and Pruning (2014) Proceedings of the 6th International Symposium on Semantic Mining in Biomedicine (SMBM 2014) Farrokh Mehryary, Suwisa Kaewphan, Kai Hakala, Filip Ginter
(A4 Refereed article in a conference publication ) - Post-hoc Manipulations of Vector Space Models with Application to Semantic Role Labeling (2014) Proceedings of the 2nd Workshop on Continuous Vector Space Models and their Compositionality (CVSC) @ EACL 2014 Jenna Kanerva, Filip Ginter
(A4 Refereed article in a conference publication ) - Statistical parsing of varieties of clinical Finnish (2014)
- Artificial Intelligence in Medicine
(A1 Refereed original research article in a scientific journal) - Syntactic N-gram Collection from a Large-Scale Corpus of Internet Finnish (2014) Proceedings of the Sixth International Conference Baltic HLT 2014 Jenna Kanerva, Juhani Luotolahti, Veronika Laippala, Filip Ginter
(A4 Refereed article in a conference publication ) - Turku: Broad-Coverage Semantic Parsing with Rich Features (2014) Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014) Jenna Kanerva, Juhani Luotolahti, Filip Ginter
(A4 Refereed article in a conference publication ) - Universal Stanford Dependencies: a Cross-Linguistic Typology (2014) Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14) Marie-Catherine de Marneffe, Timothy Dozat, Natalia Silveira, Katri Haverinen, Filip Ginter, Joakim Nivre, Christopher D. Manning
(A4 Refereed article in a conference publication ) - UTU: Disease Mention Recognition and Normalization with CRFs and Vector Space Representations (2014) Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014) Suwisa Kaewphan, Kai Hakaka, Filip Ginter
(A4 Refereed article in a conference publication ) - A Dependency-based Analysis of Treebank Annotation Errors (2013) Computational Dependency Theory Haverinen Katri, Ginter Filip, Laippala Veronika, Kohonen Samuel, Viljanen Timo, Nyblom Jenna, Salakoski Tapio
(A3 Refereed book chapter or chapter in a compilation book) - Building a Large Automatically Parsed Corpus of Finnish (2013)
- Linköping Electronic Conference Proceedings
(A4 Refereed article in a conference publication ) - Distributional Semantic Resources for Biomedical Text Processing (2013) Proceedings of the 5th International Symposium on Languages in Biology and Medicine (LBM '13) Pyysalo Sampo, Ginter Filip, Moen Hans, Salakoski Tapio, Ananiadou Sophia
(A4 Refereed article in a conference publication ) - Evaluating large-scale text mining applications beyond the traditional numeric performance measures (2013) Proceedings of the 2013 Workshop on Biomedical Natural Language Processing (BioNLP'13) Sofie Van Landeghem, Suwisa Kaewphan, Filip Ginter, Yves Van de Peer
(A4 Refereed article in a conference publication ) - EVEX in ST'13: Application of a large-scale text mining resource to event extraction and network construction (2013) Proceedings of the BioNLP Shared Task 2013 Workshop (BioNLP-ST'13) Kai Hakala, Sofie Van Landeghem, Tapio Salakoski, Yves Van de Peer, Filip Ginter
(A4 Refereed article in a conference publication ) - Hypothesis Generation in Large-Scale Event Networks (2013) Proceedings of the 5th International Symposium on Languages in Biology and Medicine (LBM'13) Hakala Kai, Mehryary Farrokh, Kaewphan Suwisa, Ginter Filip
(A4 Refereed article in a conference publication ) - Joint Morphological and Syntactic Analysis for Richly Inflected Languages (2013)
- Transactions of the Association for Computational Linguistics
(A1 Refereed original research article in a scientific journal)