Filip Ginter
figint@utu.fi Työhuone: 4th floor, 451A ORCID-tunniste: https://orcid.org/0000-0002-5484-6103 |
natural language processing; human language technology; machine learning; deep learning; resource development
human language technology, natural language processing, machine learning applied to human language, both methodological and resource creation research
I am a researcher at the Department of Computing, University of Turku. My research is in the area of natural language processing. I belong to the TurkuNLP (turkunlp.org) research group.
I was born in 1978 in Ostrava, Czech Republic (Czechoslovakia back then). In 2001, I got a M.Sc. (tech) in computer science at the computer science department of VSB - Technical University Ostrava. My major subject was artificial intelligence. I gained a PhD in computer science in 2007. The title of my thesis is Towards Information Extraction in the Biomedical Domain: Methods and Resources.
As of 2022, I am a professor of language technology and as of 2021 the deputy director of the Department of Computing.
My primary field of research is language technology / natural language processing. In my post-PhD career, I have focused on the development of NLP tools and resources primarily for Finnish, but later also numerous other languages via the Universal Dependencies project. My work is heavy on resource development, both in terms of data and machine learning pipelines. Open science and resources play an important role in my research, much of which is carried out in the open on GitHub and as a rule, all resources are openly available for unrestricted use. I work collaboratively, especially with my younger colleagues, rather than striving for deeper, primary author inquiries.
I have been actively teaching since early on during my PhD studies. I independently prepared my first advanced level NLP course in 2004, and since ca. 2008 I have been teaching at least one course every year, substantially more during my bioinformatics lecturer appointment. While a lecturer in the bioinformatics MSc degree programme, I was lecturing international students in two cities. In 2016, I was tasked with developing and coordinating the introduction of a new 20 ECTS study module on natural language processing. This module is, with modifications, still in use and shared between the departments of Languages and Computing, both in terms of teaching and in terms of students. In 2019-2020 and 2020-2021 I was also co-lecturing, upon invitation, two courses in natural language processing in the Arcada University of Applied Sciences in Helsinki.
- Towards the Classification of the Finnish Internet Parsebank: Detecting Translations and Informality (2015) 20th Nordic Conference of Computational Linguistics (Nodalida 2015) Laippala Veronika, Kanerva Jenna, Pyysalo Sampo, Missilä Anna, Salakoski Tapio, Ginter Filip
(A4 Vertaisarvioitu artikkeli konferenssijulkaisussa) - Towards Universal Web Parsebanks (2015) Proceedings of the International Conference on Dependency Linguistics (Depling'15) Juhani Luotolahti, Jenna Kanerva, Veronika Laippala, Sampo Pyysalo , Filip Ginter
(A4 Vertaisarvioitu artikkeli konferenssijulkaisussa) - Turku: Semantic Dependency Parsing as a Sequence Classification (2015) Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015) Kanerva Jenna, Luotolahti Juhani, Ginter Filip
(A4 Vertaisarvioitu artikkeli konferenssijulkaisussa) - Universal Dependencies for Finnish (2015) Proceedings of the 20th Nordic Conference of Computational Linguistics (NODALIDA 2015) Sampo Pyysalo, Jenna Kanerva, Anna Missilä, Veronika Laippala, Filip Ginter
(A4 Vertaisarvioitu artikkeli konferenssijulkaisussa) - Building the essential resources for Finnish: the Turku Dependency Treebank (2014)
- Language Resources and Evaluation
(A1 Vertaisarvioitu alkuperäisartikkeli tieteellisessä lehdessä ) - Care Episode Retrieval – Proceedings of the 5th International Workshop on Health Text Mining and Information Analysis (Louhi) (2014) Proceedings of the 5th International Workshop on Health Text Mining and Information Analysis (Louhi) Moen H, Marsi E, Ginter F, Murtola L-M, Salakoski T, Salanterä S
(A4 Vertaisarvioitu artikkeli konferenssijulkaisussa) - Eliminating Incorrect Events from Large‐Scale Event Networks by Trigger Word Clustering and Pruning (2014) Proceedings of the 6th International Symposium on Semantic Mining in Biomedicine (SMBM 2014) Farrokh Mehryary, Suwisa Kaewphan, Kai Hakala, Filip Ginter
(A4 Vertaisarvioitu artikkeli konferenssijulkaisussa) - Post-hoc Manipulations of Vector Space Models with Application to Semantic Role Labeling (2014) Proceedings of the 2nd Workshop on Continuous Vector Space Models and their Compositionality (CVSC) @ EACL 2014 Jenna Kanerva, Filip Ginter
(A4 Vertaisarvioitu artikkeli konferenssijulkaisussa) - Statistical parsing of varieties of clinical Finnish (2014)
- Artificial Intelligence in Medicine
(A1 Vertaisarvioitu alkuperäisartikkeli tieteellisessä lehdessä ) - Syntactic N-gram Collection from a Large-Scale Corpus of Internet Finnish (2014) Proceedings of the Sixth International Conference Baltic HLT 2014 Jenna Kanerva, Juhani Luotolahti, Veronika Laippala, Filip Ginter
(A4 Vertaisarvioitu artikkeli konferenssijulkaisussa) - Turku: Broad-Coverage Semantic Parsing with Rich Features (2014) Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014) Jenna Kanerva, Juhani Luotolahti, Filip Ginter
(A4 Vertaisarvioitu artikkeli konferenssijulkaisussa) - Universal Stanford Dependencies: a Cross-Linguistic Typology (2014) Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14) Marie-Catherine de Marneffe, Timothy Dozat, Natalia Silveira, Katri Haverinen, Filip Ginter, Joakim Nivre, Christopher D. Manning
(A4 Vertaisarvioitu artikkeli konferenssijulkaisussa) - UTU: Disease Mention Recognition and Normalization with CRFs and Vector Space Representations (2014) Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014) Suwisa Kaewphan, Kai Hakaka, Filip Ginter
(A4 Vertaisarvioitu artikkeli konferenssijulkaisussa) - A Dependency-based Analysis of Treebank Annotation Errors (2013) Computational Dependency Theory Haverinen Katri, Ginter Filip, Laippala Veronika, Kohonen Samuel, Viljanen Timo, Nyblom Jenna, Salakoski Tapio
(A3 Vertaisarvioitu kirjan tai muun kokoomateoksen osa) - Building a Large Automatically Parsed Corpus of Finnish (2013)
- Linköping Electronic Conference Proceedings
(A4 Vertaisarvioitu artikkeli konferenssijulkaisussa) - Distributional Semantic Resources for Biomedical Text Processing (2013) Proceedings of the 5th International Symposium on Languages in Biology and Medicine (LBM '13) Pyysalo Sampo, Ginter Filip, Moen Hans, Salakoski Tapio, Ananiadou Sophia
(A4 Vertaisarvioitu artikkeli konferenssijulkaisussa) - Evaluating large-scale text mining applications beyond the traditional numeric performance measures (2013) Proceedings of the 2013 Workshop on Biomedical Natural Language Processing (BioNLP'13) Sofie Van Landeghem, Suwisa Kaewphan, Filip Ginter, Yves Van de Peer
(A4 Vertaisarvioitu artikkeli konferenssijulkaisussa) - EVEX in ST'13: Application of a large-scale text mining resource to event extraction and network construction (2013) Proceedings of the BioNLP Shared Task 2013 Workshop (BioNLP-ST'13) Kai Hakala, Sofie Van Landeghem, Tapio Salakoski, Yves Van de Peer, Filip Ginter
(A4 Vertaisarvioitu artikkeli konferenssijulkaisussa) - Hypothesis Generation in Large-Scale Event Networks (2013) Proceedings of the 5th International Symposium on Languages in Biology and Medicine (LBM'13) Hakala Kai, Mehryary Farrokh, Kaewphan Suwisa, Ginter Filip
(A4 Vertaisarvioitu artikkeli konferenssijulkaisussa) - Joint Morphological and Syntactic Analysis for Richly Inflected Languages (2013)
- Transactions of the Association for Computational Linguistics
(A1 Vertaisarvioitu alkuperäisartikkeli tieteellisessä lehdessä )