Filip Ginter
figint@utu.fi
Office: 4th floor, 451A
ORCID: https://orcid.org/0000-0002-5484-6103
natural language processing; human language technology; machine learning; deep learning; resource development
I am a researcher at the Department of Computing, University of Turku. My research is in the area of natural language processing. I belong to the TurkuNLP (turkunlp.org) research group.
I was born in 1978 in Ostrava, Czech Republic (Czechoslovakia back then). In 2001, I received an M.Sc. (tech) in computer science from the computer science department of VSB - Technical University Ostrava. My major subject was artificial intelligence. I received a PhD in computer science in 2007. The title of my thesis is Towards Information Extraction in the Biomedical Domain: Methods and Resources.
I have been a professor of language technology since 2022 and the deputy director of the Department of Computing since 2021.
My primary field of research is language technology / natural language processing. In my post-PhD career, I have focused on the development of NLP tools and resources, primarily for Finnish but later also for numerous other languages via the Universal Dependencies project. My work is heavy on resource development, both in terms of data and machine learning pipelines. Open science and open resources play an important role in my research: much of it is carried out in the open on GitHub and, as a rule, all resources are openly available for unrestricted use. I work collaboratively, especially with my younger colleagues, rather than pursuing deeper inquiries as a primary author.
I have been actively teaching since early in my PhD studies. I independently prepared my first advanced-level NLP course in 2004, and since ca. 2008 I have taught at least one course every year, substantially more during my bioinformatics lecturer appointment. As a lecturer in the bioinformatics MSc degree programme, I taught international students in two cities. In 2016, I was tasked with developing and coordinating the introduction of a new 20 ECTS study module on natural language processing. This module is, with modifications, still in use and is shared between the departments of Languages and Computing, in terms of both teaching and students. In 2019-2020 and 2020-2021, I also co-lectured, upon invitation, two courses in natural language processing at the Arcada University of Applied Sciences in Helsinki.
- Integrating Large-Scale Text Mining and Co-Expression Networks: Targeting NADP(H) Metabolism in E. coli with Event Extraction (2012) Third Workshop on Building and Evaluating Resources for Biomedical Text Mining (BioTxtM 2012) Suwisa Kaewphan, Sanna Kreula, Sofie Van Landeghem, Yves Van de Peer, Patrik R Jones, Filip Ginter (Refereed article in conference proceedings (A4))
- PubMed-Scale Event Extraction for Post-Translational Modifications, Epigenetics and Protein Structural Relations (2012) BioNLP: Proceedings of the 2012 Workshop on Biomedical Natural Language Processing Björne J, Van Landeghem S, Pyysalo S, Ohta T, Ginter F, Van de Peer Y, Ananiadou S, Salakoski T (Refereed article in conference proceedings (A4))
- University of Turku in the BioNLP'11 Shared Task (2012) BMC Bioinformatics (Refereed journal article or data article (A1))
- A Dependency-based Analysis of Treebank Annotation Errors (2011) Proceedings of International Conference on Dependency Linguistics (Depling'11), Barcelona, Spain Haverinen K, Ginter F, Laippala V, Kohonen S, Viljanen T, Nyblom J, Salakoski T (Refereed article in conference proceedings (A4))
- EVEX: A PubMed-Scale Resource for Homology-Based Generalization of Text Mining Predictions (2011) Proceedings of BioNLP'11 Workshop Van Landeghem S, Ginter F, Van de Peer Y, Salakoski T (Refereed article in conference proceedings (A4))
- Extracting Contextualized Complex Biological Events with Rich Graph-Based Feature Sets (2011) Computational Intelligence (Refereed journal article or data article (A1))
- U-Compare bio-event meta-service: compatible BioNLP event extraction services (2011) BMC Bioinformatics (Refereed journal article or data article (A1))
- Complex Event Extraction at PubMed Scale (2010) Bioinformatics (Refereed journal article or data article (A1))
- Dependency-Based PropBanking of Clinical Finnish (2010) Proceedings of The Fourth Linguistic Annotation Workshop (LAW IV) Haverinen K, Ginter F, Laippala V, Viljanen T, Salakoski T (Refereed article in conference proceedings (A4))
- Scaling up Biomedical Event Extraction to the Entire PubMed (2010) Proceedings of the 2010 Workshop on Biomedical Natural Language Processing (BioNLP'10) Björne J, Ginter F, Pyysalo S, Tsujii J, Salakoski T (Refereed article in conference proceedings (A4))
- Treebanking Finnish (2010) NEALT proceedings series (Refereed article in conference proceedings (A4))
- Extracting Complex Biological Events with Rich Graph-Based Feature Sets (2009) Proceedings of the BioNLP 2009 Workshop Companion Volume for Shared Task Björne J, Heimonen J, Ginter F, Airola A, Pahikkala T, Salakoski T (Refereed article in conference proceedings (A4))
- A Graph Kernel for Protein-Protein Interaction Extraction (2008) Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing (BioNLP 2008) Airola A, Pyysalo S, Björne J, Pahikkala T, Ginter F, Salakoski T (Refereed article in conference proceedings (A4))
- All-paths graph kernel for protein-protein interaction extraction with evaluation of cross-corpus learning (2008) BMC Bioinformatics (Refereed journal article or data article (A1))
- Comparative analysis of five protein-protein interaction corpora (2008) BMC Bioinformatics (Refereed journal article or data article (A1))
- Machine Learning to Automate the Assignment of Diagnosis Codes to Free-text Radiology Reports: a Method Description (2008) Proceedings of the ICML/UAI workshop on Machine Learning in health care applications Suominen H, Ginter F, Pyysalo S, Airola A, Pahikkala T, Salanterä S, Salakoski T (Refereed article in conference proceedings (A4))
- BioInfer: a corpus for information extraction in the biomedical domain (2007) BMC Bioinformatics (Refereed journal article or data article (A1))
- Contextual weighting for Support Vector Machines in literature mining: an application to gene versus protein name disambiguation (2005) BMC Bioinformatics (Refereed journal article or data article (A1))