A1 Refereed original research article in a scientific journal
Extracting Geographical References from Finnish Literature: Fully Automated Processing of Plain-Text Corpora
Authors: Kiiskinen Harri, Nivala Asko, Westerlund Jasmine, Saarelainen Juhana
Publication year: 2023
Journal: Journal of Computational Literary Studies
Journal acronym: JCLS
Volume: 2
Issue: 1
First page : 1
Last page: 20
DOI: https://doi.org/10.48694/jcls.3584
Web address : https://doi.org/10.48694/jcls.3584
Self-archived copy’s web address: https://research.utu.fi/converis/portal/detail/Publication/380977362
In the Atlas of Finnish Literature 1870-1940 project, we extract geo- graphical information from a Finnish-language corpus of literary texts published between 1870 and 1940. The texts are transformed from plain texts to TEI/XML, and further processed with named entity recognition and linking tools. The results are presented in a web-based environment. This article describes the technical structure of the analysis chain, the tools used and the metaprocesses used to manage the research dataset.
Downloadable publication This is an electronic reprint of the original article. |