A1 Refereed original research article in a scientific journal

Extracting Geographical References from Finnish Literature: Fully Automated Processing of Plain-Text Corpora




Authors: Kiiskinen Harri, Nivala Asko, Westerlund Jasmine, Saarelainen Juhana

Publication year: 2023

Journal: Journal of Computational Literary Studies

Journal acronym: JCLS

Volume: 2

Issue: 1

First page: 1

Last page: 20

DOI: https://doi.org/10.48694/jcls.3584

Web address: https://doi.org/10.48694/jcls.3584

Self-archived copy’s web address: https://research.utu.fi/converis/portal/detail/Publication/380977362


Abstract

In the Atlas of Finnish Literature 1870-1940 project, we extract geographical information from a Finnish-language corpus of literary texts published between 1870 and 1940. The texts are converted from plain text to TEI/XML and then processed with named entity recognition and linking tools. The results are presented in a web-based environment. This article describes the technical structure of the analysis chain, the tools used, and the metaprocesses used to manage the research dataset.
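
The first step mentioned in the abstract, converting plain text to TEI/XML, can be illustrated with a minimal Python sketch. This is not the project's actual tooling, which the abstract does not specify. The function name plain_text_to_tei, the splitting of paragraphs on blank lines, and the placeholder header contents are all assumptions made for illustration, and the output is not validated against a TEI schema.

```python
import re
import xml.etree.ElementTree as ET

# Namespace used by TEI P5 documents.
TEI_NS = "http://www.tei-c.org/ns/1.0"


def plain_text_to_tei(title: str, text: str) -> str:
    """Wrap blank-line-separated paragraphs in a minimal TEI skeleton.

    Hypothetical helper: the header holds only a title and placeholder
    statements, so the result is a sketch rather than a complete record.
    """
    ET.register_namespace("", TEI_NS)

    def tag(name: str) -> str:
        return f"{{{TEI_NS}}}{name}"

    tei = ET.Element(tag("TEI"))

    # Minimal header: fileDesc with title, publication and source statements.
    file_desc = ET.SubElement(ET.SubElement(tei, tag("teiHeader")), tag("fileDesc"))
    title_stmt = ET.SubElement(file_desc, tag("titleStmt"))
    ET.SubElement(title_stmt, tag("title")).text = title
    publication = ET.SubElement(file_desc, tag("publicationStmt"))
    ET.SubElement(publication, tag("p")).text = "Unpublished working copy."
    source = ET.SubElement(file_desc, tag("sourceDesc"))
    ET.SubElement(source, tag("p")).text = "Converted from a plain-text file."

    # Body: one <p> per paragraph, with internal line breaks collapsed.
    body = ET.SubElement(ET.SubElement(tei, tag("text")), tag("body"))
    for block in re.split(r"\n\s*\n", text):
        paragraph = " ".join(block.split())
        if paragraph:
            ET.SubElement(body, tag("p")).text = paragraph

    return ET.tostring(tei, encoding="unicode")
```

In a chain like the one described, named entity recognition and linking would then operate on the paragraph elements; those stages are omitted here because the abstract does not name the tools used.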


Downloadable publication

This is an electronic reprint of the original article.
This reprint may differ from the original in pagination and typographic detail. Please cite the original version.





Last updated on 2024-11-26 at 13:02