A4 Refereed article in a conference publication

Applying BLAST to Text Reuse Detection in Finnish Newspapers and Journals, 1771–1910




Authors: Aleksi Vesanto, Asko Nivala, Heli Rantala, Tapio Salakoski, Hannu Salmi, Filip Ginter

Editors: Gerlof Bouma, Yvonne Adesam

Conference name: Workshop on Processing Historical Language

Publishing place: Gothenburg

Publication year: 2017

Book title: Proceedings of the NoDaLiDa 2017 Workshop on Processing Historical Language

Series title: NEALT Proceedings Series

Number in series: 133

Volume: 32

First page: 54

Last page: 58

ISBN: 978-91-7685-503-4

ISSN: 1650-3686

Web address: http://www.ep.liu.se/ecp/133/010/ecp17133010.pdf

Self-archived copy's web address: https://research.utu.fi/converis/portal/detail/Publication/20562472


Abstract







We present the results of text reuse detection based on the corpus of scanned and OCR-recognized Finnish newspapers and journals from 1771 to 1910. Our study draws on BLAST, software created for comparing and aligning biological sequences. We show different types of text reuse in this corpus and also present a comparison to Passim, software for text reuse detection developed at Northeastern University in Boston.





Downloadable publication

This is an electronic reprint of the original article.
This reprint may differ from the original in pagination and typographic detail. Please cite the original version.





Last updated on 2024-11-26 at 19:41