A4 Refereed article in a conference publication

Scaling up bibliographic data science




AuthorsTolonen M., Marjanen J., Roivainen H., Lahti L.

EditorsCostanza Navarretta, Manex Agirrezabal, Bente Maegaard

Conference nameDigital Humanities in the Nordic Countries

PublisherCEUR-WS

Publication year2019

JournalCEUR Workshop Proceedings

Book title Proceedings of the Digital Humanities in the Nordic Countries 4th Conference

Journal name in sourceCEUR Workshop Proceedings

Series titleCEUR Workshop Proceedings

Volume2364

First page 450

Last page456

ISSN1613-0073

Web address http://ceur-ws.org/Vol-2364/41_paper.pdf

Self-archived copy’s web addresshttps://research.utu.fi/converis/portal/detail/Publication/41240748


Abstract

Bibliographic data science is an emerging research paradigm in digital
humanities. It aims at systematic quantification of the trends in
knowledge production based on large-scale analysis of bibliographic
metadata collections and the methods of modern data science. Compared to
the earlier related attempts in book history and sociology of
literature, advances in data processing and quality control are now
making it possible for the first time to scale up the analysis to
millions of print products while at the same time paying attention to
data quality, representativity and completeness. This provides a new
quantitative method that can support the analysis of classical research
questions in intellectual history. Here, we discuss the methodological
challenges that we have encountered in such studies and how to scale up
the solutions based on collaborative research efforts.


Downloadable publication

This is an electronic reprint of the original article.
This reprint may differ from the original in pagination and typographic detail. Please cite the original version.





Last updated on 2024-26-11 at 16:17