A4 Refereed article in a conference publication 
Scaling up bibliographic data science
Authors: Tolonen M., Marjanen J., Roivainen H., Lahti L.
Editors: Costanza Navarretta, Manex Agirrezabal, Bente Maegaard
Conference name: Digital Humanities in the Nordic Countries
Publisher: CEUR-WS
Publication year: 2019
Journal:CEUR Workshop Proceedings
Book title : Proceedings of the Digital Humanities in the Nordic Countries 4th Conference
Journal name in sourceCEUR Workshop Proceedings
Series title: CEUR Workshop Proceedings
Volume: 2364
First page : 450
Last page: 456
ISSN: 1613-0073
Web address : http://ceur-ws.org/Vol-2364/41_paper.pdf
Self-archived copy’s web address: https://research.utu.fi/converis/portal/detail/Publication/41240748
Bibliographic data science is an emerging research paradigm in digital 
humanities. It aims at systematic quantification of the trends in 
knowledge production based on large-scale analysis of bibliographic 
metadata collections and the methods of modern data science. Compared to
 the earlier related attempts in book history and sociology of 
literature, advances in data processing and quality control are now 
making it possible for the first time to scale up the analysis to 
millions of print products while at the same time paying attention to 
data quality, representativity and completeness. This provides a new 
quantitative method that can support the analysis of classical research 
questions in intellectual history. Here, we discuss the methodological 
challenges that we have encountered in such studies and how to scale up 
the solutions based on collaborative research efforts.
Downloadable publication  This is an electronic reprint of the original article.  |