A1 Vertaisarvioitu alkuperäisartikkeli tieteellisessä lehdessä
Validating and Extracting Information from National Identification Numbers in R: The Case of Finland and Sweden
Tekijät: Kantanen, Pyry; Bülow, Erik; Lahtinen, Aleksi; Magnusson, Måns; Paananen, Jussi; Lahti, Leo
Kustantaja: The R Foundation
Julkaisuvuosi: 2024
Lehti: The R journal
Vuosikerta: 16
Numero: 3
Aloitussivu: 4
Lopetussivu: 14
eISSN: 2073-4859
DOI: https://doi.org/10.32614/rj-2024-023
Julkaisun avoimuus kirjaamishetkellä: Avoimesti saatavilla
Julkaisukanavan avoimuus : Kokonaan avoin julkaisukanava
Verkko-osoite: https://doi.org/10.32614/rj-2024-023
Rinnakkaistallenteen osoite: https://research.utu.fi/converis/portal/detail/Publication/505865827
National identification numbers (NIN) and similar identification code systems are widely used for uniquely identifying individuals and organizations in Finland, Sweden, and many other countries. To increase the general understanding of such techniques of identification, openly available methods and tools for NIN analysis and validation are needed. The hetu and sweidnumbr R packages provide functions for extracting embedded information, checking the validity, and generating random but valid numbers in the context of Finnish and Swedish NINs and other identification codes. In this article, we demonstrate these functions from both packages and provide theoretical context and motivation on the importance of the subject matter. Our work contributes to the growing toolkit of standardized methods for computational social science research, epidemiology, demographic studies, and other register-based inquiries.
Ladattava julkaisu This is an electronic reprint of the original article. |
Julkaisussa olevat rahoitustiedot:
LL, PK and AL were supported by the Research Council of Finland: decision 358720 (FIN-CLARIAH research infrastructure) and decision 352604 (Strategic Research Council, YOUNG Despair Research Consortium).