Validating and Extracting Information from National Identification Numbers in R: The Case of Finland and Sweden
Authors: Kantanen, Pyry; Bülow, Erik; Lahtinen, Aleksi; Magnusson, Måns; Paananen, Jussi; Lahti, Leo
Publisher: The R Foundation
Publication year: 2024
Journal: The R Journal
Volume: 16
Issue: 3
First page: 4
Last page: 14
ISSN: 2073-4859
DOI: https://doi.org/10.32614/rj-2024-023
Research portal record: https://research.utu.fi/converis/portal/detail/Publication/505865827
National identification numbers (NIN) and similar identification code systems are widely used for uniquely identifying individuals and organizations in Finland, Sweden, and many other countries. To increase the general understanding of such techniques of identification, openly available methods and tools for NIN analysis and validation are needed. The hetu and sweidnumbr R packages provide functions for extracting embedded information, checking the validity, and generating random but valid numbers in the context of Finnish and Swedish NINs and other identification codes. In this article, we demonstrate these functions from both packages and provide theoretical context and motivation on the importance of the subject matter. Our work contributes to the growing toolkit of standardized methods for computational social science research, epidemiology, demographic studies, and other register-based inquiries.
Funding: LL, PK and AL were supported by the Research Council of Finland: decision 358720 (FIN-CLARIAH research infrastructure) and decision 352604 (Strategic Research Council, YOUNG Despair Research Consortium).