Validating and Extracting Information from National Identification Numbers in R: The Case of Finland and Sweden




Kantanen, Pyry; Bülow, Erik; Lahtinen, Aleksi; Magnusson, Måns; Paananen, Jussi; Lahti, Leo

PublisherThe R Foundation

2024

 The R journal

16

3

4

14

2073-4859

DOIhttps://doi.org/10.32614/rj-2024-023

https://doi.org/10.32614/rj-2024-023

https://research.utu.fi/converis/portal/detail/Publication/505865827



National identification numbers (NIN) and similar identification code systems are widely used for uniquely identifying individuals and organizations in Finland, Sweden, and many other countries. To increase the general understanding of such techniques of identification, openly available methods and tools for NIN analysis and validation are needed. The hetu and sweidnumbr R packages provide functions for extracting embedded information, checking the validity, and generating random but valid numbers in the context of Finnish and Swedish NINs and other identification codes. In this article, we demonstrate these functions from both packages and provide theoretical context and motivation on the importance of the subject matter. Our work contributes to the growing toolkit of standardized methods for computational social science research, epidemiology, demographic studies, and other register-based inquiries.


LL, PK and AL were supported by the Research Council of Finland: decision 358720 (FIN-CLARIAH research infrastructure) and decision 352604 (Strategic Research Council, YOUNG Despair Research Consortium).


Last updated on 12/12/2025 03:13:26 PM