Power, sample size and sampling costs for clustered data




Tokola K, Larocque D, Nevalainen J, Oja H

PublisherELSEVIER SCIENCE BV

2011

Statistics and Probability Letters

STATISTICS & PROBABILITY LETTERS

STAT PROBABIL LETT

7

81

7

852

860

9

0167-7152

DOIhttps://doi.org/10.1016/j.spl.2011.02.006(external)

https://research.utu.fi/converis/portal/Publication/1870956(external)



The data collected in epidemiological or clinical studies are frequently clustered. In such settings, appropriate variance adjustments must be made in order to estimate the sufficient sample size correctly. This paper works through the sample size calculations for clustered data. Importantly, our explicit variance expressions also enable us to optimize the design with respect to the number of clusters and number of subjects; the objective could be either to maximize the power or to minimize the costs with given costs on the clusters and on the individuals. In our approach, units on different levels and treatment groups can have different costs, but the members of the same cluster are assumed to belong to the same treatment group. Design considerations in the health coaching project TERVA are used as motivating examples. R-functions for carrying out the computations presented are provided. (C) 2011 Elsevier B.V. All rights reserved.

Last updated on 2024-26-11 at 21:36