Modelling the cluster randomized design

The experiment is assumed to have measured responses from individuals in two groups (intervention group and control group). The individuals have been assigned to the one of the two groups based on cluster randomization.

Let $X_{ijl}$ be the l'th response in the j'th cluster of the i'th group. We assume that

$X_{ijl} = \mu_i + \beta_{ij} + \epsilon_{ijl}$

where the group effect $\mu_i$ is a fixed effect, and the cluster effect $\beta_{ij}$ is a random effect drawn from a normal distribution with mean 0 and standard devition $\sigma_c$. Likewise we assume that the noise $\epsilon_{ijl}$ is drawn from a normal distribution with mean 0 and standard deviation $\sigma_e$. At last, we assume independence of all the cluster effects and noise terms. Instead of parametrizing the variance components in the model with $\sigma_c^2$ and $\sigma_e^2$, we will often instead speak of the overall variance $\sigma^2 = \sigma_c^2 + \sigma_e^2$ and the intracluster correlation coefficient $\rho = \frac{\sigma_c^2}{\sigma_c^2 + \sigma_e^2}$.

Some notation:

Construction of the t test for cluster randomized data

We can test the null hypothesis

$H_0: \mu_1 - \mu_2 = 0$

with a t-test that has been adjusted to work with the cluster randomized design. This approach and all the following formulas are taken directly from (1), page 111-115.

We construct the cluster-adjusted t-statistic

$\tilde{t} = \frac{\overline{X}_1 - \overline{X}_2}{\widehat{SE}(\overline{X}_1 - \overline{X}_2)}$

where $\overline{X}_1$ is the mean of the intervention group, $\overline{X}_2$ is the mean of the control group, and $\widehat{SE}(\overline{X}_1 - \overline{X}_2)$ is the estimated standard deviation of $\overline{X}_1 - \overline{X}_2$. This estimate is adjusted according to the given cluster randomized design:

$\widehat{SE}(\overline{X}_1 - \overline{X}_2) = S_p\left[\frac{C_1}{M_1} + \frac{C_2}{M_2}\right]^{\frac{1}{2}}$

where $S_p$ is the usual pooled standard deviation over the two groups and

$C_i = 1 + (\overline{m}_{Ai}-1)\widehat{\rho}$

where $\widehat{\rho}$ is the estimate of the intracluster correlation coefficient and

$\overline{m}{Ai} = \sum{j=1}^{k_i} \frac{m_{ij}^2}{M_i}$

We get the estimate $\widehat{\rho}$ by first using the mean squared error within the clusters (MSW) and the mean squared error between the clusters (MSC) to get the estimates for $\sigma_c$ and $\sigma_e$. That is:

Then we estimate $\widehat{\rho}$ by:

$\widehat{\rho} = \frac{\widehat{\sigma_c^2}}{\widehat{\sigma_c^2} + \widehat{\sigma_e^2}}$

The adjusted t statistic $\tilde{t}$ will have a t-distrubtion with $K-2$ degrees of freedom. Since we know the distribution of the t statistic, we can now use a standard significance test to test the null hypothesis that the means of the groups are equal. The function t_test_clustered_stat calculates the value of $\tilde{t}$ for a given data set, and the function t_test_clustered_pval calculates the p-value of a significance test with $\tilde{t}$ on a given data set.

Construction of the power calculation for the adjusted t test

When we test the null hypothesis $H_0: \mu_1 - \mu_2 = 0$, we contruct the cluster-adjusted t-statistic $\tilde{t}$ and use this in a standard significance test with the t-distribution with $K-2$ degrees of freedom and a significance level $\alpha$. Call the critical region of the test $\mathcal{C}$. Given the truth of the alternative hypothesis

$\mu_1 - \mu_2 = \delta$

we now want to calculate the power $\mathcal{P}$ of the test. This is given as

$\mathcal{P} = \text{P}(\tilde{t} \in \mathcal{C})$

To calculate this quantity, we need to know the distribution of $\tilde{t}$ under the alternative hypothesis. If the design is balanced, then we get from (2) that $\tilde{t}$ follows a non-central t distribution with $K-2$ degrees of freedom and non-centrality-parameter

$\eta = \frac{|\delta|}{\sigma \left( \frac{4(1+(m-1)\rho)}{M} \right)^{\frac{1}{2}}}$

where $m$ is the number of individuals in each cluster. The design is balanced, if the number of clusters are the same in each group and the number of individuals are the same in each cluster across both groups.

If the design is unbalanced, we do not know the exact distribution of $\tilde{t}$. In this case, (1) page 57 suggests (in a related case) that we can try calculating the power using the non-central t distribution with $\overline{K}-2$ degrees of freedom and the non-centrality-parameter

$\eta = \frac{|\delta|}{\sigma \left( \frac{4(1+(\overline{m}-1)\rho)}{M} \right)^{\frac{1}{2}}}$

where $\overline{K}$ is the average of the number of clusters per group, and $\overline{m}$ is the average cluster size across both groups. Although using $\overline{K}$ degrees of freedom is conceptually dubious, this approach is implemented in power.t.test.clustered.

Instead of trying to calculate the exact power of a significance test with the statistic $\tilde{t}$, we can also try to determine it by simulation. That is, we can simulate data from the model $X_{ijl} = \mu_i + \beta_{ij} + \epsilon_{ijl}$ described above, and then we can use the test statistic $\tilde{t}$ to make a significance on the data and see if the null hypothesis is rejected or not. We can repeat this procedure n times, and the simulated power will then be the number times the null hypothesis was rejected out of the total number of tests. The function simulate_power_t_test_clustered implements this way of simulating the power.

Since $\tilde{t}$ works in both balanced and unbalanced designs, the power simulation works in both cases. Therefore, it can also be used to test how well power.t.test.clustered works for unbalanced designs. This kind of test is implemented in the program testing/Testprogram.R, which uses the function test_calcpower_against_simpower, which can be found in the file R/testing_power_calc_vs_power_sim.R. The tests suggest that power.t.test.clustered also works quite well for unbalanced designs, at least if the design is not too unbalanced. The way the testing has been done can be inspected further in the mentioned files, and the test results can be inspected further in the file data/testresults.RData.

References

(1) Allan Donner & Neil Klar: Design and Analysis of Cluster Randomization Trials in Health Research, 2000. (2) Allan Donner & Neil Klar: Statistical Considerations in the Design and Analysis of Community Intervention Trials, Journal of Clinical Epidemiology, vol. 49, no. 4, pp. 435-439, 1996.



AsgerAndersen/t.test.clustered documentation built on May 5, 2019, 8:12 a.m.