Description Usage Format Source Examples
Simulated hierarchical dataset containing 1000 independent communities, each (community j) containing n_j (non-fixed)
number of individuals where n_j is drawn from a normal with mean 50 and standard deviation 10 and round to the nearest
integer. Each row (observation) includes 2 measured community-level baseline covariates (E1, E2), 3 dependent
individual-level baseline covariates (W1, W2, W3), 1 dependent bianry exposure (A) and 1 dependent binary outcoem
(Y), along with one unique community identifier (id). The community-level baseline covariates (E1, E2)
were sampled as i.i.d across all communities, while the individual-level baseline covariates (W1, W2, W3) for each
individual i within communty j was generated conditionally on the values of j's community-level baseline
covariates (E1[j], E2[j]). Then the community-level exposure (A) for each community j was sampled
conditionally on the value of j's community-level baseline covariates (E1[j], E2[j]), together with all
invididuals' baseline covariates (W1[i], W2[i], W3[i]) within community j where i=1,..,n_j. Similary,
the individual-level binary outcome Y for each individual i within communty j was sampled conditionally
covariates and exposure (E1[j], E2[j], A[j]), as well as the value of individual i's baseline covariates
on the value of community j's baseline (W1[i], W2[i], W3[i]). The following section provides more
details regarding individual variables in simulated data.
1 |
A data frame with 1000 independent communities, each containing around 50 individuals (in total 50,457 observations (rows)), and 8 variables (columns):
integer (unique) community identifier from 1 to 1000, identical within the same community
continuous uniform community-level baseline covariate with min=0 and max=1 (independent and identical
across all individuals in the same community)
discrete uniform community-level baseline covariate with 5 elements (0, 0.2, 0.4, 0.8, 1) (independent and identical across all individuals in the same community)
binary individual-level baseline covariate that depends on the values of community-level baseline covaries (E1,E2)
continuous individual-level baseline covariate, together with W3, are drawn from a bivariate normal distribution
with correlation 0.6, depending on the values of community's baseline covaries (E1, E2)
continuous normal individual-level baseline covariate, correlated with W2, see details in above
binary exposure that depends on community's baseline covariate values in (E1, E2), and the mean of all individuals'
baseline covariates W1 within the same community
binary outcome that depends on community's baseline covariate and exposure values in (E1, E2, A),
and all individuals' baseline covariate values in (W2, W3)
https://github.com/chizhangucb/tmleCommunity/blob/master/tests/dataGeneration/get.cluster.dat.Abin.R
1 2 3 4 5 6 | data(comSample.wmT.bA.bY_list)
comSample.wmT.bA.bY <- comSample.wmT.bA.bY_list$comSample.wmT.bA.bY
head(comSample.wmT.bA.bY)
comSample.wmT.bA.bY_list$psi0.Y # 0.103716, True ATE
# summarize the number of individuals within each community
head(table(comSample.wmT.bA.bY$id))
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.