simdata: Simulated Data with Environment Dependent Correlations

Description Usage Format Note Source References Examples

Description

A dataset containing simulated data for example use of the eclust package functions. This data was generated using the s_modules and s_generate_data

Usage

1

Format

A matrix with 100 rows and 502 variables:

Y

continuous response vector

E

binary environment variable for ECLUST method. E = 0 for unexposed (n=50) and E = 1 for exposed (n=50)

columns 3:502

gene expression data for 1000 genes. column names are the gene names

Note

Code used to generate this data can be found on the GitHub page for this package. See URL below.

Source

https://raw.githubusercontent.com/sahirbhatnagar/eclust/master/data-raw/simulated-data-processing.R

References

Bhatnagar, SR., Yang, Y., Blanchette, M., Bouchard, L., Khundrakpam, B., Evans, A., Greenwood, CMT. (2016+). An analytic approach for interpretable predictive models in high dimensional data, in the presence of interactions with exposures Preprint

Examples

1
2
simdata[1:5, 1:10]
table(simdata[,"E"])

sahirbhatnagar/eclust documentation built on May 29, 2019, 12:58 p.m.