simpleREEMdata: Sample Data for RE-EM trees

simpleREEMdataR Documentation

Sample Data for RE-EM trees

Description

This data set is consists of a panel of 50 individuals with 12 observations per individual. The data is based on a regression tree with an initial split based on a dummy variable (D) and a second split based on time in the branch where D=1. The observations include both randomly generated individual-specific effects and observation-specific errors.

Format

The data has 600 rows and 5 columns. The columns are:

  • Y - the target variable

  • t - a numeric predictor ("time")

  • D - a catergorical predictor with two levels, 0 and 1

  • ID - the identifier for each individual

  • X - another covariate (which is intentionally unrelated to the target variable)

References

Sela, Rebecca J., and Simonoff, Jeffrey S., “RE-EM Trees: A Data Mining Approach for Longitudinal and Clustered Data”, Machine Learning (2011).


REEMtree documentation built on Oct. 25, 2023, 1:08 a.m.