SplitUplift: Split data with respect to uplift distribution
In tools4uplift: Tools for Uplift Modeling

Split a dataset into training and validation subsets with respect to the uplift sample distribution.

1	SplitUplift(data, p, group)

`data`	a data frame of interest that contains at least the response and the treatment variables.
`p`	The desired sample size. p is a value between 0 and 1 expressed as a decimal, it is set to be proportional to the number of observations per group.
`group`	Your grouping variables. Generally, for uplift modelling, this should be a vector of treatment and response variables names, e.g. c("treat", "y").

`train`	a training data frame of p percent
`valid`	a validation data frame of 1-p percent

Mouloud Belbahri

Belbahri, M., Murua, A., Gandouet, O., and Partovi Nia, V. (2019) Uplift Regression, <https://dms.umontreal.ca/~murua/research/UpliftRegression.pdf>

library(tools4uplift)
data("SimUplift")

split <- SplitUplift(SimUplift, 0.8, c("treat", "y"))
train <- split[[1]]
valid <- split[[2]]

tools4uplift documentation built on Jan. 6, 2021, 5:09 p.m.

tools4uplift index

rdrr.io home R language documentation Run R code online

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

Description