Description Usage Arguments Value
separates a data frame into two data frames for training and testing in ML classification.
1 | get_training_sample(df, class_category, factor.training, grep_rowname = NULL)
|
df |
data frame containing a whole data set. |
class_category |
boolean column name used for classification. |
factor.training |
percentage of data used for training. It takes the portion factor.training from the smallest class and a equivalent number of elements from the biggest class. |
grep_rowname |
string containing a regular expression that limits the elements used for the training by choosing matching elements at rowname. |
list with two data frames: trainset and testset
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.