Takes a data set and a seed, and returns the data with its observations (rows) in random order
data: a matrix, data.frame or data.table
seed: an integer value used as the random seed
Some modeling algorithms take the top p percent of the observations as the training set, which can lead to skewed predictions when the data are stored in a non-random order. This function addresses that problem by randomly ordering the observations, so that the response variable has roughly the same distribution in the training portion even if the algorithm does not pick training observations randomly.
data of the same class as the input, with randomly ordered observations
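The behavior described above can be sketched in a few lines of base R. This is only an illustration under stated assumptions, not the package's actual implementation: the name `randomize_rows` is hypothetical, and the sketch is exercised only on data.frame and matrix inputs (data.table inputs are not covered here).

```r
# Hypothetical helper: the name and body are assumptions for illustration.
randomize_rows <- function(data, seed) {
  set.seed(seed)                 # fix the RNG state so the ordering is reproducible
  idx <- sample(nrow(data))      # random permutation of the row indices
  data[idx, , drop = FALSE]      # reorder rows; drop = FALSE keeps the input class
}

# Example: a data.frame whose response is sorted, so the top half is all "a"
df <- data.frame(x = 1:10, y = rep(c("a", "b"), each = 5))
shuffled <- randomize_rows(df, seed = 42)

table(head(df$y, 5))        # response counts in the first five rows before shuffling
table(head(shuffled$y, 5))  # response counts in the first five rows after shuffling
```

Calling the helper again with the same seed returns the same ordering, which is the reason for taking a seed as an argument rather than relying on the current RNG state.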