Description Usage Arguments Value
Run_RF_Regression_Pipeline
1 2 3 4 5 6 7 8 | Run_RF_Regression_Pipeline(
feature_table,
actual,
sampling = NULL,
repeats,
path,
list_of_seeds
)
|
feature_table |
A feature table containing the samples (rows) and the features (columns) to run random forest regression on. Note that this table should not include the value that is trying to be predicted |
actual |
A vector containing the actual values for the value that this trying to be predectied. |
sampling |
The sampling technique to use during cross validation. Defaults to NULL. |
repeats |
The number of data splits that should occur between testing data and cross validation data. |
path |
The path that the output should be saved to. |
list_of_seeds |
A list of seeds equal to the length of repeats that is used for each random data split. |
Returns a list containing the following: "Object[[1]] contains all the median cross validation RMSE from each data split using the best mtry value" "Object[[2]] contains all the test RMSE values from each data split" "Object[[3]] contains all the tested mtry values and the median RMSE from each from each data split" "Object[[4]] contains the list of important features from the best model selected from each data split" "Object[[5]] contains each caret random forest model from each data split" "This function will also write a csv with cross validation RMSE and test RMSE, to the given path as well as an RDS file that contains the resulting object from this function"
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.