important_variables: Extract k most important variables in a random forest

View source: R/measure_importance.R

Description

Get the names of the k variables with the highest sum of rankings across the specified importance measures.
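The idea of a "sum of rankings" can be sketched in base R. This is only an illustration, not the package's implementation: the data frame `toy`, its column names `measure_1` and `measure_2`, and the convention that a higher rank means a more important variable are all assumptions made for the example.

```r
# Toy importance table; the variables and values are made up for illustration.
toy <- data.frame(
  variable  = c("a", "b", "c", "d"),
  measure_1 = c(0.9, 0.4, 0.7, 0.1),  # hypothetical measure, higher = more important
  measure_2 = c(40, 10, 30, 20),      # hypothetical measure, higher = more important
  stringsAsFactors = FALSE
)

# Rank each measure separately: the least important variable gets rank 1.
ranks <- sapply(toy[, c("measure_1", "measure_2")], rank)

# Sum the ranks across measures and keep the k variables with the highest sums.
toy$rank_sum <- rowSums(ranks)
k <- 2
top_k <- toy$variable[order(toy$rank_sum, decreasing = TRUE)][seq_len(k)]
print(top_k)
```

Here no two variables share the k-th highest rank sum, so exactly k names come back. When they do tie, the ties_action argument decides which of them are returned.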

Usage

important_variables(
  importance_frame,
  k = 15,
  measures = names(importance_frame)[2:min(5, ncol(importance_frame))],
  ties_action = "all"
)

Arguments

importance_frame

The result of applying measure_importance() to a random forest, or a randomForest object itself

k

The number of variables to extract

measures

A character vector specifying the measures of importance to be used

ties_action

One of "none", "all" or "draw"; specifies which variables to pick when ties occur. With "none" we may get fewer than k variables, with "all" we may get more than k, and with "draw" we get exactly k.
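The three tie-handling options can be illustrated with a small base R sketch. It is a hedged illustration of the behaviour described above, not the package's own code: the vector `rank_sum`, the helper `pick()`, and the use of `sample()` for "draw" are assumptions made for the example.

```r
# Hypothetical rank sums (higher = more important); "b" and "c" tie at the cut-off.
rank_sum <- c(a = 9, b = 6, c = 6, d = 2)
k <- 2

sorted <- sort(rank_sum, decreasing = TRUE)
cutoff <- sorted[[k]]                     # rank sum of the k-th variable

above <- names(sorted)[sorted > cutoff]   # strictly better than the cut-off
tied  <- names(sorted)[sorted == cutoff]  # variables sharing the cut-off value

# Hypothetical helper mimicking the three options.
pick <- function(ties_action) {
  # No tie to resolve: the cut-off group fits exactly into k.
  if (length(above) + length(tied) == k) return(c(above, tied))
  switch(ties_action,
    none = above,                                     # drop all tied variables
    all  = c(above, tied),                            # keep all tied variables
    draw = c(above, sample(tied, k - length(above)))  # fill up to k at random
  )
}

print(pick("none"))
print(pick("all"))
print(pick("draw"))
```

With k = 2, "none" returns a single name, "all" returns three, and "draw" returns two, with the second chosen at random between the tied variables.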

Value

A character vector with the names of the k variables with the highest sum of rankings; depending on ties_action, it may contain fewer or more than k names

Examples

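library(randomForestExplainer)
# Fit a forest with local importance enabled, then extract the 2 top-ranked variables
forest <- randomForest::randomForest(Species ~ ., data = iris, localImp = TRUE, ntree = 300)
important_variables(measure_importance(forest), k = 2)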

randomForestExplainer documentation built on July 12, 2020, 1:06 a.m.