Description Usage Arguments Details Value
View source: R/moeda_function.R
The main function of the package that runs Modeling-Oriented Exploratory Data Analysis on your data.
1 |
df |
A dataset you wish to analyze, typically a data.frame or a tibble |
target_var |
Unquoted expression or a scalar character representing the column in the 'df' that is your target (aka dependent) variable for analysis |
n_top_vars |
An integer scalar specifying up to how many top variables you wish to consider for your analysis. By default 4 |
cuts |
An integer scalar telling 'base::cut()' how many equal width buckets to create in the top_vars. By default 4 |
... |
Arguments passed to 'base::cut()' |
Currently, this function performs the following actions:
A random forest is run on the training set with 80 percent of observations
Permutation variable importance is exported from the model and reported to the console together with some additional information
Model's performance is assessed on the test set of the remaining 20 percent of observations and reported to the console
Top variables (their number can be specified with 'n_top_vars' argument) are discretized using equal widths discretization via 'base::cut()'. You can select the number of cuts with the 'cuts' argument.
An upset plot of the intersections from 4. together with target variables is printed.
A 'GGally::ggpairs()' plot of top variables is printed.
The function returns the original 'df' and joins top features columns that were cut together with resulting intersections. These additional columns have 'moedized' in their name.
The original df as a tibble with joined top features columns that were cut together with resulting intersections. These additional columns have 'moedized' in their name.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.