View source: R/tidyDiscreteContinuousMI.R
calculateDiscreteContinuousMI | R Documentation |
This is specifically designed to supprt tidy data where there are many features, with associated values and outcomes in different columns of a dataframe or database table
calculateDiscreteContinuousMI( df, discreteVars, continuousVar, method = "KWindow", ... )
df |
- may be grouped, in which case the value is interpreted as different types of continuous variable |
discreteVars |
- the column(s) of the categorical value (X) quoted by vars(...) |
continuousVar |
- the column of the continuous value (Y) |
method |
- the method employed - valid options are "KWindow","KNN","Discretise","Grassberger","Compression","Entropy","Quantile","PDF","SGolay","Kernel" |
... |
- the other parameters are passed onto the implementations |
N.B. this result is the mutual information between feature value and outcome GIVEN that the feature is present. It does not account for missing values.
a dataframe containing the disctinct values of the groups of df, and for each group a mutual information column (I). If df was not grouped this will be a single entry
observations %>% group_by(feature) %>% calculateDiscreteContinuousMI(vars(outcome), value)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.