View source: R/df_get_psi_score.R
df_get_psi_score | R Documentation |
This function takes two Spark DataFrames as input. One should contain the features with their expected values. The other should contain the features with their actual values. Example... if we're comparing Oct '18 to Nov '18 features, Oct '18 would be expected and Nov '18 would be actual.
df_get_psi_score(expected_, actual_, features_)
expected_ |
Required: A matrix containing features with the expected (old) data. |
actual_ |
Required: A matrix containing features from with the actual (new) data. |
features_ |
Optional: A vector of the feature names to validate. Note, the feature names must exist in both sdf_expected and sdf_actual and be of the same data type in each data frame. If not features are provided, all features in sdf_expected will be used. |
NOTE: This function currently only supports NUMERIC and/or CHARACTER data types. If you have other types of data, please filter them out before passing to the function.
A matrix containing the feature name, bin, min value, max value, expected count, expected
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.