Substitutes the native or in previously generated imputation points in the data set with the help of classical statistics or predictions from multivariate machine learning models.
Selects a feature for univariate graphical analysis.
Sets the model for substitution imputation sites.
Sets the number of neighbors to be analyzed by the knn-model based substitution method.
Sets the number of predictors given to the mice package.
Sets a threshold value for the outflow variable used in the mice package. According to Stef van Buuren's Flexible Imputation of Missing Data, the outflux of a feature quantifies how well its observed data connect to the missing data on other variables.
Sets the seed for the random number generator in order to guarantee reproducible substitution results.
Sets the number of times the substitution analysis is run. From all possible results, the best seed is kept.
Performs the substitution of imputation sites based on the selected model.
Resets the graphical user interface to the parameters of the last model used for imputation site substitution.
Overview graph of the frequency of imputation sites in the dataset. Left: Fraction of imputation sites per feature. Right: Heatmap listing recurring patterns in the occurrence of imputation sites in the instances. (The analysis is performed using the VIM package.)
Influx vs. Outflux plot. The influx of a variable quantifies how well its missing data connect to the observed data on other variables. The outflux of a variable quantifies how well its observed data connect to the missing data on other variables. See Stef van Buuren's Flexible Imputation of Missing Data for detailed information. (The plot is created using the mice package.)
A univariate graphical analysis of the selected feature. The analysis requires the selection of a numerical feature. A boxplot is shown with the individual values superimposed as a point cloud. Imputation sites are color coded. The corresponding distribution of values is shown as a bar chart.
Table that provides detailed univariate information on the imputation site values of the selected feature.
A table summarizing the results of the imputation site distribution. For each feature, the absolute number of imputation sites is displayed. In addition, the number of trusted instances is listed, as well as the fraction of imputation sites.
Tabular information about recurring patterns in the occurrence of imputation sites in the instances.
Tabular information of all instances containing imputation sites.
The transformed and normalized data set after substitution of imputation sites.
Any scripts or data that you put into this service are public.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.