Calculate or plot the amount of missing/imputed values in each variable and the amount of missing/imputed values in certain combinations of variables.
x | a vector, matrix or
delimiter | a character-vector to distinguish between variables and imputation-indices for imputed variables (therefore,
hscale | horizontal scale factor for plot to be embedded in a Tcl/Tk window (see ‘Details’). The default value depends on the number of variables.
vscale | vertical scale factor for the plot to be embedded in a Tcl/Tk window (see ‘Details’). The default value depends on the number of combinations.
TKRpar | a list of graphical parameters to be set for the plot to be embedded in a Tcl/Tk window (see ‘Details’ and
... | for
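The role of delimiter can be illustrated with a small base R sketch. It assumes that imputed data carry one logical index column per imputed variable and that the column name is the variable name followed by a suffix such as "_imp". The toy data frame, the suffix and all object names below are hypothetical, and the code is not taken from VIM.

```r
# Hypothetical imputed data: x_imp and y_imp flag which values were imputed
imputed <- data.frame(
  x = c(1, 2, 3, 4),
  y = c(5, 6, 7, 8),
  x_imp = c(FALSE, TRUE, FALSE, FALSE),
  y_imp = c(TRUE, TRUE, FALSE, FALSE)
)

# Assumed delimiter separating variable names from imputation indices
delimiter <- "_imp"

# Columns whose names contain the delimiter are treated as index columns
is_index <- grepl(delimiter, names(imputed), fixed = TRUE)
variables <- imputed[, !is_index, drop = FALSE]
indices <- imputed[, is_index, drop = FALSE]

# Strip the delimiter so each index column is named after its variable
names(indices) <- sub(delimiter, "", names(indices), fixed = TRUE)

# Number of imputed values in each variable
imputed_per_variable <- vapply(indices, sum, integer(1))
print(imputed_per_variable)
```

Matching on a fixed string keeps the sketch simple. Any real naming scheme should be checked against the data at hand.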
Often it is of interest how many missing/imputed values are contained in each variable. Even more interesting, there may be certain combinations of variables with a high number of missing/imputed values.
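The two quantities described here can be sketched in base R. This is only an illustration of the idea on a made-up data frame, not the implementation used by aggr. Encoding each pattern as a 0/1 string joined by ":" is an assumption made for readability.

```r
# Toy data with missing values (hypothetical, for illustration only)
toy <- data.frame(
  a = c(1, NA, 3, NA, 5),
  b = c(NA, NA, 3, 4, 5),
  c = c(1, 2, 3, 4, 5)
)

# Indicator matrix: TRUE where a value is missing
miss <- is.na(toy)

# Amount of missing values in each variable
per_variable <- colSums(miss)

# Encode each row's missingness pattern as a string such as "1:0:0"
pattern <- apply(miss, 1, function(r) paste(as.integer(r), collapse = ":"))

# Frequency of each combination that actually occurs in the data
pattern_counts <- table(pattern)

print(per_variable)
print(pattern_counts)
```

In this toy data, variables a and b each have two missing values, and the pattern with both a and b missing occurs once.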
If combined is FALSE, two separate plots are drawn for the
missing/imputed values in each variable and the combinations of
missing/imputed and non-missing values. The barplot on the left hand side
shows the amount of missing/imputed values in each variable. In the
aggregation plot on the right hand side, all existing combinations of
missing/imputed and non-missing values in the observations are visualized.
Available, missing and imputed data are color coded as given by col.
Additionally, there are two possibilities to represent the frequencies of
occurrence of the different combinations. The first option is to visualize
the proportions or frequencies by a small bar plot and/or numbers. The
second option is to let the cell heights be given by the frequencies of the
corresponding combinations. Furthermore, variables may be sorted by the
number of missing/imputed values and combinations by the frequency of
occurrence, which makes the structure of missing/imputed values easier to
detect.
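A call along the following lines might exercise these options. The argument names numbers, prop, sortVars and sortCombs are given as I understand the plot method for "aggr" objects and are not confirmed by this page, so check them against the installed version of VIM. The sleep data set is assumed to ship with VIM. The call is guarded so that it is skipped if the package is not available.

```r
if (requireNamespace("VIM", quietly = TRUE)) {
  # Example data with missing values, assumed to be provided by VIM
  data(sleep, package = "VIM")

  # Two separate plots: barplot per variable and aggregation plot.
  # numbers/prop: show proportions of the combinations as numbers (assumed names)
  # sortVars/sortCombs: sort variables and combinations (assumed names)
  a <- VIM::aggr(sleep, combined = FALSE, numbers = TRUE, prop = TRUE,
                 sortVars = TRUE, sortCombs = TRUE)
}
```

Sorting is mainly a reading aid: with many variables, it tends to push the dominant missingness patterns to one end of the plot.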
If combined is TRUE, a small version of the barplot showing
the amount of missing/imputed values in each variable is drawn on top of the
aggregation plot.
The graphical parameter oma will be set unless supplied as an
argument.
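As a minimal sketch, the combined layout could be requested as follows, assuming the sleep data set from VIM is available. The call is skipped if VIM is not installed.

```r
if (requireNamespace("VIM", quietly = TRUE)) {
  # Example data with missing values, assumed to be provided by VIM
  data(sleep, package = "VIM")

  # Single plot: small barplot drawn on top of the aggregation plot
  a_combined <- VIM::aggr(sleep, combined = TRUE)
}
```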
TKRaggr behaves like plot.aggr, but uses tkrplot to embed the plot in a
Tcl/Tk window. This is useful if the number of variables and/or combinations
is large, because scrollbars allow moving from one part of the plot to
another.
For aggr, a list of class "aggr" containing the following components:
x             the data used.
combinations  a character vector representing the combinations of variables.
count         the frequencies of these combinations.
percent       the percentage of these combinations.
missings      a data.frame containing the amount of missing/imputed values in each variable.
tabcomb       the indicator matrix for the combinations of variables.
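The components listed above can be inspected directly on the returned object. The sketch below assumes the sleep data set from VIM and uses plot = FALSE to compute the aggregation without drawing. It is skipped if VIM is not installed.

```r
if (requireNamespace("VIM", quietly = TRUE)) {
  # Example data with missing values, assumed to be provided by VIM
  data(sleep, package = "VIM")

  # Compute the aggregation only, without plotting
  a <- VIM::aggr(sleep, plot = FALSE)

  # Amount of missing values in each variable
  print(a$missings)

  # Combinations with their frequencies and percentages
  print(data.frame(combination = a$combinations,
                   count = a$count,
                   percent = a$percent))

  # Indicator matrix for the combinations of variables
  print(a$tabcomb)
}
```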
Some of the argument names and positions have changed with version 1.3
due to extended functionality and for more consistency with other plot
functions in VIM. For backward compatibility, the arguments labs and
names.arg can still be supplied to ... and are handled correctly.
Nevertheless, they are deprecated and no longer documented. Use ylabs and
labels instead.
Andreas Alfons, Matthias Templ, modifications for displaying imputed values by Bernd Prantner
M. Templ, A. Alfons, P. Filzmoser (2012) Exploring incomplete data using visualization tools. Journal of Advances in Data Analysis and Classification, Online first. DOI: 10.1007/s11634-011-0102-y.
A. Kowarik, M. Templ (2016) Imputation with R package VIM. Journal of Statistical Software, 74(7), 1-16