sens.mice: Multivariate Imputation by Chained Equations (Iteration Step...

Description Usage Arguments Details Value Author(s) References See Also

Description

Takes a mids object, and produces a new object of class mids with hypothetical non-ignorability included.

Usage

1
sens.mice(IM, ListMethod=ListMethod, SupPar=SupPar)

Arguments

IM

An object of class mids, typically produces by a previous call to mice() or mice.mids()

ListMethod

is a vector of strings with length ncol(IM[["data"]]), specifying the column(s) in data which has (have) to be imputed with an imputation model accounting for MNAR data. For variables which have to be imputed with a modified imputation model, the method has to be specified as "MyFunc". For variables which do not have to be imputed with a different imputation model, the method has to be be specified as "".

SupPar

is a vector of numbers, specifying the supplementary parameters to be added to the imputation model(s). For logreg and polyreg methods, the supplementary parameters are expressed as odds-ratios (correponding to the excess of risk to present the modality of interest for non responders as compared to responders). The value for the reference need not be specified. For pmm and norm methods, the supplementary parameters are the difference between the expected values in responders and non responders.

Details

This function is based on the mice function's principle, which uses the MICE algorithm. mice allows to generate multiple imputations for incomplete multivariate data by Gibbs sampling. The algorithm imputes an incomplete column (the target column) by generating ‘plausible’ synthetic values given other columns in the data. Each incomplete column must act as a target column, and has its own set of predictors. The default set of predictors for a given target consists of all others columns in the data. For predictors that are incomplete themselves, the most recently generated imputations are used to complete the predictors prior to imputation of the target column.

Built-in elementary imputation methods are: pmm Bayesian linear regression (numeric) norm Predictive mean matching (numeric) norm.nob Linear regression ignoring model error (numeric) mean Unconditional mean imputation (numeric) 2l.norm Two-level normal imputation (numeric) logreg Logistic regression (factor, 2 categories) polyreg Polytomous (unordered) logistic regression (factor, >=2 categories) lda Linear discriminant analysis (factor, >=2 categories) sample Random sample from the observed values (any)

This function enables to impute missing values under the hypothesis of MNAR data, for one or more variable(s). A sensitivity analysis can be performed using the mids object returned by the function.

It goes through 3 steps:

1. Estimate the parameters of the imputation model under ignorable missing data hypothesis using the function mice. The fitted imputation model depends on the type of the variable which contains missing values, i.e. bayesian linear regression for numeric variables, logistic regression for binary variables, polytomous unordered logistic regression for categorical variables.

2. Modify the imputation model according to the explored scenario by specifying supplementary parameters as arguments for the sens.mice function. Without fitting a mixture model, we appeal to its principle as proposed by Rubin. Indeed, the addition of this(these) supplementary parameter(s) allows to specify that the distribution of the variable of interest is different among subjects with missing value and among subjects witouht missing value, conditionnally on all variables included in the imputation model. The direction and the size of this difference is expressed by the supplementary parameter in the imputation model.

3. Impute the missing data using the function sens.mice, resulting into a mids object that contains m newly imputed data sets under MNAR hypothesis. For this step, missing values are imputed with the usual MICE algorithm, using the previously modified imputation model.

The sensitivity analysis can then be performed on the returned mids object, which contains the newly imputed data, according to the assumption that values for the variabl(s) imputed with the sens.mice function come from two different distributions (for responders and incomplete responders).

Built-in elementary imputation methods are:

- pmm: Predictive mean matching (numeric)

- norm: Bayesian linear regression (numeric)

- logreg: Logistic regression (factor, 2 categories)

- polyreg: Polytomous (unordered) regression (factor, >= 2 categories)

For logreg and polyreg methods, the supplementary parameters are expressed as odds-ratios (correponding to the excess of risk to present the modality of interest for non responders as compared to responders). The value for the reference need not be specified. For pmm and norm methods, the supplementary parameters are the difference between the expected values in responders and non responders. The name associated with the value identifies the target column(s) of the imputed data matrix.

Value

Returns an object of class mids (multiply imputed data set) with usual components

Author(s)

Noemie Resseguier, with contributions of Roch Giorgi, David Hajage, Yann De Rycke, Xavier Paoletti and Dave Armstrong

References

Resseguier, N., Giorgi, R. and Paoletti, X. (submitted) How to perform a senstivity analysis exploring the impact of missing not at random data under different sceanrios of non response mechanism with the R software.

Rubin, D.B. Multiple Imputation for Nonresponse in Surveys. New York: John Wiley & Sons, 1987.

van Buuren, S., Groothuis-Oudshoorn, K. MICE: Multivariate Imputation by Chained Equations in R.

See Also

mice, mids


davidaarmstrong/SensMiceDA documentation built on July 2, 2019, 12:38 p.m.