Home

/

CRAN

/

midastouch

/

mice.impute.midastouch: Predictive Mean Matching with distance aided selection of...

mice.impute.midastouch: Predictive Mean Matching with distance aided selection of...
In midastouch: Multiple Imputation by Distance Aided Donor Selection

Description Usage Arguments Details Value Author(s) References Examples

View source: R/mice.impute.midastouch.R

Imputes univariate missing data using predictive mean matching

1 2	mice.impute.midastouch(y, ry, x, ridge = 1e-05, midas.kappa = NULL, outout = TRUE, neff = NULL, debug = NULL, ...)

`y`	Numeric vector with incomplete data
`ry`	Response pattern of `y` (`TRUE`=observed, `FALSE`=missing)
`x`	Design matrix with `length(y)` rows and `p` columns containing complete covariates.
`ridge`	The ridge penalty applied to prevent problems with multicollinearity. The default is `ridge = 1e-05`, which means that 0.001 percent of the diagonal is added to the cross-product. Larger ridges may result in more biased estimates. For highly noisy data (e.g. many junk variables), set `ridge = 1e-06` or even lower to reduce bias. For highly collinear data, set `ridge = 1e-04` or higher.
`midas.kappa`	Scalar. If `NULL` (default) then the optimal `kappa` gets selected automatically. Alternatively, the user may specify a scalar. Siddique and Belin 2008 find `midas.kappa = 3` to be sensible.
`outout`	Logical. If `TRUE` (default) one model is estimated for each donor (leave-one-out principle). For speedup choose `outout = FALSE`, which estimates one model for all observations leading to in-sample predictions for the donors and out-of-sample predictions for the recipients. Mind the inappropriateness, though.
`neff`	FOR EXPERTS. Null or character string. The name of an existing environment in which the effective sample size of the donors for each loop (CE iterations times multiple imputations) is supposed to be written. The effective sample size is necessary to compute the correction for the total variance as originally suggested by Parzen, Lipsitz and Fitzmaurice 2005. The objectname is `midastouch.neff`.
`debug`	FOR EXPERTS. Null or character string. The name of an existing environment in which the input is supposed to be written. The objectname is `midastouch.inputlist`.
`...`	Other named arguments.

Imputation of y by predictive mean matching, based on Rubin (1987, p. 168, formulas a and b) and Siddique and Belin 2008. The procedure is as follows:

Draw a bootstrap sample from the donor pool.
Estimate a beta matrix on the bootstrap sample by the leave one out principle.
Compute type II predicted values for yobs (nobs x 1) and ymis (nmis x nobs).
Calculate the distance between all yobs and the corresponding ymis.
Convert the distances in drawing probabilities.
For each recipient draw a donor from the entire pool while considering the probabilities from the model.
Take its observed value in y as the imputation.

Numeric vector of length sum(!ry) with imputations

Philipp Gaffert, Florian Meinfelder, Volker Bosch 2015

Gaffert, P., Meinfelder, F., Bosch V. (2015) Towards an MI-proper Predictive Mean Matching, Discussion Paper. https://www.uni-bamberg.de/fileadmin/uni/fakultaeten/sowi_lehrstuehle/statistik/Personen/Dateien_Florian/properPMM.pdf

Little, R.J.A. (1988), Missing data adjustments in large surveys (with discussion), Journal of Business Economics and Statistics, 6, 287–301.

Parzen, M., Lipsitz, S. R., Fitzmaurice, G. M. (2005), A note on reducing the bias of the approximate bayesian bootstrap imputation variance estimator. Biometrika 92, 4, 971–974.

Rubin, D.B. (1987), Multiple imputation for nonresponse in surveys. New York: Wiley.

Siddique, J., Belin, T.R. (2008), Multiple imputation using an iterative hot-deck with distance-based donor selection. Statistics in medicine, 27, 1, 83–102

Van Buuren, S., Brand, J.P.L., Groothuis-Oudshoorn C.G.M., Rubin, D.B. (2006), Fully conditional specification in multivariate imputation. Journal of Statistical Computation and Simulation, 76, 12, 1049–1064.

Van Buuren, S., Groothuis-Oudshoorn, K. (2011), mice: Multivariate Imputation by Chained Equations in R. Journal of Statistical Software, 45, 3, 1–67. http://www.jstatsoft.org/v45/i03/

## from R:: mice, slightly adapted ##

# do default multiple imputation on a numeric matrix
library(midastouch)
library(mice)
imp <- mice(nhanes, method = 'midastouch')
imp

# list the actual imputations for BMI
imp$imp$bmi

# first completed data matrix
complete(imp)


# imputation on mixed data with a different method per column

mice(nhanes2, method = c('sample','midastouch','logreg','norm'))

Loading required package: lattice

Attaching package: 'mice'

The following object is masked from 'package:midastouch':

    mice.impute.midastouch

The following objects are masked from 'package:base':

    cbind, rbind


 iter imp variable
  1   1  bmi  hyp  chl
  1   2  bmi  hyp  chl
  1   3  bmi  hyp  chl
  1   4  bmi  hyp  chl
  1   5  bmi  hyp  chl
  2   1  bmi  hyp  chl
  2   2  bmi  hyp  chl
  2   3  bmi  hyp  chl
  2   4  bmi  hyp  chl
  2   5  bmi  hyp  chl
  3   1  bmi  hyp  chl
  3   2  bmi  hyp  chl
  3   3  bmi  hyp  chl
  3   4  bmi  hyp  chl
  3   5  bmi  hyp  chl
  4   1  bmi  hyp  chl
  4   2  bmi  hyp  chl
  4   3  bmi  hyp  chl
  4   4  bmi  hyp  chl
  4   5  bmi  hyp  chl
  5   1  bmi  hyp  chl
  5   2  bmi  hyp  chl
  5   3  bmi  hyp  chl
  5   4  bmi  hyp  chl
  5   5  bmi  hyp  chl
Class: mids
Number of multiple imputations:  5 
Imputation methods:
         age          bmi          hyp          chl 
          "" "midastouch" "midastouch" "midastouch" 
PredictorMatrix:
    age bmi hyp chl
age   0   1   1   1
bmi   1   0   1   1
hyp   1   1   0   1
chl   1   1   1   0
      1    2    3    4    5
1  30.1 29.6 35.3 21.7 35.3
3  30.1 29.6 35.3 30.1 33.2
4  27.2 24.9 25.5 25.5 27.4
6  22.7 25.5 25.5 25.5 24.9
10 28.7 22.5 26.3 30.1 27.4
11 20.4 29.6 35.3 21.7 33.2
12 30.1 28.7 28.7 26.3 27.4
16 30.1 29.6 22.0 22.7 29.6
21 27.5 27.5 28.7 30.1 35.3
   age  bmi hyp chl
1    1 30.1   1 187
2    2 22.7   1 187
3    1 30.1   1 187
4    3 27.2   2 284
5    1 20.4   1 113
6    3 22.7   1 184
7    1 22.5   1 118
8    1 30.1   1 187
9    2 22.0   1 238
10   2 28.7   2 204
11   1 20.4   1 113
12   2 30.1   2 204
13   3 21.7   1 206
14   2 28.7   2 204
15   1 29.6   1 187
16   1 30.1   1 187
17   3 27.2   2 284
18   2 26.3   2 199
19   1 35.3   1 218
20   3 25.5   2 184
21   1 27.5   1 131
22   1 33.2   1 229
23   1 27.5   1 131
24   3 24.9   1 204
25   2 27.4   1 186

 iter imp variable
  1   1  bmi  hyp  chl
  1   2  bmi  hyp  chl
  1   3  bmi  hyp  chl
  1   4  bmi  hyp  chl
  1   5  bmi  hyp  chl
  2   1  bmi  hyp  chl
  2   2  bmi  hyp  chl
  2   3  bmi  hyp  chl
  2   4  bmi  hyp  chl
  2   5  bmi  hyp  chl
  3   1  bmi  hyp  chl
  3   2  bmi  hyp  chl
  3   3  bmi  hyp  chl
  3   4  bmi  hyp  chl
  3   5  bmi  hyp  chl
  4   1  bmi  hyp  chl
  4   2  bmi  hyp  chl
  4   3  bmi  hyp  chl
  4   4  bmi  hyp  chl
  4   5  bmi  hyp  chl
  5   1  bmi  hyp  chl
  5   2  bmi  hyp  chl
  5   3  bmi  hyp  chl
  5   4  bmi  hyp  chl
  5   5  bmi  hyp  chl
Class: mids
Number of multiple imputations:  5 
Imputation methods:
         age          bmi          hyp          chl 
          "" "midastouch"     "logreg"       "norm" 
PredictorMatrix:
    age bmi hyp chl
age   0   1   1   1
bmi   1   0   1   1
hyp   1   1   0   1
chl   1   1   1   0

midastouch documentation built on May 2, 2019, 4:05 p.m.

midastouch index

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

midastouch
Multiple Imputation by Distance Aided Donor Selection

mice.impute.midastouch: Predictive Mean Matching with distance aided selection of...
In midastouch: Multiple Imputation by Distance Aided Donor Selection

Description

Usage

Arguments

Details

Value

Author(s)

References

Examples

Example output

Related to mice.impute.midastouch in midastouch...

R Package Documentation

Browse R Packages

We want your feedback!

midastouch Multiple Imputation by Distance Aided Donor Selection

mice.impute.midastouch: Predictive Mean Matching with distance aided selection of... In midastouch: Multiple Imputation by Distance Aided Donor Selection

Description

Usage

Arguments

Details

Value

Author(s)

References

Examples

Example output

Related to mice.impute.midastouch in midastouch...

R Package Documentation

Browse R Packages

We want your feedback!

midastouch
Multiple Imputation by Distance Aided Donor Selection

mice.impute.midastouch: Predictive Mean Matching with distance aided selection of...
In midastouch: Multiple Imputation by Distance Aided Donor Selection