Data Imputation

Share:

Description

The fill.missing function uses the transcan function from the Hmisc package to impute values for the given data.frame.

Usage

1
fill.missing(x, seed = 101, simplify = TRUE, idcol = "id", ...)

Arguments

x

A data.frame object. It should have missing values.

seed

Seed provided for random-number generation. Default value of 101.

simplify

logical: whether to remove duplicate missingness columns.

idcol

An integer value or character string. Indicates the column containing IDs, specified as column index or column name. Defaults to "id", or NA, when not found.

...

Additional arguments, potentially passed to transcan.

Details

The fill.missing function will fill the missing values within a data.frame with the values imputed with the transcan function. An idcol may be specified to prevent including the use of IDs in the imputation. In addition for every column that contains missing data, a new column will be attached to the data.frame containing an indicator of missingness. A "1" indicates that the value was missing and has been imputed.

Value

data.frame with imputed values

Author(s)

Cole Beck

See Also

transcan

Examples

1
2
3
4
set.seed(1)
df <- data.frame(id=LETTERS[1:25], val1=rnorm(25), val2=rnorm(25))
df[sample(seq_len(nrow(df)), ceiling(nrow(df)*0.1)), 2] <- NA
df <- fill.missing(df)