autoMAR: Function to identify and generate the Missing at Random...

Description Usage Arguments Value Examples

View source: R/autoMAR.R

Description

This function will automatically identify the missing patterns and flag the variables if they are not missing at random based on the AUC method

Usage

1
2
3
4
5
6
7
autoMAR(
  data,
  aucv = 0.9,
  strataname = NULL,
  stratasize = NULL,
  mar_method = "glm"
)

Arguments

data

[data.frame | Required] dataframe or data.table

aucv

[integer | Optional] auc cut-off value for the not missing at random variable selection

strataname

[text | Optional] vector of stratification variables

stratasize

[integer | Optional] vector of stratum sample sizes (in the order in which the strata are given in the input dataset).

mar_method

[text | Optional] missing at random classification method ("glm", "rf"). Default GLM is used (GLM runs faster for high dimensional data)

Value

list output including missing variable summary and number of MAR flag variables

Examples

1
2
# create missing at random features
marobj <- autoMAR (heart, aucv = 0.9, strataname = NULL, stratasize = NULL, mar_method = "glm")

daya6489/DriveML documentation built on July 22, 2021, 4:21 a.m.