data_check | R Documentation |
Conduct data quality check including checking missingness, variation, correlation and VIF of variables.
data_check(Y, Z, ID)
Y |
a numeric vector indicating the outcome variable. |
Z |
a matrix or data frame representing covariates. |
ID |
a numeric vector representing the provider identifier. |
The function performs the following checks:
Missingness: Checks for any missing values in the dataset and provides a summary of missing data.
Variation: Identifies covariates with zero or near-zero variance which might affect model stability.
Correlation: Analyzes pairwise correlation among covariates and highlights highly correlated pairs.
VIF: Computes the Variable Inflation Factors to identify covariates with potential multicollinearity issues.
If issues arise when using the model functions logis_fe
, linear_fe
and linear_re
,
this function can be called for data quality checking purposes.
No return value, called for side effects.
data(ExampleDataBinary)
outcome = ExampleDataBinary$Y
covar = ExampleDataBinary$Z
ID = ExampleDataBinary$ID
data_check(outcome, covar, ID)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.