View source: R/impute_mixedgc.R
impute_LRGC | R Documentation |
Fit a low rank Gaussian copula model from (continuous and ordinal) mixed data and impute the missing entries using the fitted model
impute_LRGC( X, rank, nlevels = 20, trunc_method = "Iterative", n_sample = 5000, n_update = 1, maxit = 50, eps = 0.01, verbose = FALSE, runiter = 0, ... )
X |
A matrix or data.frame with missing values. Observed entry of |
rank |
The rank, i.e. number of latent factors |
nlevels |
A column which has larger number of unique values than |
trunc_method |
Method for evaluating truncated normal moments: |
n_sample |
Number of MC samples, only used when |
n_update |
The number of updates, only used when |
maxit |
Maximum number of iterations |
eps |
Convergence threshold |
verbose |
Whether to print progress information |
runiter |
When set as a positive integer, the algorithm will run the specified number of iterations exactly. |
... |
Additional arguments for development use |
Impute the missing entries of continuous and ordinal mixed data by fitting a low rank Gaussian copula (LRGC) model to the data. LRGC is a subclass of Gaussian copula: it requires the copula correlation matrix to have a low rank plus diagonal decomposition: Σ = WW^\top + σ^2 \mathrm{I}_p where W\in \mathbb{R}\times {p\times k} and k<p.
A list containing:
Ximp
Imputed data matrix
W
Fitted latent low rank subspace matrix
sigma
Fitted noise variance
loglik
The log-likelihood achieved during iteration. This value approximates the true objective function we want to maximize, which is hard to evaluate. Monotonically increasing loglik
sequence indicates good fit
Zimp
The imputed Z matrix. On observed ordinal entries, the entry is the corresponding estimated conditional mean. Useful for constructing confidence intervals.
C
The conditional variance corresponding to the observed Z matrix. Useful for quantifying imputation uncertainty.
cutoffs
The estimated cutoffs for ordinal dimensions. Useful for quantifying imputation uncertainty.
Yuxuan Zhao, yz2295@cornell.edu and Madeleine Udell, udell@cornell.edu
Zhao, Y., & Udell, M. (2020). Matrix Completion with Quantified Uncertainty through Low Rank Gaussian Copula. NeurIPS 2020.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.