This page provides installation instruction and usage examples for the R package **rnmf**.

rNMF decomposes a p by n nonnegative matrix $X$ into $X\approx WH$ by minimizing a penalized objective function

$$||X-WH||*{\gamma}^2 + \alpha||W||_F + \beta\sum||H*{.j}||_1^2$$

Here rows of $X$ are variables and columns are observations. Both $W$ and $H$ are nonnegative. $W$ is p by k, $H$ is k by n with $k<\min{p,n}$. $\alpha$ and $\beta$ control the magnitude of $W$ and the sparsity of $H$.

As a major advantage over the regular NMF, rNMF detects outliers in $X$ and remove them from the fitting objective function. See [J. Sun, Y. Xu, K. Lopiano and S. Young 2013] for more details.

The procedure can be used to extract clean low dimensional structure from corrupted high dimensional data, such as images, DNA/RNA sequencing data and documents. It can also be used to compress a single corrupted image.

After downloading the package file "rnmf_0.5.tar.gz" from [GitHub repo address here], put it in your preferred working directory and run both of the following lines (remove the '##' from the first line):

install.packages("rnmf_0.5.tar.gz", repos = NULL, type = "source") library(rNMF)

rNMF can be used on any nonnegative data matrix. For illustration purposes, we give two examples with image data.

The first example demonstrates the decomposition and outlier extraction of multiple corrupted images by rNMF. The second example demonstrates the compression of a single corrupted image by rNMF.

First load the build-in data set 'Symbols_c', a 5625 by 30 matrix where each column contains a vectorized 75 by 75 image.

data(Symbols_c)

The function 'see()' shows the corrupted images:

see(Symbols_c, title = "Corrupted data set")

Solid boxes in the images are corruptions. In the following we compare the decomposition results by regular NMF and rNMF.

**Regular NMF** decomposition (gamma = FALSE, k = 4) gives the following result.

res <- rnmf(Symbols_c, k = 4, showprogress = FALSE, my.seed = 100)

The class of `res`

is "rnmf"(see ?rnmf for details). `res$fit`

gives the reconstruction:

see(res$fit, title = "Regular NMF reconstruction with k = 4")

Basis vectors are given by `res$W`

see(res$W, title = "Regular NMF basis", layout = c(1,4))

Shadows in above images indicate that **corruptions were not removed** and contaminated the decomposition.

Now the same data is decomposed with **rNMF** with trimming (gamma = 0.03, k = 4)

res2 <- rnmf(Symbols_c, k = 4, gamma = 0.03, showprogress = FALSE, my.seed = 100, tol = 0.0001, maxit = 50)

rnmf reconstruction:

see(res2$fit, title = "rNMF reconstruction with k = 4")

see(res2$W, title = "rNMF basis vectors", layout = c(1,4))

The results are more clear and the outliers are also extracted:

outliers <- matrix(0, nrow = nrow(Symbols_c), ncol = ncol(Symbols_c)) outliers[res2$trimmed[[res2$niter]]] <- 1 see(outliers, title = "Outliers extracted by rNMF")

In this example we compare the compression of a single corrupted image by both regular NMF and rNMF. First we load a build-in corrupted face image.

data(face)

First let's look at the corrupted face image:

see(face, title = "Corrupted face image", col = "grey", input = "single")

res <- rnmf(face, k = 10, showprogress = FALSE, my.seed = 100)

see(res$fit, title = "NMF compression", col = "grey", input = "single")

The compression is heavily contaminated.

res2 <- rnmf(face, k = 10, gamma = 0.025, showprogress = FALSE, my.seed = 100)

see(res2$fit, title = "rNMF compression", col = "grey", input = "single")

**Any scripts or data that you put into this service are public.**

Embedding an R snippet on your website

Add the following code to your website.

For more information on customizing the embed code, read Embedding Snippets.