ppcapM: Probabilistic PCA (pcaMethods version)
In HGray384/pcaNet: Probabilistic principal components analysis - covariance estimation and network reconstruction


Implements a probabilistic PCA missing value estimator, as in pcaMethods. Using Rcpp makes this version faster, and the emphasised output is the covariance matrix Sigma, which can be used for network reconstruction.

ppcapM(
  myMat,
  nPcs = 2,
  seed = NA,
  threshold = 1e-04,
  maxIterations = 1000,
  loglike = TRUE,
  verbose = TRUE
)

`myMat`	`matrix` – Pre-processed matrix (centered, scaled) with variables in columns and observations in rows. The data may contain missing values, denoted as `NA`.
`nPcs`	`numeric` – Number of components used for re-estimation. Choosing too few components may decrease the estimation precision.
`seed`	`numeric` – The random number seed; useful to fix when comparing algorithms.
`threshold`	`numeric` – Convergence threshold. The algorithm stops once the increase in precision from an update falls below this value.
`maxIterations`	`numeric` – Maximum number of estimation steps.
`loglike`	`logical` – Should the log-likelihood of the estimated parameters be returned? See Details.
`verbose`	`logical` – Should intermediate algorithm output be printed?
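
As a rough sketch of the pre-processing that `myMat` expects, base R's `scale()` centres and scales each column using only the observed (non-`NA`) entries. The toy matrix `raw` and the name `myMat.prepared` are made up for illustration, and whether unit-variance scaling suits a given dataset is left to the user.

```r
# Toy data: 6 observations (rows) of 3 variables (columns), with missing values.
# `raw` is an illustrative matrix, not part of pcaNet.
set.seed(1)
raw <- matrix(rnorm(18, mean = 5, sd = 2), nrow = 6, ncol = 3)
raw[2, 1] <- NA
raw[5, 3] <- NA

# scale() centres each column on its mean and divides by its standard
# deviation; both are computed from the observed (non-NA) entries only.
myMat.prepared <- scale(raw, center = TRUE, scale = TRUE)

# Over the observed entries, column means are ~0 and standard deviations are 1;
# the NA positions are left untouched.
colMeans(myMat.prepared, na.rm = TRUE)
apply(myMat.prepared, 2, sd, na.rm = TRUE)
which(is.na(myMat.prepared))
```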

Details about the probabilistic model underlying PPCA are found in Bishop (1999). The algorithm (Porta et al., 2005) uses an expectation-maximisation approach together with a probabilistic model to approximate the principal axes (the eigenvectors of the covariance matrix in PCA). Estimation is iterative: the algorithm terminates when either the maximum number of iterations (`maxIterations`) is reached or the estimated increase in precision falls below `threshold` (1e-04 by default).
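
To make the iterate-until-converged structure concrete, here is a minimal sketch of the textbook EM updates for PPCA on a complete, column-centred matrix. It is not the Rcpp code behind `ppcapM`: it does not handle `NA` entries, the helper name `ppca.em.sketch` is hypothetical, and the stopping rule (relative change in log-likelihood) is an assumed stand-in for the "increase in precision" criterion described above.

```r
# Minimal EM for PPCA on a complete, column-centred matrix X (rows = observations).
# `ppca.em.sketch` is a hypothetical helper, not a pcaNet or pcaMethods function.
ppca.em.sketch <- function(X, nPcs = 2, threshold = 1e-04, maxIterations = 1000) {
  N <- nrow(X); d <- ncol(X)
  S <- crossprod(X) / N                      # sample covariance of centred data
  W <- matrix(rnorm(d * nPcs), d, nPcs)      # random initial loadings
  sigmaSq <- 1
  loglik <- function(W, sigmaSq) {
    C <- tcrossprod(W) + diag(sigmaSq, d)    # model covariance
    logdetC <- as.numeric(determinant(C, logarithm = TRUE)$modulus)
    -N / 2 * (d * log(2 * pi) + logdetC + sum(diag(solve(C, S))))
  }
  ll.old <- loglik(W, sigmaSq)
  for (iter in seq_len(maxIterations)) {
    # E-step: posterior moments of the latent scores
    Minv <- solve(crossprod(W) + diag(sigmaSq, nPcs))
    Ez <- X %*% W %*% Minv                   # N x nPcs posterior means
    sumEzz <- N * sigmaSq * Minv + crossprod(Ez)
    # M-step: update the loadings, then the isotropic variance
    W <- crossprod(X, Ez) %*% solve(sumEzz)
    sigmaSq <- (sum(X^2) - 2 * sum((X %*% W) * Ez) +
                  sum(diag(sumEzz %*% crossprod(W)))) / (N * d)
    # Stop once the relative change in log-likelihood is below the threshold
    ll <- loglik(W, sigmaSq)
    if (abs(ll - ll.old) / abs(ll.old) < threshold) break
    ll.old <- ll
  }
  list(W = W, sigmaSq = sigmaSq, Sigma = tcrossprod(W) + diag(sigmaSq, d),
       iterations = iter)
}

# Toy check on simulated data with noise variance 0.1 (illustrative values)
set.seed(42)
W.true <- matrix(rnorm(10 * 2), 10, 2)
Z <- matrix(rnorm(500 * 2), 500, 2)
X <- Z %*% t(W.true) + matrix(rnorm(500 * 10, sd = sqrt(0.1)), 500, 10)
X <- scale(X, center = TRUE, scale = FALSE)
fit <- ppca.em.sketch(X, nPcs = 2, threshold = 1e-08)
fit$sigmaSq      # expected to land near the simulated noise variance
fit$iterations
```

A tighter `threshold` trades extra iterations for estimates closer to the maximum-likelihood solution; `maxIterations` caps the cost when convergence is slow.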

A list of 4 elements:

W: matrix – the estimated loadings.
sigmaSq: numeric – the estimated isotropic variance.
Sigma: matrix – the estimated covariance matrix.
pcaMethodsRes: class – see pcaRes.
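
The example on this page builds the true covariance as `tcrossprod(W) + diag(sigma.sq, n.genes)`, and the sketch below assumes the returned `Sigma` has the same form. It also shows one common route from a covariance matrix to a network, namely partial correlations read off the inverse covariance; this page does not say which method pcaNet itself uses, so treat that step as an assumption. `W.toy`, `sigmaSq.toy`, and `partial.cor` are illustrative names.

```r
# Toy loadings and noise variance standing in for pp$W and pp$sigmaSq
# (illustrative values, not output from ppcapM).
set.seed(7)
p <- 6
W.toy <- matrix(rnorm(p * 2), p, 2)
sigmaSq.toy <- 0.5

# PPCA covariance: low-rank part plus isotropic noise
Sigma.toy <- tcrossprod(W.toy) + diag(sigmaSq.toy, p)

# Precision matrix; Sigma.toy is positive definite because sigmaSq.toy > 0
Omega <- solve(Sigma.toy)

# Partial correlation between variables i and j given all the others:
# -Omega[i, j] / sqrt(Omega[i, i] * Omega[j, j])
partial.cor <- -cov2cor(Omega)
diag(partial.cor) <- 1

round(partial.cor, 2)
```

Entries of `partial.cor` with large absolute value would be the candidate edges; how to threshold them is a separate modelling choice.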

Porta, J.M., Verbeek, J.J. and Kroese, B.J., 2005. link

Stacklies, W., Redestig, H., Scholz, M., Walther, D. and Selbig, J., 2007. doi.

# simulate a dataset from a zero mean factor model X = Wz + epsilon
# start off by generating a random binary connectivity matrix
n.factors <- 5
n.genes <- 200
# with dense connectivity
# set.seed(20)
conn.mat <- matrix(rbinom(n = n.genes*n.factors,
                          size = 1, prob = 0.7), c(n.genes, n.factors))

# now generate a loadings matrix from this connectivity
loading.gen <- function(x){
  ifelse(x==0, 0, rnorm(1, 0, 1))
}

W <- apply(conn.mat, c(1, 2), loading.gen)

# generate factor matrix
n.samples <- 100
z <- replicate(n.samples, rnorm(n.factors, 0, 1))

# generate a noise matrix
sigma.sq <- 0.1
epsilon <- replicate(n.samples, rnorm(n.genes, 0, sqrt(sigma.sq)))

# by the ppca equations this gives us the data matrix
X <- W%*%z + epsilon
WWt <- tcrossprod(W)
Sigma <- WWt + diag(sigma.sq, n.genes)

# select 10% of entries to make missing values
missFrac <- 0.1
inds <- sample(x = seq_along(X),
               size = ceiling(length(X)*missFrac),
               replace = FALSE)

# replace them with NAs in the dataset
missing.dataset <- X
missing.dataset[inds] <- NA

# run ppca
pp <- ppcapM(t(missing.dataset), nPcs = 5)
names(pp)

# sigmasq estimation
abs(pp$sigmaSq-sigma.sq)

# X reconstruction
recon.X <- pp$pcaMethodsRes@loadings%*%t(pp$pcaMethodsRes@scores)
norm(recon.X-X, type="F")^2/(length(X))

# covariance estimation (mean squared error per entry of Sigma)
norm(pp$Sigma-Sigma, type="F")^2/(length(Sigma))
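
The reconstruction error above averages over every entry of `X`, observed and missing alike. When the interest is in imputation, it can be more informative to score only the masked entries. The sketch below does this with a column-mean baseline standing in for the reconstruction so that it runs without pcaNet; `heldout.mse` and the toy matrices are hypothetical.

```r
# Mean squared error restricted to the entries that were masked out.
# `heldout.mse` is a hypothetical helper, not part of pcaNet.
heldout.mse <- function(truth, estimate, masked) {
  mean((truth[masked] - estimate[masked])^2)
}

# Small stand-in dataset: 50 observations (rows) of 8 variables (columns)
set.seed(3)
truth <- matrix(rnorm(50 * 8), 50, 8)
masked <- sample(length(truth), size = ceiling(0.1 * length(truth)))
observed <- truth
observed[masked] <- NA

# Baseline: fill each missing entry with its column mean over observed values
col.means <- colMeans(observed, na.rm = TRUE)
baseline <- observed
na.pos <- which(is.na(baseline), arr.ind = TRUE)
baseline[na.pos] <- col.means[na.pos[, "col"]]

heldout.mse(truth, baseline, masked)
```

With the objects from the example above, the analogous call would be `heldout.mse(X, recon.X, inds)`, since `recon.X` and `X` share the same orientation.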

HGray384/pcaNet documentation built on Nov. 14, 2020, 11:11 a.m.
