PCS: A partial correlation screening approach for controlling the false positive rate in sparse Gaussian Graphical Models

################################################################################
##########         R Code by Ginette Lafit        ##############################
##########         ginette.lafit@kuleuven.be      ##############################
################################################################################

###############################################################################
###############################################################################
###############################################################################
##### PCS k-fold cross-validation for Nodewise regression using the 'AND' rule
###############################################################################
###############################################################################
###############################################################################

# k-fold CV for PCS: nodewise regression with the 'AND' rule

# Function that estimates the threshold parameter in Partial Correlation Screening (PCS)
# when the GGM is estimated by nodewise regression with the 'AND' rule.
# The threshold is selected by k-fold cross-validation over a grid of 100 values in [0.0001, 1].
# x is the (n x p) data matrix
# n is the number of rows of x
# p is the number of columns of x
# lambda.opt is the regularization parameter used to estimate the GGM by nodewise regression with the 'AND' rule
# fold is the number of folds

# The function returns the threshold parameter that minimizes the squared prediction error averaged over the folds
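
The selection rule described above can be sketched in isolation. The example below is only an illustration of the final step (average the loss over folds, then take the grid point with the smallest mean); the per-fold losses are hypothetical parabolas invented for the example, not output of PCS.

```r
# Toy illustration of the selection step only (hypothetical losses, not PCS output)
taulist = seq(0.0001, 1, length = 100)   # same grid as used in cv_PCS_nei_and
fold = 5

# Hypothetical per-fold loss curves: a parabola with its minimum near tau = 0.3,
# shifted by a fold-specific constant. One row per tau, one column per fold.
loss.re = sapply(1:fold, function(k) (taulist - 0.3)^2 + 0.1 * k)

# Average the loss over folds and keep the threshold with the smallest mean loss
loss.mean = rowMeans(loss.re)
tau_opt = taulist[which.min(loss.mean)]
print(tau_opt)
```

Because the fold-specific constants shift every grid point equally, they do not change which threshold is selected; the result is the grid value closest to 0.3.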

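cv_PCS_nei_and = function(x, lambda.opt, fold) {

  # glmnet provides lasso regression
  library(glmnet)

  # Standardize the columns of x (mean 0, standard deviation 1)
  x = scale(x)

  # Randomly partition the n observations into k test folds of size floor(n/k).
  # Returns two index matrices with one column per fold. If n is not a multiple
  # of k, the leftover observations are only ever used for training.
  cv.part = function(n, k) {
    ntest = floor(n/k)
    ntrain = n - ntest
    ind = sample(n)
    trainMat = matrix(NA, nrow = ntrain, ncol = k)
    testMat = matrix(NA, nrow = ntest, ncol = k)
    nn = 1:n
    for (j in 1:k) {
      sel = ((j-1)*ntest+1):(j*ntest)
      testMat[, j] = ind[sel]
      sel2 = nn[!(nn %in% sel)]
      trainMat[, j] = ind[sel2]
    }
    return(list(trainMat = trainMat, testMat = testMat))
  }

  # Squared prediction error on x.test for every threshold in taulist,
  # with the model fitted on x.train
  loss.cv = function(x.train, x.test, lambda.opt, taulist) {
    # Nodewise regression ('AND' rule) at lambda.opt; the second element of the
    # result is rescaled and sign-flipped to obtain the partial correlation matrix R
    list.nei.alpha = neigh_and_lambda(x.train, lambda.opt)
    R = -cov2cor(list.nei.alpha[[2]])
    diag(R) = 1
    # Screen R at each threshold; the first element of the result is used as
    # the coefficient matrix for predicting each variable from the others
    eps = lapply(seq_along(taulist), function(i) Psi_Screen_Beta(x.train, R, taulist[i])[[1]])
    loss.re = unlist(lapply(seq_along(taulist), function(i) sum(colSums((x.test - x.test %*% eps[[i]])^2))))
    return(loss.re)
  }

  n = nrow(x)
  p = ncol(x)
  part.list = cv.part(n, fold)

  # Grid of candidate thresholds
  taulist = seq(0.0001, 1, length = 100)

  # One loss vector (length(taulist) values) per fold
  loss.list = lapply(1:fold, function(k)
    loss.cv(x[part.list$trainMat[, k], ], x[part.list$testMat[, k], ], lambda.opt, taulist))

  # Rows index thresholds, columns index folds
  loss.re = matrix(unlist(loss.list), ncol = fold, byrow = FALSE)

  # Select the threshold with the smallest loss averaged over folds
  loss.mean = rowMeans(loss.re)
  ind = which.min(loss.mean)
  tau_opt = taulist[ind]

  return(tau_opt)
}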
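
Inside loss.cv, the matrix R is obtained with -cov2cor(...) followed by diag(R) = 1. Assuming the input to cov2cor is an estimated precision matrix (this is inferred from how it is used, not confirmed by the code shown here), that step is the standard conversion from precision entries to partial correlations. A minimal sketch with a hypothetical 3 x 3 precision matrix:

```r
# Hypothetical 3 x 3 precision matrix (chain graph 1 - 2 - 3), for illustration only
Omega = matrix(c( 2, -1,  0,
                 -1,  2, -1,
                  0, -1,  2), nrow = 3, byrow = TRUE)

# Rescale to unit diagonal and flip the sign:
# rho_ij = -omega_ij / sqrt(omega_ii * omega_jj)
R = -cov2cor(Omega)

# The sign flip turns the diagonal into -1, so reset it to 1
diag(R) = 1
print(R)
```

Variables 1 and 3 have a zero entry in Omega, so their partial correlation is zero; this is the kind of entry that thresholding R is meant to recover as an absent edge.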

ginettelafit/PCS documentation built on Nov. 11, 2020, 8:01 a.m.

R/cv_PCS_nei_and.R
In ginettelafit/PCS: A partial correlation screening approach for controlling the false positive rate in sparse Gaussian Graphical Models

Defines functions cv_PCS_nei_and
