iJRF_Perm: Derive importance scores for P permuted data sets.
In iJRF: Integrative Joint Random Forest

Description Usage Arguments Value References Examples

This function computes importance score for P permuted data sets. For each permuted data set, sample labels of response variable are randomly permuted and iJRF is implemented. Resulting importance scores can be used to derive an estimate of FDR.

1	iJRF_Perm(X, Y, W, ntree=NULL, mtry=NULL,res.name=NULL,cov.name=NULL,P)

`X`	List object containing predictors for each class, `X=list(x_1,x_2, ... )` where `x_j` is a `(M x n_j)` matrix with rows corresponding to predictors and columns to samples. Missing values are not allowed.
`Y`	List object containing response variables for each class, `Y=list(y_1,y_2, ... )` where `y_j` is a `(p x n_j)` matrix with rows corresponding to response variables and columns to samples. Missing values are not allowed.
`W`	`(M x p)` Matrix containing sampling scores based on prior information on interactions. Element `(i,j)` contains interaction score `(i -> j)`. Scores must be non-negative. Larger value of sampling score corresponds to higher likelihood of variable `i` interacting with variable `j`. Rows of `W` must be in the same order as the rows of `X`, while columns of `W` must be in the same order as the rows of `Y`.
`ntree`	Numeric value: number of trees. If omitted, `ntree` is set to `1000`.
`mtry`	Numeric value: number of predictors to be sampled at each node. If omitted, `mtry` is set to the square root of the number of predictors.
`res.name`	p-dimensional vector containing names of response variable.
`cov.name`	M-dimensional vector containing names of predictors.
`P`	Number of permutations.

A three dimensional array (I, P, C) where I=M x p is the total number of interactions, C is the number of classes and P the total number of permutations.

Petralia, F. et al (2017) A new method to study the change of miRNA-mRNA interactions due to environmental exposures, Submitted.

Petralia, F., Wang, P., Yang, J., and Tu Z. (2015) Integrative random forest for gene regulatory network inference. 31(12), i197-i205.

Petralia, F., Song, W.M., Tu, Z. and Wang, P. (2016). New method for joint network analysis reveals common and different coexpression patterns among genes and proteins in breast cancer. Journal of proteome research, 15(3), pp.743-754.

Some of the functions utilized are a modified version of functions contained in R package randomForest: A. Liaw and M. Wiener (2002). Classification and Regression by randomForest. R News 2, 18–22.

 # --- Generate data sets
 nclasses=2               # number of data sets / classes
 n1<-n2<-20               # sample size for each data sets
 p<-5                   # number of response variables 
 M<-10                   # number of predictor variables 
 W<-abs(matrix(rnorm(M*p),M,p))    # generate sampling scores

 Res1<-matrix(rnorm(p*n1),p,n1)       # generate response for class 1
 Res2<-matrix(rnorm(p*n2),p,n2)       # generate response for class 2
 Cov1<-matrix(rnorm(M*n1),M,n1)       # generate predictors for class 1
 Cov2<-matrix(rnorm(M*n2),M,n2)       # generate predictors for class 2
 
 # --- Standardize variables to mean 0 and variance 1
  Res1 <- t(apply(Res1, 1, function(x) { (x - mean(x)) / sd(x) } ))
  Res2 <- t(apply(Res2, 1, function(x) { (x - mean(x)) / sd(x) } ))

 # --- Run iJRF and obtain importance score for P permuted data sets
  out<-iJRF_Perm(X=list(Cov1,Cov2),Y=list(Res1,Res2),W=W,P=2)