View source: R/preprocess_data.R
| preprocess_data | R Documentation | 
The goal of preprocess_data() is to get relevant clusters for G-, S-, or BLLiM initialization, coupled with a feature selection for high-dimensional datasets. This function is an alternative to the default initialization implemented in gllim(), sllim() and bllim(). 
In this function, clusters are initialized with K-means, and variable selection is performed with a LASSO (glmnet) within each clusters. Then selected features are merged to get a subset variables before running any prediction method of xLLiM. 
preprocess_data(tapp,yapp,in_K,...)
| tapp | An  | 
| yapp | An  | 
| in_K | Initial number of components or number of clusters | 
| ... | Other arguments of glmnet can be passed | 
| selected.variables | Vector of the indexes of selected variables. Selection is made within clusters and merged hereafter. | 
| clusters | Initialization clusters with k-means | 
Emeline Perthame (emeline.perthame@pasteur.fr), Emilie Devijver (emilie.devijver@kuleuven.be), Melina Gallopin (melina.gallopin@u-psud.fr)
[1] E. Devijver, M. Gallopin, E. Perthame. Nonlinear network-based quantitative trait prediction from transcriptomic data. Submitted, 2017, available at https://arxiv.org/abs/1701.07899.
xLLiM-package, glmnet-package, kmeans
x <- 1
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.