run_rpiCOOL | R Documentation |
This function can predict lncRNA/RNA-protein interactions using rebuilt model trained with rpiCOOL's feature set. Model retraining and feature extraction are also supported. The codes of this function slightly differ from rpiCOOL's script.
run_rpiCOOL(
seqRNA,
seqPro,
mode = c("prediction", "retrain", "feature"),
retrained.model = NULL,
label = NULL,
positive.class = NULL,
folds.num = 10,
ntree = 3000,
mtry.ratios = c(0.1, 0.2, 0.4, 0.6, 0.8),
seed = 1,
parallel.cores = 2,
cl = NULL,
...
)
seqRNA |
RNA sequences loaded by function |
seqPro |
protein sequences loaded by function |
mode |
a string. Set |
retrained.model |
(only when |
label |
a string or a vector of strings or |
positive.class |
(only when |
folds.num |
(only when |
ntree |
integer, number of trees to grow. See |
mtry.ratios |
(only when |
seed |
(only when |
parallel.cores |
an integer that indicates the number of cores for parallel computation.
Default: |
cl |
parallel cores to be passed to this function. |
... |
(only when |
If mode = "prediction"
, this function returns a data frame that contains the predicted results.
If mode = "retrain"
, this function returns a random forest classifier.
If mode = "feature"
, this function returns a data frame that contains the extracted features.
Akbaripour-Elahabad M, Zahiri J, Rafeh R, et al. rpiCOOL: A tool for In Silico RNA-protein interaction detection using random forest. J. Theor. Biol. 2016; 402:1-8
# Following codes only show how to use this function
# and cannot reflect the genuine performance of tools or classifiers.
data(demoPositiveSeq)
seqRNA <- demoPositiveSeq$RNA.positive
seqPro <- demoPositiveSeq$Pro.positive
# Predicting ncRNA-protein pairs:
Res_rpiCOOL_1 <- run_rpiCOOL(seqRNA = seqRNA, seqPro = seqPro, mode = "prediction",
retrained.model = NULL, label = "rpiCOOL_res",
parallel.cores = 2) # using default rebuilt model
# Train a new model:
# Argument "label" which indicates the class of each input pair is required here.
# "label" should correspond to the classes of "seqRNA" and "seqPro".
# "positive.class" should be one of the classes in argument "label" or can be set as "NULL".
# In the latter case, the first label in "label" will be used as the positive class.
# Parameters of random forest, such as "replace", can be passed using "..." argument.
rpiCOOL_model = run_rpiCOOL(seqRNA = seqRNA, seqPro = seqPro, mode = "retrain",
label = rep(c("Interact", "Non.Interact"), each = 10),
positive.class = NULL, folds.num = 5, ntree = 300,
seed = 1, parallel.cores = 2, replace = FALSE)
# Predicting using new built model by setting "retrained.model = rpiCOOL_model":
Res_rpiCOOL_2 <- run_rpiCOOL(seqRNA = seqRNA, seqPro = seqPro, mode = "prediction",
retrained.model = rpiCOOL_model, label = NULL,
parallel.cores = 2)
# Only extracting features:
rpiCOOL_feature_df <- run_rpiCOOL(seqRNA = seqRNA, seqPro = seqPro, mode = "feature",
label = "feature", parallel.cores = 2)
# Extracted features can be used to build classifiers using other machine learning
# algorithms, which provides users with more flexibility.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.