run_LION | R Documentation |
This function can predict lncRNA/RNA-protein interactions using LION method. Model retraining and feature extraction are also supported.
run_LION(
seqRNA,
seqPro,
mode = c("prediction", "retrain", "feature"),
retrained.model = NULL,
label = NULL,
positive.class = NULL,
folds.num = 10,
ntree = 3000,
mtry.ratios = c(0.1, 0.2, 0.4, 0.6, 0.8),
seed = 1,
parallel.cores = 2,
cl = NULL,
...
)
seqRNA |
RNA sequences loaded by function |
seqPro |
protein sequences loaded by function |
mode |
a string. Set |
retrained.model |
(only when |
label |
a string or a vector of strings or |
positive.class |
(only when |
folds.num |
(only when |
ntree |
integer, number of trees to grow. See |
mtry.ratios |
(only when |
seed |
(only when |
parallel.cores |
an integer that indicates the number of cores for parallel computation.
Default: |
cl |
parallel cores to be passed to this function. |
... |
(only when |
If mode = "prediction"
, this function returns a data frame that contains the predicted results.
If mode = "retrain"
, this function returns a random forest classifier.
If mode = "feature"
, this function returns a data frame that contains the extracted features.
Han S, Yang X, Sun H, et al. LION: an integrated R package for effective prediction of ncRNA–protein interaction. Briefings in Bioinformatics. 2022; 23(6):bbac420
# Following codes only show how to use this function
# and cannot reflect the genuine performance of tools or classifiers.
data(demoPositiveSeq)
seqRNA <- demoPositiveSeq$RNA.positive
seqPro <- demoPositiveSeq$Pro.positive
# Predicting ncRNA-protein pairs:
Res_LION_1 <- run_LION(seqRNA = seqRNA, seqPro = seqPro,
parallel.cores = 2) # using the default setting
# the above command is equivalent to:
Res_LION_2 <- run_LION(seqRNA = seqRNA, seqPro = seqPro, mode = "prediction",
retrained.model = NULL, label = NULL,
parallel.cores = 2)
# Train a new model:
# Argument "label" which indicates the class of each input pair is required here.
# "label" should correspond to the classes of "seqRNA" and "seqPro".
# "positive.class" should be one of the classes in argument "label" or can be set as "NULL".
# In the latter case, the first label in "label" will be used as the positive class.
# Parameters of random forest, such as "replace", can be passed using "..." argument.
LION_model <- run_LION(seqRNA = seqRNA, seqPro = seqPro, mode = "retrain",
label = rep(c("Interact", "Non.Interact"), each = 10),
positive.class = NULL, folds.num = 5, ntree = 100,
seed = 1, parallel.cores = 2, replace = FALSE)
# Predicting using new built model by setting "retrained.model = LION_model":
Res_LION_2 <- run_LION(seqRNA = seqRNA, seqPro = seqPro, mode = "prediction",
retrained.model = LION_model,
label = rep(c("Interact", "Non.Interact"), each = 10),
parallel.cores = 2)
# Only extracting features:
LION_feature_df <- run_LION(seqRNA = seqRNA, seqPro = seqPro, mode = "feature",
label = "LION_feature", parallel.cores = 2)
# Extracted features can be used to build classifiers using other machine learning
# algorithms, which provides users with more flexibility.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.