run_RPISeq | R Documentation |
This function can predict lncRNA/RNA-protein interactions using RPISeq method. Both the web-based original version and retrained model are available. Network is required to use the original version. Model retraining and feature extraction are also supported.
run_RPISeq(
seqRNA,
seqPro,
mode = c("prediction", "retrain", "feature"),
prediction = c("web", "retrained"),
retrained.model = NULL,
label = NULL,
positive.class = NULL,
folds.num = 10,
ntree = 3000,
mtry.ratios = c(0.1, 0.2, 0.4, 0.6, 0.8),
seed = 1,
parallel.cores = 2,
cl = NULL,
...
)
seqRNA |
RNA sequences loaded by function |
seqPro |
protein sequences loaded by function |
mode |
a string. Set |
prediction |
(only when |
retrained.model |
(only when |
label |
a string or a vector of strings or |
positive.class |
(only when |
folds.num |
(only when |
ntree |
integer, number of trees to grow. See |
mtry.ratios |
(only when |
seed |
(only when |
parallel.cores |
an integer that indicates the number of cores for parallel computation.
Default: |
cl |
parallel cores to be passed to this function. |
... |
(only when |
If mode = "prediction"
, this function returns a data frame that contains the predicted results.
If mode = "retrain"
, this function returns a random forest classifier.
If mode = "feature"
, this function returns a data frame that contains the extracted features.
Muppirala UK, Honavar VG, Dobbs D. Predicting RNA-protein interactions using only sequence information. BMC Bioinformatics 2011; 12:489
# Following codes only show how to use this function
# and cannot reflect the genuine performance of tools or classifiers.
data(demoPositiveSeq)
seqRNA <- demoPositiveSeq$RNA.positive
seqPro <- demoPositiveSeq$Pro.positive
# Predicting ncRNA-protein pairs:
Res_RPISeq_1 <- run_RPISeq(seqRNA = seqRNA, seqPro = seqPro, mode = "prediction",
label = "Interact", prediction = "web",
parallel.cores = 2) # using web server
Res_RPISeq_2 <- run_RPISeq(seqRNA = seqRNA, seqPro = seqPro, mode = "prediction",
prediction = "retrained", retrained.model = NULL,
parallel.cores = 2) # using default rebuilt model
# Train a new model:
# Argument "label" which indicates the class of each input pair is required here.
# "label" should correspond to the classes of "seqRNA" and "seqPro".
# "positive.class" should be one of the classes in argument "label" or can be set as "NULL".
# In the latter case, the first label in "label" will be used as the positive class.
# Parameters of random forest, such as "nodesize", can be passed using "..." argument.
RPI_model <- run_RPISeq(seqRNA = seqRNA, seqPro = seqPro, mode = "retrain",
label = rep(c("Interact", "Non.Interact"), each = 10),
positive.class = "Interact", folds.num = 5,
ntree = 300, seed = 1, parallel.cores = 2, nodesize = 2)
# Predicting using new built model by setting "retrained.model = RPI_model":
Res_RPISeq_3 <- run_RPISeq(seqRNA = seqRNA, seqPro = seqPro, mode = "prediction",
prediction = "retrained", retrained.model = RPI_model,
label = rep(c("Interact", "Non.Interact"), each = 10),
parallel.cores = 2)
# Only extracting features:
RPISeq_feature_df <- run_RPISeq(seqRNA = seqRNA, seqPro = seqPro, mode = "feature",
label = "Interact", parallel.cores = 2)
# Extracted features can be used to build classifiers using other machine learning
# algorithms, which provides users with more flexibility.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.