run_LncADeep | R Documentation |
This function can predict lncRNA/RNA-protein interactions using rebuilt model trained with LncADeep's feature set. Model retraining and feature extraction are also supported. LncADeep selects 110 features to build its classifier. Here, the 110 top features are determined by averaging feature scores of 33 evaluation results provided by LncADeep. LncADeep's original model is trained using deep neural network (DNN). Considering that DNN architecture is hard to perform parameter tuning, we rebuild the model using the same machine algorithm (random forest) as the other methods. Users can build DNN model with the features generated by this function.
run_LncADeep(
seqRNA,
seqPro,
mode = c("prediction", "retrain", "feature"),
retrained.model = NULL,
label = NULL,
positive.class = NULL,
folds.num = 10,
ntree = 3000,
mtry.ratios = c(0.1, 0.2, 0.4, 0.6, 0.8),
seed = 1,
parallel.cores = 2,
cl = NULL,
...
)
seqRNA |
RNA sequences loaded by function |
seqPro |
protein sequences loaded by function |
mode |
a string. Set |
retrained.model |
(only when |
label |
a string or a vector of strings or |
positive.class |
(only when |
folds.num |
(only when |
ntree |
integer, number of trees to grow. See |
mtry.ratios |
(only when |
seed |
(only when |
parallel.cores |
an integer that indicates the number of cores for parallel computation.
Default: |
cl |
parallel cores to be passed to this function. |
... |
(only when |
If mode = "prediction"
, this function returns a data frame that contains the predicted results.
If mode = "retrain"
, this function returns a random forest classifier.
If mode = "feature"
, this function returns a data frame that contains the extracted features.
Yang C, Yang L, Zhou M, et al. LncADeep: an ab initio lncRNA identification and functional annotation tool based on deep learning. Bioinformatics. 2018; 34(22):3825-3834.
# Following codes only show how to use this function
# and cannot reflect the genuine performance of tools or classifiers.
data(demoPositiveSeq)
seqRNA <- demoPositiveSeq$RNA.positive
seqPro <- demoPositiveSeq$Pro.positive
# Predicting ncRNA-protein pairs:
Res_LncADeep_1 <- run_LncADeep(seqRNA = seqRNA, seqPro = seqPro, mode = "prediction",
retrained.model = NULL, label = "LncADeep_res",
parallel.cores = 2) # using default rebuilt model
# Train a new model:
# Argument "label" which indicates the class of each input pair is required here.
# "label" should correspond to the classes of "seqRNA" and "seqPro".
# "positive.class" should be one of the classes in argument "label" or can be set as "NULL".
# In the latter case, the first label in "label" will be used as the positive class.
# Parameters of random forest, such as "nodesize", can be passed using "..." argument.
LncADeep_model <- run_LncADeep(seqRNA = seqRNA, seqPro = seqPro, mode = "retrain",
label = rep(c("Interact", "Non.Interact"), each = 10),
positive.class = NULL, folds.num = 5, ntree = 100,
seed = 1, parallel.cores = 2, nodesize = 2)
# Predicting using new built model by setting "retrained.model = LncADeep_model":
Res_LncADeep_2 <- run_LncADeep(seqRNA = seqRNA, seqPro = seqPro, mode = "prediction",
retrained.model = LncADeep_model, label = NULL,
parallel.cores = 2)
# Only extracting features:
LncADeep_feature_df <- run_LncADeep(seqRNA = seqRNA, seqPro = seqPro, mode = "feature",
label = "feature", parallel.cores = 2)
# Extracted features can be used to build classifiers using other machine learning
# algorithms, which provides users with more flexibility.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.