Description Usage Arguments Details Value Note Author(s) References See Also Examples
A function to build a random forest classifier for RNA structure change
1 | classifyRNA(data=NULL, cutoff=NULL)
|
data |
Optional data to build the classifier. Default is pre-loaded data. |
cutoff |
An optional vector of length equal to number of classes. The winning class for an observation is the one with the maximum ratio of proportion of votes to cutoff. Default is 1/k where k is the number of classes (i.e., majority vote wins). |
This function builds a random forest classifier for RNA structure change using the randomForest package.
A classifyRNA object, based on randomForest object (see randomForest package)
The original call to randomForest
One of regression, classification, or unsupervised.
The predicted values of the input data based on out-of-bag samples.
A matrix with nclass + 2 columns. The first nclass columns are the class-specific measures computed as mean descrease in accuracy. The nclass + 1st column is the mean descrease in accuracy over all classes. The last column is the mean decrease in Gini index.
The standard errors of the permutation-based importance measure. A p by nclass + 1 matrix corresponding to the first nclass + 1 columns of the importance matrix.
Number of trees grown.
Number of predictors sampled for spliting at each node.
A list that contains the entire forest
Vector error rates of the prediction on the input data, the i-th element being the (OOB) error rate for all trees up to the i-th.
The confusion matrix of the prediction (based on OOB data).
A matrix with one row for each input data point and one column for each class, giving the fraction or number of (OOB) votes from the random forest.
Number of times cases are out-of-bag (and thus used in computing OOB error estimate)
A matrix of proximity measures among the input (based on the frequency that pairs of data points are in the same terminal nodes).
Organization of the data file: header=TRUE, tab-delimited .txt file
"column 1" class label
"column 2" pattern correlation
"column 3" dynamic time warping
"column 4" contiguousness
"column 5" magnitude correlation
"column 6" change variance
"column 7" eSDC
"column 8" change range
"column 9" L2 norm
The default data has been gathered from the RNA Mapping Database mutate and map experiments.
Chanin Tolson
A. Liaw and M. Wiener (2002). Classification and Regression by randomForest. R News 2(3), 18–22 (randomForest package)
RNA Mapping Database
1 2 3 4 | #build classifier
rf = classifyRNA()
#get confusion matrix
rf$confusion
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.