Description Usage Arguments Details Value Examples
This function calculates multiple classifiers that are used to predict the class of a new sample. It implements the class prediction tool with multiple methods in BRB-ArrayTools.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 | classPredict(
exprTrain,
exprTest = NULL,
isPaired = FALSE,
pairVar.train = NULL,
pairVar.test = NULL,
geneId,
cls,
pmethod = c("ccp", "bcc", "dlda", "knn", "nc", "svm"),
geneSelect = "igenes.univAlpha",
univAlpha = 0.001,
univMcr = 0.2,
foldDiff = 2,
rvm = FALSE,
filter = NULL,
ngenePairs = 25,
nfrvm = 10,
cvMethod = 1,
kfoldValue = 10,
bccPrior = 1,
bccThresh = 0.8,
nperm = 0,
svmCost = 1,
svmWeight = 1,
fixseed = 1,
prevalence = NULL,
projectPath,
outputName = "ClassPrediction",
generateHTML = FALSE
)
|
exprTrain |
matrix of gene expression data for training samples. Rows are genes and columns are arrays. Its column names must be provided. |
exprTest |
matrix of gene expression data for new samples. Its column names must be provided. |
isPaired |
logical. If |
pairVar.train |
vector of pairing variables for training samples. |
pairVar.test |
vector of pairing variables for new samples. |
geneId |
matrix/data frame of gene IDs. |
cls |
vector of training sample classes. |
pmethod |
character string vector of prediction methods to be employed.
|
geneSelect |
character string for gene selection method.
|
univAlpha |
numeric for a significance level. Default is 0.001. |
univMcr |
numeric for univariate misclassification rate. Default is 0.2. |
foldDiff |
numeric for fold ratio of geometric means between two classes exceeding. 0 means not to enable this option. Default is 2. |
rvm |
logical. If |
filter |
vector of 1/0's of the same length as genes. 1 means to keep the gene while 0 means to exclude genes
from class comparison analysis. If |
nfrvm |
numeric specifying the number of features selected by the support vector machine recursive feature elimination method. Default is 10. |
cvMethod |
numeric for the cross validation method. Default is 1.
|
kfoldValue |
numeric specifying the number of folds if K-fold method is selected. Default is 10. |
bccPrior |
numeric specifying the prior probability option for the Baysian compound covariate prediction.
If |
bccThresh |
numeric specifying the uncertainty threshold for the Bayesian compound covariate prediction. Default is 0.8. |
nperm |
numeric specifying the number of permutations for the significance test of cross-validated mis-classification rate. It should be equal to zero or greater than 50. Default is 0. |
svmCost |
numeric specifying the cost values for SVM. Default is 1. |
svmWeight |
numeric specifying the weight values for SVM. Default is 1. |
fixseed |
numeric. |
prevalence |
vector for class prevalences. When prevalence is |
projectPath |
character string specifying the full project path. |
outputName |
character string specifying the output folder name. Default is "ClassPrediction". |
generateHTML |
logical. If |
ngenePairs: |
numeric specifying the number of gene pairs selected by the greedy pairs method. Default is 25. |
Please see the BRB-ArrayTools manual (https://brb.nci.nih.gov/BRB-ArrayTools/Documentation.html) for details.
A list that may include the following objects:
performClass
: a data frame with the performance of classifiers during cross-validation:
percentCorrectClass
: a data frame with the mean percent of correct classification for each sample using
different prediction methods.
predNewSamples
:s a data frame with predicted class for each
new sample. 'NC' means that a sample is not classified. In this example, there are four new samples.
probNew
: a data frame with the predicted probability of each new sample belonginG to the class (BRCA1) from the the Bayesian Compound Covariate method.
classifierTable
: a data frame with composition of classifiers such as geometric means of values in each class, p-values and Gene IDs.
probInClass
: a data frame with predicted probability of each training sample belonging to
aclass during cross-validation from the Bayesian Compound Covariate
CCPSenSpec
: a data frame with performance (i.e., sensitivity, specificity, positive prediction value,
negative prediction value) of the Compound Covariate Predictor Classifier.
LDASenSpec
: a data frame with performance (i.e., sensitivity, specificity, positive prediction value,
negative prediction value) of the Diagonal Linear Discriminant Analysis Classifier.
K1NNSenSpec
: a data frame with performance (i.e., sensitivity, specificity, positive prediction value,
negative prediction value) of the 1-Nearest Neighbor Classifier.
K3NNSenSpec
: a data frame with performance (i.e., sensitivity, specificity, positive prediction value,
negative prediction value) of the 3-Nearest Neighbor Classifier.
CentroidSenSpec
: a data frame with performance (i.e., sensitivity, specificity, positive prediction value,
negative prediction value) of the Nearest Centroid Classifierr.
SVMSenSpec
: a data frame with performance (i.e., sensitivity, specificity, positive prediction value,
negative prediction value) of the Support Vector Machine Classifier.
BCPPSenSpec
: a data frame with performance (i.e., sensitivity, specificity, positive prediction value,
negative prediction value) of the Bayesian Compound Covariate Classifier.
weightLinearPred
: a data frame with gene weights for linear predictors such as Compound Covariate Predictor,
Diagonal Linear Discriminat Analysis and Support Vector Machine.
thresholdLinearPred
: a numeric vector of the thresholds for the linear prediction rules related with weightLinearPred
.
Each prediction rule is defined by the inner sum of the weights (w_i)
and log expression values (x_i) of significant genes.
In this case, a sample is classified to the class BRCA1 if
the sum is greater than the threshold; that is, ∑_i w_i x_i > threshold.
GRPCentroid
: a data frame with centroid of each class for each predictor gene.
ppval
: a vector of permutation p-values of statistical significance tests of cross-validated estimate of misclassification rate from specified #' prediction methods.
pmethod
: a vector of prediction methods that are specified.
workPath
: the path for fortran and other intermediate outputs.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 | dataset<-"Brca"
# gene IDs
geneId <- read.delim(system.file("extdata", paste0(dataset, "_GENEID.txt"),
package = "classpredict"), as.is = TRUE, colClasses = "character")
# expression data
x <- read.delim(system.file("extdata", paste0(dataset, "_LOGRAT.TXT"),
package = "classpredict"), header = FALSE)
# filter information, 1 - pass the filter, 0 - filtered
filter <- scan(system.file("extdata", paste0(dataset, "_FILTER.TXT"),
package = "classpredict"), quiet = TRUE)
# class information
expdesign <- read.delim(system.file("extdata", paste0(dataset, "_EXPDESIGN.txt"),
package = "classpredict"), as.is = TRUE)
# training/test information
testSet <- expdesign[, 10]
trainingInd <- which(testSet == "training")
predictInd <- which(testSet == "predict")
ind1 <- which(expdesign[trainingInd, 4] == "BRCA1")
ind2 <- which(expdesign[trainingInd, 4] == "BRCA2")
ind <- c(ind1, ind2)
exprTrain <- x[, ind]
colnames(exprTrain) <- expdesign[ind, 1]
exprTest <- x[, predictInd]
colnames(exprTest) <- expdesign[predictInd, 1]
projectPath <- file.path(Sys.getenv("HOME"),"Brca")
outputName <- "ClassPrediction"
generateHTML <- TRUE
resList <- classPredict(exprTrain = exprTrain, exprTest = exprTest, isPaired = FALSE,
pairVar.train = NULL, pairVar.test = NULL, geneId,
cls = c(rep("BRCA1", length(ind1)), rep("BRCA2", length(ind2))),
pmethod = c("ccp", "bcc", "dlda", "knn", "nc", "svm"),
geneSelect = "igenes.univAlpha",
univAlpha = 0.001, univMcr = 0, foldDiff = 0, rvm = TRUE,
filter = filter, ngenePairs = 25, nfrvm = 10, cvMethod = 1,
kfoldValue = 10, bccPrior = 1, bccThresh = 0.8, nperm = 0,
svmCost = 1, svmWeight =1, fixseed = 1, prevalence = NULL,
projectPath = projectPath, outputName = outputName, generateHTML)
if (generateHTML)
browseURL(file.path(projectPath, "Output", outputName,
paste0(outputName, ".html")))
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.