Ranks features by Kolmogorov-Smirnov distance, largest first, and chooses the features which have the best resubstitution performance.
## S4 method for signature 'matrix'
KolmogorovSmirnovSelection(measurements, classes, ...)
## S4 method for signature 'DataFrame'
KolmogorovSmirnovSelection(measurements, classes, datasetName,
                           trainParams, predictParams, resubstituteParams, ...,
                           selectionName = "Kolmogorov-Smirnov Test", verbose = 3)
## S4 method for signature 'MultiAssayExperiment'
KolmogorovSmirnovSelection(measurements, targets = names(measurements), ...)
measurements |
Either a |
classes |
Either a vector of class labels of class |
targets |
If |
... |
Variables not used by the |
datasetName |
A name for the data set used. Stored in the result. |
trainParams |
A container of class |
predictParams |
A container of class |
resubstituteParams |
An object of class |
selectionName |
A name to identify this selection method by. Stored in the result. |
verbose |
Default: 3. A number between 0 and 3 for the amount of progress messages to give. This function only prints progress messages if the value is 3. |
Features are sorted from the largest Kolmogorov-Smirnov distance to the smallest. For each candidate number of features, that many top-ranked features are used in a classifier, to determine which number of features has the best resubstitution performance.
Data tables which consist entirely of non-numeric data cannot be analysed. If measurements is an object of class MultiAssayExperiment, the factor of sample classes must be stored in the DataFrame accessible by the colData function with column name "class".
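The ranking and selection steps described above can be sketched in base R. This is only an illustration under stated assumptions, not ClassifyR's internal code: the toy data, the variable names, the candidate feature counts and the nearest-centroid classifier are all hypothetical stand-ins, and plain misclassification rate is used in place of a configurable performance metric. The distance itself is taken from the statistic returned by ks.test in the stats package.

```r
# Hypothetical toy data: 6 features (rows) by 20 samples (columns).
set.seed(1)
toyMatrix <- matrix(rnorm(6 * 20, mean = 10), nrow = 6,
                    dimnames = list(paste("Feature", 1:6), NULL))
toyClasses <- factor(rep(c("Poor", "Good"), each = 10))
# Shift one feature in the Poor class so that it clearly separates the classes.
toyMatrix[1, toyClasses == "Poor"] <- toyMatrix[1, toyClasses == "Poor"] + 10

# Kolmogorov-Smirnov distance per feature: the largest gap between the
# empirical distribution functions of the two classes.
ksDistances <- apply(toyMatrix, 1, function(featureValues)
{
  unname(ks.test(featureValues[toyClasses == "Poor"],
                 featureValues[toyClasses == "Good"])$statistic)
})

# Sort features from the largest distance to the smallest.
rankedFeatures <- names(sort(ksDistances, decreasing = TRUE))

# Resubstitution: train and predict on the same samples for each candidate
# number of top-ranked features. A nearest-centroid rule is an assumed
# stand-in for whichever classifier would really be used.
candidateCounts <- c(1, 3, 6)
resubError <- sapply(candidateCounts, function(nFeatures)
{
  subMatrix <- toyMatrix[rankedFeatures[seq_len(nFeatures)], , drop = FALSE]
  # Squared distance of every sample to each class centroid (samples by classes).
  distances <- sapply(levels(toyClasses), function(className)
  {
    centroid <- rowMeans(subMatrix[, toyClasses == className, drop = FALSE])
    colSums((subMatrix - centroid)^2)
  })
  predicted <- levels(toyClasses)[apply(distances, 1, which.min)]
  mean(predicted != as.character(toyClasses)) # Misclassification rate.
})

# Keep the feature count with the lowest resubstitution error (first one on ties).
bestCount <- candidateCounts[which.min(resubError)]
chosenFeatures <- rankedFeatures[seq_len(bestCount)]
```

Because the error is measured on the same samples used for training, it tends to be optimistic; it serves here only to compare candidate feature counts with each other.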
An object of class SelectResult or a list of such objects, if the classifier which was used for determining the specified performance metric made a number of prediction varieties.
Dario Strbenac
# First 20 features have bimodal distribution for Poor class.
# Other 80 features have normal distribution for both classes.
set.seed(1984)
genesMatrix <- sapply(1:25, function(sample)
                            {
                              randomMeans <- sample(c(8, 12), 20, replace = TRUE)
                              c(rnorm(20, randomMeans, 1), rnorm(80, 10, 1))
                            }
                     )
genesMatrix <- cbind(genesMatrix, sapply(1:25, function(sample) rnorm(100, 10, 1)))
rownames(genesMatrix) <- paste("Gene", 1:nrow(genesMatrix))
classes <- factor(rep(c("Poor", "Good"), each = 25))
resubstituteParams <- ResubstituteParams(nFeatures = seq(5, 25, 5),
                                         performanceType = "balanced error",
                                         better = "lower")
selected <- KolmogorovSmirnovSelection(genesMatrix, classes, "Example",
                                       trainParams = TrainParams(naiveBayesKernel),
                                       predictParams = PredictParams(NULL),
                                       resubstituteParams = resubstituteParams)
head(selected@chosenFeatures)
plotFeatureClasses(genesMatrix, classes, "Gene 13", dotBinWidth = 0.25,
                   xAxisLabel = bquote(log[2]*'(expression)'))