distribution | R Documentation |
There are two modes. For aggregating feature selection results, the function counts the number of times each feature was selected in all cross-validations. For aggregating predictive results, the accuracy or C-index for each sample is visualised. This is useful in identifying samples that are difficult to predict well.
result |
An object of class |
... |
Further parameters, such as |
dataType |
Default: |
plotType |
Whether to draw a probability density curve or a histogram. |
summaryType |
If feature selection, whether to summarise as a proportion or count. |
plot |
Whether to draw a plot of the frequency of selection or error rate. |
xMax |
Maximum data value to show in plot. |
fontSizes |
A vector of length 3. The first number is the size of the title. The second number is the size of the axes titles. The third number is the size of the axes values. |
ordering |
Default: |
If dataType
is "features", a vector as long as the number of
features that were chosen at least once containing the number of times the
feature was chosen in cross validations or the proportion of times chosen.
If dataType
is "samples", a vector as long as the number of samples,
containing the cross-validation error rate of the sample. If plot
is
TRUE
, then a plot is also made on the current graphics device.
Dario Strbenac
#if(require(sparsediscrim))
#{
data(asthma)
result <- crossValidate(measurements, classes, nRepeats = 5)
featureDistribution <- distribution(result, "features", summaryType = "count",
plotType = "histogram", binwidth = 1)
print(head(featureDistribution))
#}
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.