Description Arguments Slots Constructors Identifiers Subset and Combine Summaries Plots Accessors See Also
The MLGWAS contains one or more classifiers with information about their global performance, local errors and feature importance. It is meant to simplify machine learning in genome-wide association studies.
x |
|
performances
Global performance measures of classifiers.
First two columns are ClassifierID
s and PartitionID
s.
Other columns are performance measures.
For categorical responses, these are:
sensitivity, specificity,
positive predictive value (PPV), negative predictive value (NPV),
ROC plot AUC and F1 score.
For continuous respones, these are:
root-mean-square deviation (RMSD) and coefficient of determination (r2).
featimp
Variable importance of FeatureID
as estimated by varImp
.
Columns are ClassifierID
, PartitionID
,
FeatureID
and Importance
.
errors
Errors per partition and per sample.
Columns are ClassifierID
, PartitionID
,
SampleID
and Error
.
For categorical variables, values are integers: 1 correspond to a mistake,
0 to a success.
For continuous, values the square of the difference between the prediction
and the actual response.
In both cases, NA means that this sample was not part of the test set.
classifiers
Classifiers.
Coded as a list
of list
s of train
objects.
First level corresponds to ClassifierID
and second level to PartitionID
.
partitions
Train and test set partitions
Coded as a list
of list
s.
First level corresponds to ClassifierID
s and second level to
PartitionID
s.
predictions
Predictions of response(mdt)
for test set.
Coded as a list
of list
s.
First level corresponds to ClassifierID
s and second level to
PartitionID
s.
call
List of calls. Names of list are ClassifierIDs
trainClassifier
trains a classifier,
using an MDT
object,
that that predicts response(mdt)
affection using mtable(mdt)
or
phenotype(mdt)
or both.
trainClassifiers
allows to call trainClassifier
using different combination of parameters specified as a data.frame
.
Four identifiers are present in this class:
ClassifierID
as specified by the id
argument
in trainClassifier
.
PartitionID
as specified by the names
of the
output of the partition
argument in trainClassifier
.
FeatureID
as specified by the FeatureID
s
of the MDT
object.
SampleID
as specified by the Sample
s
of the MDT
object.
x[ClassifierIDs]
subsets MLGWAS
objects. Must be a character
.
bindML(...)
combines MLGWAS
objects.
The following functions summarize results throughout partitions:
summaryPerf
summarizes performance per classifier.
summaryFeatimp
summarizes feature importance per classifier.
summaryErrors
summarizes errors per classifier and per sample.
plotPerf
plots performance as boxplots.
plotFeatimp
plots feature importance as boxplots.
plotErrors
plots sample error as boxplots.
manhattanPlot
creates a Manhattan plot using the variable
importance of features as estimated by varImp
instead
of -log10 pvalues.
performances(x)
and performances(x) <- value
gets or sets
performances
.
featimp(x)
and featimp(x) <- value
gets or sets
featimp
.
errors(x)
and errors(x) <- value
gets or sets
errors
.
classifiers(x)
and classifiers(x) <- value
gets or sets
classifiers
.
partitions(x)
and partitions(x) <- value
gets or sets
partitions
.
predictions(x)
and predictions(x) <- value
gets or sets
predictions
.
classifierNames(x)
and classifierNames(x) <- value
gets or sets
ClassifiersID
s.
partitionNames(x)
and partitionNames(x) <- value
get or sets PartitionID
s.
features(x)
and features(x) <- value
get or sets
FeatureID
.
samples(x)
and samples(x) <- value
get or sets
SampleID
s
trainClassifier
trainClassifiers
MDT
train
data.table
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.