View source: R/fit_predict_smlc.R
| predict_smlc | R Documentation |
Prediction based on the hidden genome sparse multinomial logistic classifier
predict_smlc(
fit,
Xnew,
Ynew = NULL,
return_lin_pred = FALSE,
normalize_rows = NULL,
...
)
predict_mlogit(
fit,
Xnew,
Ynew = NULL,
return_lin_pred = FALSE,
normalize_rows = NULL,
...
)
fit |
fitted hidden genome mlogit classifier, an output of fit_smlc. |
Xnew |
test data design (or meta-design) matrix (observations across rows and variables predictors/features across columns) for which predictions are to be made from a fitted model. For a typical hidden genome classifier this will be a matrix whose rows correspond to the test set tumors, and columns correspond to (normalized by some functions of the total mutation burdens in tumors) binary 1-0 presence/absence of raw variants, counts of mutations at specific genes and counts of mutations corresponding to specific mutation signatures etc. |
Ynew |
the actual cancer categories for the test samples. This is not used in computation, but is return as a component in the output, for possibly easier post-processing. |
normalize_rows |
vector of the same length as |
a list with entries (a) probs_predicted:
a ncol(Xnew) by n_cancer (determined from fit)
matrix of multinomial probabilities, providing
the predicted probability of each sample unit in Xnew
being classified into each cancer site,
and (b) predicted : a character vector listing
hard classes based on the predicted multinomial
probabilities (obtained by assigning individuals to
the classes with the highest predicted probabilities), and
optionally, (c) observed: if Ynew is supplied, then it
is returned as is.
Predictors in Xnew that are not present in the
training set design matrix (stored in fit) are dropped, and predictors
not included in Xnew but present in training set design matrix are
all assumed to have zero values. This is convenient for a typical
hidden genome classifier where most predictors are (some normalized versions
of) counts (e.g. for gene and mutation signatures) or
binary presence/absence indicators (e.g., for raw variants) so that a zero
predictor value essentially indicates some form of "absence".
However, care must be taken for predictors whose 0 values
do not indicate absence.
fit_mlogit
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.