CaDrA | R Documentation |
Perform permutation-based testings on a sample of permuted input scores
using candidate_search
as the main iterative function for each run.
CaDrA(
FS,
input_score,
method = c("ks_pval", "ks_score", "wilcox_pval", "wilcox_score", "revealer", "custom"),
method_alternative = c("less", "greater", "two.sided"),
custom_function = NULL,
custom_parameters = NULL,
weights = NULL,
search_start = NULL,
top_N = 1,
search_method = c("both", "forward"),
max_size = 7,
n_perm = 1000,
perm_alternative = c("one.sided", "two.sided"),
obs_best_score = NULL,
smooth = TRUE,
plot = FALSE,
ncores = 1,
cache = FALSE,
cache_path = NULL,
verbose = FALSE
)
FS |
a matrix of binary features or a SummarizedExperiment class object from SummarizedExperiment package where rows represent features of interest (e.g. genes, transcripts, exons, etc...) and columns represent the samples. The assay of FS contains binary (1/0) values indicating the presence/absence of omics features. |
input_score |
a vector of continuous scores representing a phenotypic readout of interest such as protein expression, pathway activity, etc. NOTE: |
method |
a character string specifies a scoring method that is
used in the search. There are 6 options: ( |
method_alternative |
a character string specifies an alternative
hypothesis testing ( NOTE: This argument only applies to |
custom_function |
If method is NOTE: |
custom_parameters |
If method is |
weights |
If method is NOTE: |
search_start |
a vector of character strings (separated by commas)
specifies feature names in the FS object to start the search with.
If |
top_N |
an integer specifies the number of features to start the search over. By default, it starts with the feature that has the highest best score (top_N = 1). NOTE: If |
search_method |
a character string specifies an algorithm to filter out
the best candidates ( |
max_size |
an integer specifies a maximum size that a meta-feature can
extend to do for a given search. Default is |
n_perm |
an integer specifies the number of permutations to perform.
Default is |
perm_alternative |
an alternative hypothesis type for calculating
permutation-based p-value. Options: one.sided, two.sided. Default is
|
obs_best_score |
a numeric value corresponding to the best observed
score. This value is used to compare against the |
smooth |
a logical value indicates whether or not to add a smoothing
factor of 1 to the calculation of permutation-based p-value. This option is
used to avoid a returned p-value of 0. Default is |
plot |
a logical value indicates whether or not to plot the empirical
null distribution of the permuted best scores. Default is |
ncores |
an integer specifies the number of cores to perform
parallelization for permutation-based testing. Default is |
cache |
a logical value determines whether or not to cache the
permuted best scores. This helps to save time for future loading instead
of re-computing the permutation-based testing every time.
Default is |
cache_path |
a path to cache permuted best scores. Default is |
verbose |
a logical value indicates whether or not to print the
diagnostic messages. Default is |
a list of 4 objects: key
, perm_best_scores
,
obs_best_score
, perm_pval
-key
: a list of parameters that was used to cache the
results of the permutation-based testing. This is useful as the
permuted best scores can be recycled to save time for future loading.
-perm_best_scores
: a vector of permuted best scores obtained
by performing candidate_search
over n_perm
iterations of
permuted input scores.
-obs_best_score
: a user-provided best score or an observed best score
obtained by performing candidate_search
on a given dataset and input
parameters. This value is later used to compare against the permuted best
scores (perm_best_scores
).
perm_pval
: a permutation-based p-value obtained by calculating
sum(perm_best_scores > obs_best_score)/n_perm
NOTE: If smooth = TRUE, a smoothing factor of 1 will be added to the
calculation of perm_pval
.
e.g. (sum(perm_best_scores > obs_best_score) + 1) / (n_perm + c)
This is just to not return a p-value of 0
# Load pre-computed feature set
data(sim_FS)
# Load pre-computed input-score
data(sim_Scores)
# Set seed for permutation
set.seed(21)
# Define additional parameters and start the function
cadra_result <- CaDrA(
FS = sim_FS, input_score = sim_Scores, method = "ks_pval",
weights = NULL, method_alternative = "less", top_N = 1,
search_start = NULL, search_method = "both", max_size = 7,
n_perm = 10, perm_alternative = "one.sided", plot = FALSE,
smooth = TRUE, obs_best_score = NULL,
ncores = 1, cache = FALSE, cache_path = NULL
)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.