Description Usage Arguments Details Value References See Also Examples
Implements the SAFFRON algorithm for asynchronous online testing, as presented by Zrnic et al. (2018).
1 2 3 4 5 6 7 8 9 10 11 | SAFFRONstar(
d,
alpha = 0.05,
version,
gammai,
w0,
lambda = 0.5,
batch.sizes,
discard = FALSE,
tau.discard = 0.5
)
|
d |
Either a vector of p-values, or a dataframe with three columns: an identifier (‘id’), p-value (‘pval’), and either ‘decision.times’, or ‘lags’, depending on which version you're using. See version for more details. |
alpha |
Overall significance level of the procedure, the default is 0.05. |
version |
Takes values 'async', 'dep' or 'batch'. This specifies the
version of SAFFRONstar to use. |
gammai |
Optional vector of γ_i. A default is provided with γ_j proportional to 1/j^(1.6). |
w0 |
Initial ‘wealth’ of the procedure, defaults to α/10. |
lambda |
Optional threshold for a ‘candidate’ hypothesis, must be between 0 and 1. Defaults to 0.5. |
batch.sizes |
A vector of batch sizes, this is required for
|
discard |
Logical. If |
tau.discard |
Optional threshold for hypotheses to be selected for
testing. Must be between 0 and 1, defaults to 0.5. This is required if
|
The function takes as its input either a vector of p-values, or a dataframe with three columns: an identifier (‘id’), p-value (‘pval’), or a column describing the conflict sets for the hypotheses. This takes the form of a vector of decision times or lags. Batch sizes can be specified as a separate argument (see below).
Zrnic et al. (2018) present explicit three versions of SAFFRONstar:
1) version='async'
is for an asynchronous testing process, consisting
of tests that start and finish at (potentially) random times. The discretised
finish times of the test correspond to the decision times. These decision
times are given as the input decision.times
for this version of the
SAFFRONstar algorithm. For this version of SAFFRONstar, Tian and Ramdas
(2019) presented an algorithm that can improve the power of the procedure in
the presence of conservative nulls by adaptively ‘discarding’ these p-values.
This can be called by setting the option discard=TRUE
.
2) version='dep'
is for online testing under local dependence of the
p-values. More precisely, for any t>0 we allow the p-value p_t to
have arbitrary dependence on the previous L_t p-values. The fixed
sequence L_t is referred to as ‘lags’, and is given as the input
lags
for this version of the SAFFRONstar algorithm.
3) version='batch'
is for controlling the mFDR in mini-batch testing,
where a mini-batch represents a grouping of tests run asynchronously which
result in dependent p-values. Once a mini-batch of tests is fully completed,
a new one can start, testing hypotheses independent of the previous batch.
The batch sizes are given as the input batch.sizes
for this version of
the SAFFRONstar algorithm.
Given an overall significance level α, SAFFRONstar depends on constants w_0 and λ, where w_0 satisfies 0 ≤ w_0 ≤ (1 - λ)α and represents the intial ‘wealth’ of the procedure, and 0 < λ < 1 represents the threshold for a ‘candidate’ hypothesis. A ‘candidate’ refers to p-values smaller than λ, since SAFFRONstar will never reject a p-value larger than λ. The algorithms also require a sequence of non-negative non-increasing numbers γ_i that sum to 1.
Note that these SAFFRONstar algorithms control the modified FDR (mFDR). The ‘async’ version also controls the usual FDR if the p-values are assumed to be independent.
Further details of the SAFFRONstar algorithms can be found in Zrnic et al. (2018).
d.out |
A dataframe with the original p-values |
Zrnic, T., Ramdas, A. and Jordan, M.I. (2018). Asynchronous Online Testing of Multiple Hypotheses. arXiv preprint, https://arxiv.org/abs/1812.05068.
Tian, J. and Ramdas, A. (2019). ADDIS: an adaptive discarding algorithm for online FDR control with conservative nulls. arXiv preprint, https://arxiv.org/abs/1905.11465.
SAFFRON
presents versions of SAFFRON for synchronous
p-values, i.e. where each test can only start when the previous test has
finished.
If version='async'
and discard=TRUE
, then SAFFRONstar is
identical to ADDIS
with option async=TRUE
.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 | sample.df <- data.frame(
id = c('A15432', 'B90969', 'C18705', 'B49731', 'E99902',
'C38292', 'A30619', 'D46627', 'E29198', 'A41418',
'D51456', 'C88669', 'E03673', 'A63155', 'B66033'),
pval = c(2.90e-08, 0.06743, 0.01514, 0.08174, 0.00171,
3.60e-05, 0.79149, 0.27201, 0.28295, 7.59e-08,
0.69274, 0.30443, 0.00136, 0.72342, 0.54757),
decision.times = seq_len(15) + 1)
SAFFRONstar(sample.df, version='async')
sample.df2 <- data.frame(
id = c('A15432', 'B90969', 'C18705', 'B49731', 'E99902',
'C38292', 'A30619', 'D46627', 'E29198', 'A41418',
'D51456', 'C88669', 'E03673', 'A63155', 'B66033'),
pval = c(2.90e-08, 0.06743, 0.01514, 0.08174, 0.00171,
3.60e-05, 0.79149, 0.27201, 0.28295, 7.59e-08,
0.69274, 0.30443, 0.00136, 0.72342, 0.54757),
lags = rep(1,15))
SAFFRONstar(sample.df2, version='dep')
sample.df3 <- data.frame(
id = c('A15432', 'B90969', 'C18705', 'B49731', 'E99902',
'C38292', 'A30619', 'D46627', 'E29198', 'A41418',
'D51456', 'C88669', 'E03673', 'A63155', 'B66033'),
pval = c(2.90e-08, 0.06743, 0.01514, 0.08174, 0.00171,
3.60e-05, 0.79149, 0.27201, 0.28295, 7.59e-08,
0.69274, 0.30443, 0.00136, 0.72342, 0.54757))
SAFFRONstar(sample.df3, version='batch', batch.sizes = c(4,6,5))
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.