sumPvals | R Documentation |
This function uses permutation p-values to determine confidence bounds for the number of true discoveries, the true discovery proportion and the false discovery proportion within a set of interest. The bounds are simultaneous over all sets, and remain valid under post-hoc selection.
sumPvals(G, S = NULL, alpha = 0.05, truncFrom = NULL, truncTo = 0.5,
type = "vovk.wang", r = 0, nMax = 50)
G |
numeric matrix of p-values, where columns correspond to variables, and rows to data transformations (e.g. permutations). The first transformation is the identity. |
S |
vector of indices for the variables of interest (if not specified, all variables). |
alpha |
significance level. |
truncFrom |
truncation parameter: values greater than |
truncTo |
truncation parameter: truncated values are set to |
type |
p-value combination among |
r |
parameter for Vovk and Wang's p-value combination. |
nMax |
maximum number of iterations. |
A p-value p
is transformed as following.
Edgington: p
(Edgington, 1972)
Fisher: -2log(p)
(Fisher, 1925)
Pearson: 2log(1-p)
(Pearson, 1933)
Liptak: qnorm(1-p)
(Liptak, 1958; Stouffer et al., 1949)
Cauchy: tan[(0.5-p)pi]
with pi=3.142
(Liu and Xie, 2020)
Harmonic mean: 1/p
(Wilson, 2019)
Vovk and Wang: p^r
(log(p)
for r
=0) (Vovk and Wang, 2020)
An error message is returned if the transformation produces infinite values.
For Vovk and Wang, r=0
corresponds to Fisher, and r=-1
to the harmonic mean.
Truncation parameters should be such that truncTo
is not smaller than truncFrom
.
As Pearson's and Liptak's transformations produce infinite values in 1, for such methods
truncTo
should be strictly smaller than 1.
The significance level alpha
should be in the interval [1/B
, 1), where
B
is the number of data transformations (rows in G
).
sumPvals
returns an object of class sumObj
, containing
total
: total number of variables (columns in G
)
size
: size of S
alpha
: significance level
TD
: lower (1-alpha
)-confidence bound for the number of true discoveries in S
maxTD
: maximum value of TD
that could be found under convergence of the algorithm
iterations
: number of iterations of the algorithm
Anna Vesely.
Goeman J. J. and Solari A. (2011). Multiple testing for exploratory research. Statistical Science, doi: 10.1214/11-STS356.
Vesely A., Finos L., and Goeman J. J. (2023). Permutation-based true discovery guarantee by sum tests. Journal of the Royal Statistical Society, Series B (Statistical Methodology), doi: 10.1093/jrsssb/qkad019.
True discovery guarantee using generic statistics: sumStats
Access a sumObj
object: discoveries
, tdp
, fdp
# generate matrix of p-values for 5 variables and 10 permutations
G <- simData(prop = 0.6, m = 5, B = 10, alpha = 0.4, seed = 42)
# subset of interest (variables 1 and 2)
S <- c(1,2)
# create object of class sumObj
# combination: harmonic mean (Vovk and Wang with r = -1)
res <- sumPvals(G, S, alpha = 0.4, r = -1)
res
summary(res)
# lower confidence bound for the number of true discoveries in S
discoveries(res)
# lower confidence bound for the true discovery proportion in S
tdp(res)
# upper confidence bound for the false discovery proportion in S
fdp(res)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.