Description Usage Arguments Details Value Warning Note Author(s) References See Also Examples
After fitting an object of class 'mdr'
, performs a permutation test to assess the statistical significance of the balanced accuracy evaluation measure of the 'best model'.
1 |
accuracy |
the accuracy measure reported from the MDR model fit (after fitting |
loci |
the identified loci from the MDR model fit with |
N.permute |
the number of data permutations to perform |
method |
internal validation method used to fit the model: "CV" for |
data |
dataset used to fit the MDR model; first column is the binary response vector and subsequent columns are numeric SNP data |
cv |
if method="CV", the number of cross-validation intervals |
K |
the maximum size of interaction to consider |
x |
if method="3WS", the number of models to save from the training set to be evaluated in the testing set; if NULL, default is number of total loci |
proportion |
if method="3WS", a vector with the ratio of data for training:testing:validation sets; if NULL, default is c(2,2,1) |
ratio |
case/control ratio threshold to ascribe high-risk/low-risk status of a genotype combination; if NULL, default is the ratio of cases to controls in the whole dataset |
equal |
how to treat genotype combinations with case/control ratio equal to the threshold; if NULL, default is "HR" for high-risk, but can also consider "LR" for low-risk |
genotype |
a numeric vector of possible genotypes arising in |
LRT |
a logical indicating if a likelihood ratio test for significant interaction should be performed |
Obtains permuted datasets by permuting the response vector only, in order to preserve the LD structure within the genetic data.
Returns a list with:
Permutation P-value |
the empirical p-value based on the permutation distribution; i.e. the proportion of permutations with balanced accuracy > |
Permutation Distribution |
a vector with the top balanced accuracies from all |
LRT P-value |
if LRT=TRUE, the empirical p-value for a test of interaction based on the LRT distribution |
LRT Distribution |
if LRT=TRUE, a vector with p-values for the LRT test of interaction from all |
...
MDR is a combinatorial search approach, so considering high-order interactions and a large number of permutations can be computationally expensive.
When using permute.mdr
in conjunction with mdr.cv
and mdr.3WS
, the full internal validation and selection procedure is repeated for each permutation. For mdr
, permutation is only consider for the specified variable combination, so internal validation or selection are not performed within each permutation.
Stacey Winham
Ritchie MD et al (2001). Multifactor-dimensionality reduction reveals high-order interactions among estrogen-metabolism genes in sporadic breast cancer. Am J Hm Genet 69(1): 138-147.
Hahn LW, Ritchie MD, Moore JH (2003). Multifactor dimensionality reduction software for detecting gene-gene and gene-environment interactions. Bioinformatics 19(3):376-82.
Velez DR et al (2007). A balanced accuracy function for epistasis modeling in imbalanced datasets using multifactor dimensionality reduction. Genet Epidemiol 31(4): 306-315.
Motsinger-Reif AA (2008). The effect of alternative permutation testing strategies on the performance of multifactor dimensionality reduction. BMC Research Notes 1:139.
Edwards TL et al (2010). A General Framework for Formal Tests of Interaction after Exhaustive Search Methods with Applications to MDR and MDR-PDT. PLoS One 5(2).
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 | #load data
data(mdr1)
#fit an mdr object to a subset of the sample data
fit<-mdr.3WS(data=mdr1[,1:11],K=2)
####save the accuracy
acc<-fit$'final model accuracy'
###save the final model loci
loc<-fit$'final model'
####run permutation test on 10 permutations
perm<-permute.mdr(accuracy=acc, loci=loc, N.permute=10, method="3WS",data=mdr1[,1:11], K=2, LRT=TRUE)
###empirical p-value
perm$'Permutation P-value'
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.