Description Usage Arguments Details Value Warning Note Author(s) References See Also Examples
After fitting an object of class 'mdr', performs a permutation test to assess the statistical significance of the balanced accuracy evaluation measure of the 'best model'.
1 |
accuracy |
the accuracy measure reported from the MDR model fit (after fitting |
loci |
the identified loci from the MDR model fit with |
N.permute |
the number of data permutations to perform |
method |
internal validation method used to fit the model: "CV" for |
data |
dataset used to fit the MDR model; first column is the binary response vector and subsequent columns are numeric SNP data |
cv |
if method="CV", the number of cross-validation intervals |
K |
the maximum size of interaction to consider |
x |
if method="3WS", the number of models to save from the training set to be evaluated in the testing set; if NULL, default is number of total loci |
proportion |
if method="3WS", a vector with the ratio of data for training:testing:validation sets; if NULL, default is c(2,2,1) |
ratio |
case/control ratio threshold to ascribe high-risk/low-risk status of a genotype combination; if NULL, default is the ratio of cases to controls in the whole dataset |
equal |
how to treat genotype combinations with case/control ratio equal to the threshold; if NULL, default is "HR" for high-risk, but can also consider "LR" for low-risk |
genotype |
a numeric vector of possible genotypes arising in |
LRT |
a logical indicating if a likelihood ratio test for significant interaction should be performed |
Obtains permuted datasets by permuting the response vector only, in order to preserve the LD structure within the genetic data.
Returns a list with:
Permutation P-value |
the empirical p-value based on the permutation distribution; i.e. the proportion of permutations with balanced accuracy > |
Permutation Distribution |
a vector with the top balanced accuracies from all |
LRT P-value |
if LRT=TRUE, the empirical p-value for a test of interaction based on the LRT distribution |
LRT Distribution |
if LRT=TRUE, a vector with p-values for the LRT test of interaction from all |
...
MDR is a combinatorial search approach, so considering high-order interactions and a large number of permutations can be computationally expensive.
When using permute.mdr in conjunction with mdr.cv and mdr.3WS, the full internal validation and selection procedure is repeated for each permutation. For mdr, permutation is only consider for the specified variable combination, so internal validation or selection are not performed within each permutation.
Stacey Winham
Ritchie MD et al (2001). Multifactor-dimensionality reduction reveals high-order interactions among estrogen-metabolism genes in sporadic breast cancer. Am J Hm Genet 69(1): 138-147.
Hahn LW, Ritchie MD, Moore JH (2003). Multifactor dimensionality reduction software for detecting gene-gene and gene-environment interactions. Bioinformatics 19(3):376-82.
Velez DR et al (2007). A balanced accuracy function for epistasis modeling in imbalanced datasets using multifactor dimensionality reduction. Genet Epidemiol 31(4): 306-315.
Motsinger-Reif AA (2008). The effect of alternative permutation testing strategies on the performance of multifactor dimensionality reduction. BMC Research Notes 1:139.
Edwards TL et al (2010). A General Framework for Formal Tests of Interaction after Exhaustive Search Methods with Applications to MDR and MDR-PDT. PLoS One 5(2).
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 | #load data
data(mdr1)
#fit an mdr object to a subset of the sample data
fit<-mdr.3WS(data=mdr1[,1:11],K=2)
####save the accuracy
acc<-fit$'final model accuracy'
###save the final model loci
loc<-fit$'final model'
####run permutation test on 10 permutations
perm<-permute.mdr(accuracy=acc, loci=loc, N.permute=10, method="3WS",data=mdr1[,1:11], K=2, LRT=TRUE)
###empirical p-value
perm$'Permutation P-value'
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.