vim.logicFS: Importance Measures

Description Usage Arguments Value Author(s) References See Also

Description

Computes the value of the single or the multiple tree measure, respectively, for each prime implicant contained in a logic bagging model to specify the importance of the prime implicant for classification, if the response is binary. If the response is quantitative, the importance is specified by a measure based on the log2-transformed mean square prediction error. If the response is a time to an event, performance measures for time-to-event models are employed to determine the importance measures.

Usage

1
2
3
4
vim.logicFS(log.out, neighbor = NULL, adjusted = FALSE, useN = TRUE, 
   onlyRemove = FALSE, prob.case = 0.5, addInfo = FALSE, 
	 score = c("DPO", "Conc", "Brier", "PL"), ensemble = FALSE, 
	 addMatImp = TRUE)

Arguments

log.out

an object of class logicBagg, i.e.\ the output of logic.bagging.

neighbor

a list consisting of character vectors specifying SNPs that are in LD. If specified, all SNPs need to occur exactly one time in this list. If specified, the importance measures are adjusted for LD by considering the SNPs within a LD block as exchangable.

adjusted

logical specifying whether the measures should be adjusted for noise. Often, the interaction actually associated with the response is not exactly found in some iterations of logic bagging, but an interaction is identified that additionally contains one (or seldomly more) noise SNPs. If adjusted is set to TRUE, the values of the importance measure is corrected for this behaviour.

useN

logical specifying if the number of correctly classified out-of-bag observations should be used in the computation of the importance measure. If FALSE, the proportion of correctly classified oob observations is used instead. Ignored in the survival case.

onlyRemove

should in the single tree case the multiple tree measure be used? If TRUE, the prime implicants are only removed from the trees when determining the importance in the single tree case. If FALSE, the original single tree measure is computed for each prime implicant, i.e.\ a prime implicant is not only removed from the trees in which it is contained, but also added to the trees that do not contain this interaction. Ignored in all other than the classification case.

prob.case

a numeric value between 0 and 1. If the logistic regression approach of logic regression is used (i.e.\ if the response is binary, and in logic.bagging ntrees is set to a value larger than 1, or glm.if.1tree is set to TRUE), then an observation will be classified as a case (or more exactly as 1), if the class probability of this observation estimated by the logic bagging model is larger than prob.case.

addInfo

should further information on the logic regression models be added?

score

a character string naming the score that should be used in the computation of the importance measure for a survival time analysis. By default, the distance between predicted outcomes (score = "DPO") proposed by Tietz et al.\ (2018) is used in the determination of the importance of the variables. Alternatively, Harrell's C-Index ("Conc"), the Brier score ("Brier"), or the predictive partial log-likelihood ("PL") can be used.

ensemble

in the case of a survival outcome, should ensemble importance measures (as, e.g., in randomSurvivalSRC be used? If FALSE, importance measures analogous to the ones in the logicFS analysis of other outcomes are used (see Tietz et al., 2018).

addMatImp

should the matrix containing the improvements due to the prime implicants in each of the iterations be added to the output? (For each of the prime implicants, the importance is computed by the average over the B improvements.) Must be set to TRUE, if standardized importances should be computed using vim.norm, or if permutation based importances should be computed using vim.signperm. If ensemble = TRUE and addMatImp = TRUE in the survival case, the respective score of the full model is added to the output instead of an improvement matrix.

Value

An object of class logicFS containing

primes

the prime implicants,

vim

the importance of the prime implicants,

prop

the proportion of logic regression models containing the prime implicants (or the neighbors of the prime implicants, if neighbor != NULL; or the extended primes of the prime implicants, if adjusted = TRUE; or the extended primes of the neighbors of the prime implicants, if neighbor != NULL and adjusted = TRUE),

type

the type of model (1: classification, 2: linear regression, 3: logistic regression, 4: Cox regression),

param

further parameters (if addInfo = TRUE),

mat.imp

either the matrix containing the improvements if addMatImp = TRUE and ensemble = FALSE, or the respective score of the full model if addMatImp = TRUE and ensemble = TRUE, or NULL if addMatImp = FALSE,

measure

the name of the used importance measure,

neighbor

neighbor,

useN

the value of useN,

threshold

NULL,

mu

NULL.

Author(s)

Holger Schwender, holger.schwender@hhu.de; Tobias Tietz, tobias.tietz@hhu.de

References

Schwender, H., Ickstadt, K. (2007). Identification of SNP Interactions Using Logic Regression. Biostatistics, 9(1), 187-198.

Tietz, T., Selinski, S., Golka, K., Hengstler, J.G., Gripp, S., Ickstadt, K., Ruczinski, I., Schwender, H. (2018). Identification of Interactions of Binary Variables Associated with Survival Time Using survivalFS. Submitted.

See Also

logic.bagging, logicFS, vim.norm, vim.signperm


holgerschw/logicFS documentation built on April 15, 2020, 10:42 p.m.