extree_control: Control for extensible trees

Description Usage Arguments Details Value

Description

Various parameters that control aspects of the ‘extree’ fit.

Usage

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
extree_control(
    criterion, 
    logmincriterion, 
    minsplit = 20L,
    minbucket = 7L, 
    minprob = 0.01, 
    nmax = Inf,
    stump = FALSE,
    lookahead = FALSE, 
    maxsurrogate = 0L, 
    numsurrogate = FALSE,
    mtry = Inf,
    maxdepth = Inf, 
    multiway = FALSE, 
    splittry = 2L,
    majority = FALSE, 
    caseweights = TRUE, 
    applyfun = NULL, 
    cores = NULL,
    saveinfo = TRUE,
    bonferroni = FALSE,
    update = NULL,
    selectfun, 
    splitfun, 
    svselectfun,
    svsplitfun 
)

Arguments

criterion

a character specifying the criterion used for variable and split point selection. Either "p.value" or "statistic".

logmincriterion

the value of the test statistic or 1 - p-value that must be exceeded in order to implement a split on the log-scale.

minsplit

the minimum sum of weights in a node in order to be considered for splitting.

minbucket

the minimum sum of weights in a terminal node.

minprob

proportion of observations needed to establish a terminal node.

nmax
stump

a logical determining whether a stump (a tree with a maximum of three nodes only) is to be computed.

lookahead

a logical determining whether a split is implemented only after checking if trafo in both daughter nodes converges.

maxsurrogate

number of surrogate splits to evaluate.

numsurrogate

a logical for backward-compatibility with party. If TRUE, only at least ordered variables are considered for surrogate splits.

mtry

number of input variables randomly sampled as candidates at each node for random forest like algorithms. The default mtry = Inf means that no random selection takes place.

maxdepth

maximum depth of the tree. The default maxdepth = Inf means that no restrictions are applied to tree sizes.

multiway

CURRENTLY IGNORED. A logical indicating if multiway splits for all factor levels are implemented for unordered factors.

splittry

number of variables that are inspected for admissible splits if the best split doesn't meet the sample size constraints.

majority

if FALSE, observations which can't be classified to a daughter node because of missing information are randomly assigned (following the node distribution). If TRUE, they go with the majority.

caseweights

a logical interpreting weights as case weights.

applyfun

an optional lapply-style function with arguments function(X, FUN, ...). It is used for computing the variable selection criterion. The default is to use the basic lapply function unless the cores argument is specified (see below).

cores

numeric. If set to an integer the applyfun is set to mclapply with the desired number of cores.

saveinfo

logical. Store information about variable selection procedure in info slot of each partynode.

update

logical. If TRUE, the data transformation is updated in every node.

bonferroni

logical. Should p values be Bonferroni corrected?

selectfun

a function or list of functions with arguments model, trafo, data, subset, weights, j, split_only, control. Each list element must have a unique name according to the variable type (?).

splitfun

a function or list of functions with arguments model, trafo, data, subset, weights, whichvar, ctrl. Each list element must have a unique name according to the variable type (?).

svselectfun

an optional function or list of functions for selecting surrogate variables.

svsplitfun

an optional function or list of functions for selecting surrogate splits.

Details

The arguments splitfun determines how to selected a splitting variable for a node. It returns a list of p-values and test statistics per variable. The variables are sorted by the p-value or test statistic (according to criterion) and their sorted indices are the input for the selectfun in form of the argument whichvar.

For details on the other arguments, see ctree_control. For an example, see extree.

Value

A list.


partykitx documentation built on Sept. 3, 2020, 3:01 p.m.