
BART algorithm implemented in C++, but without predict() support.

```
SL.dbarts(Y, X, newX, family, obsWeights, id, sigest = NA, sigdf = 3,
  sigquant = 0.9, k = 2, power = 2, base = 0.95, binaryOffset = 0,
  ntree = 200, ndpost = 1000, nskip = 100, printevery = 100,
  keepevery = 1, keeptrainfits = TRUE, usequants = FALSE, numcut = 100,
  printcutoffs = 0, nthread = 1, keepcall = TRUE, verbose = FALSE, ...)
```

`Y` |
Outcome variable |

`X` |
Covariate dataframe |

`newX` |
Optional dataframe of covariates for which to predict the outcome. dbarts does not support predict(), so any predictions must be obtained by passing newX at model training time. |

`family` |
"gaussian" for regression, "binomial" for binary classification. |

`obsWeights` |
Optional observation-level weights. |

`id` |
Optional id to group observations from the same unit (not used currently). |

`sigest` |
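For continuous response models, an estimate of the error variance, sigma^2, used to calibrate the prior on that parameter (see sigquant). Not applicable when y is binary. |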

`sigdf` |
Degrees of freedom for error variance prior. Not applicable when y is binary. |

`sigquant` |
The quantile of the error variance prior that the rough
estimate (sigest) is placed at. The closer the quantile is to 1, the more
aggressive the fit will be, as you are putting more prior weight on error
standard deviations (sigma) less than the rough estimate. |

`k` |
For numeric y, k is the number of prior standard deviations E(Y|x) = f(x) is away from +/- 0.5. The response (Y) is internally scaled to range from -0.5 to 0.5. For binary y, k is the number of prior standard deviations f(x) is away from +/- 3. In both cases, the bigger k is, the more conservative the fitting will be. |

`power` |
Power parameter for tree prior. |

`base` |
Base parameter for tree prior. |

`binaryOffset` |
Used for binary y. When present, the model is P(Y = 1 | x) = Phi(f(x) + binaryOffset), where Phi is the standard normal CDF. |

`ntree` |
The number of trees in the sum-of-trees formulation. |

`ndpost` |
The number of posterior draws after burn-in; ndpost / keepevery draws will actually be returned. |

`nskip` |
Number of MCMC iterations to be treated as burn in. |

`printevery` |
As the MCMC runs, a message is printed every printevery draws. |

`keepevery` |
Every keepevery-th draw is kept and returned to the user. Useful for "thinning" samples. |

`keeptrainfits` |
If TRUE the draws of f(x) for x corresponding to the rows of x.train are returned. |

`usequants` |
When TRUE, determine tree decision rules using estimated quantiles derived from the x.train variables. When FALSE, splits are determined using values equally spaced across the range of a variable. See details for more information. |

`numcut` |
The maximum number of possible values used in decision rules (see usequants, details). If a single number, it is recycled for all variables; otherwise must be a vector of length equal to ncol(x.train). Fewer rules may be used if a covariate lacks enough unique values. |

`printcutoffs` |
The number of cutoff rules to print to the screen before the MCMC is run. Given a single integer, the same value will be used for all variables. If 0, nothing is printed. |

`nthread` |
Integer specifying how many threads to use for rudimentary calculations such as means/variances. Depending on the CPU architecture, using more than one can degrade performance for small/medium data sets. As such, some calculations may be executed single-threaded regardless. |

`keepcall` |
Logical; if FALSE, returned object will have call set to call("NULL"), otherwise the call used to instantiate BART. |

`verbose` |
If TRUE, output additional information during training. |

`...` |
Any remaining arguments (unused) |
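
The difference between the two `usequants` settings described above can be illustrated in base R. This is a rough sketch only: the helper `candidate_cuts` is hypothetical and not part of dbarts or SuperLearner, and the exact cut-point rule used inside dbarts may differ (for example, in how endpoints and ties are handled).

```r
# Hypothetical helper for illustration; not part of dbarts or SuperLearner.
candidate_cuts = function(x, numcut, usequants = FALSE) {
  if (usequants) {
    # Cut points at evenly spaced empirical quantiles of x.
    probs = seq_len(numcut) / (numcut + 1)
    unique(quantile(x, probs = probs, names = FALSE))
  } else {
    # Cut points equally spaced across the range of x (endpoints dropped).
    grid = seq(min(x), max(x), length.out = numcut + 2)
    grid[-c(1, numcut + 2)]
  }
}

x = c(1, 2, 3, 4, 100)  # skewed covariate with one large value
candidate_cuts(x, numcut = 3, usequants = FALSE)  # 25.75 50.50 75.25
candidate_cuts(x, numcut = 3, usequants = TRUE)   # 2 3 4
```

For this skewed covariate, the equally spaced cuts leave four of the five observations on the same side of every split, while the quantile-based cuts fall among the data. The `unique()` call means fewer than `numcut` cuts are returned when a covariate has few distinct values, in the spirit of the `numcut` description.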

Chipman, H. A., George, E. I., & McCulloch, R. E. (2010). BART: Bayesian additive regression trees. The Annals of Applied Statistics, 4(1), 266-298. doi: 10.1214/09-AOAS285 (URL: http://doi.org/10.1214/09-AOAS285).

```
library(SuperLearner)
data(Boston, package = "MASS")
Y = Boston$medv
# Remove outcome from covariate dataframe.
X = Boston[, -14]
set.seed(1)
# Sample rows to speed up example.
row_subset = sample(nrow(X), 30)
sl = SuperLearner(Y[row_subset], X[row_subset, ], family = gaussian(),
                  cvControl = list(V = 2), SL.library = c("SL.mean", "SL.dbarts"))
print(sl)
```
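
Because dbarts lacks predict() support, predictions for held-out rows have to be requested at fit time through `newX`. The following is a minimal sketch of that pattern, assuming the SuperLearner, dbarts, and MASS packages are installed; the train/test split and the names `train_rows` and `test_rows` are illustrative choices, not part of the original example.

```r
library(SuperLearner)

data(Boston, package = "MASS")
Y = Boston$medv
# Remove outcome from covariate dataframe.
X = Boston[, -14]

set.seed(1)
# Illustrative split: 30 rows for training, 10 different rows for prediction.
sampled = sample(nrow(X), 40)
train_rows = sampled[1:30]
test_rows = sampled[31:40]

# Pass the held-out covariates as newX so predictions are produced during fitting.
sl = SuperLearner(Y[train_rows], X[train_rows, ], newX = X[test_rows, ],
                  family = gaussian(), cvControl = list(V = 2),
                  SL.library = c("SL.mean", "SL.dbarts"))

# Ensemble predictions for the newX rows, one per row of X[test_rows, ].
head(sl$SL.predict)
```

Predictions for additional rows would require refitting with those rows supplied as `newX`.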

ecpolley/SuperLearner documentation built on April 8, 2018, 10:48 p.m.
