In parsnip: A Common API to Modeling and Analysis Functions

#| child: aaa.Rmd
#| include: false

r descr_models("boost_tree", "spark"). However, multiclass classification is not supported yet.

Tuning Parameters

#| label: spark-param-info
#| echo: false
defaults <- 
  tibble::tibble(parsnip = c("tree_depth", "trees", "learn_rate", "mtry", "min_n", "loss_reduction", "sample_size"),
                 default = c("5L", "20L", "0.1", "see below", "1L", "0.0", "1.0"))

# For this model, this is the same for all modes
param <-
 boost_tree() |> 
  set_engine("spark") |> 
  set_mode("regression") |> 
  make_parameter_list(defaults)

This model has r nrow(param) tuning parameters:

#| label: spark-param-list
#| echo: false
#| results: asis
param$item

The mtry parameter is related to the number of predictors. The default depends on the model mode. For classification, the square root of the number of predictors is used and for regression, one third of the predictors are sampled.

Translation from parsnip to the original package (regression)

#| label: spark-reg
boost_tree(
  mtry = integer(), trees = integer(), min_n = integer(), tree_depth = integer(),
  learn_rate = numeric(), loss_reduction = numeric(), sample_size = numeric()
) |>
  set_engine("spark") |>
  set_mode("regression") |>
  translate()

Translation from parsnip to the original package (classification)

#| label: spark-cls
boost_tree(
  mtry = integer(), trees = integer(), min_n = integer(), tree_depth = integer(),
  learn_rate = numeric(), loss_reduction = numeric(), sample_size = numeric()
) |> 
  set_engine("spark") |> 
  set_mode("classification") |> 
  translate()

Preprocessing requirements

#| child: template-tree-split-factors.Rmd

Case weights

#| child: template-uses-case-weights.Rmd

Note that, for spark engines, the case_weight argument value should be a character string to specify the column with the numeric case weights.

Other details

#| child: template-spark-notes.Rmd

References

Luraschi, J, K Kuo, and E Ruiz. 2019. Mastering Spark with R. O'Reilly Media
Kuhn, M, and K Johnson. 2013. Applied Predictive Modeling. Springer.

Any scripts or data that you put into this service are public.

parsnip documentation built on June 8, 2025, 12:10 p.m.

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

parsnip
A Common API to Modeling and Analysis Functions

In parsnip: A Common API to Modeling and Analysis Functions

Tuning Parameters

Translation from parsnip to the original package (regression)

Translation from parsnip to the original package (classification)

Preprocessing requirements

Case weights

Other details

References

Try the parsnip package in your browser

R Package Documentation

Browse R Packages

We want your feedback!

parsnip A Common API to Modeling and Analysis Functions

In parsnip: A Common API to Modeling and Analysis Functions

Tuning Parameters

Translation from parsnip to the original package (regression)

Translation from parsnip to the original package (classification)

Preprocessing requirements

Case weights

Other details

References

Try the parsnip package in your browser

R Package Documentation

Browse R Packages

We want your feedback!

parsnip
A Common API to Modeling and Analysis Functions