mlr_resamplings_repeated_spcv_env | R Documentation |
Splits data by clustering in the feature space.
See the upstream implementation at blockCV::cv_cluster()
and
Valavi et al. (2018) for further information.
Useful when the dataset is supposed to be split on environmental information which is present in features. The method allows for a combination of multiple features for clustering.
The input of raster images directly as in blockCV::cv_cluster()
is not
supported. See mlr3spatial and its raster DataBackends for such
support in mlr3.
folds
(integer(1)
)
Number of folds.
features
(character()
)
The features to use for clustering.
repeats
(integer(1)
)
Number of repeats.
mlr3::Resampling
-> ResamplingRepeatedSpCVEnv
iters
integer(1)
Returns the number of resampling iterations, depending on the
values stored in the param_set
.
new()
Create an "Environmental Block" repeated resampling instance.
For a list of available arguments, please see blockCV::cv_cluster.
ResamplingRepeatedSpCVEnv$new(id = "repeated_spcv_env")
id
character(1)
Identifier for the resampling strategy.
folds()
Translates iteration numbers to fold number.
ResamplingRepeatedSpCVEnv$folds(iters)
iters
integer()
Iteration number.
repeats()
Translates iteration numbers to repetition number.
ResamplingRepeatedSpCVEnv$repeats(iters)
iters
integer()
Iteration number.
instantiate()
Materializes fixed training and test splits for a given task.
ResamplingRepeatedSpCVEnv$instantiate(task)
task
Task
A task to instantiate.
clone()
The objects of this class are cloneable with this method.
ResamplingRepeatedSpCVEnv$clone(deep = FALSE)
deep
Whether to make a deep clone.
Valavi R, Elith J, Lahoz-Monfort JJ, Guillera-Arroita G (2018). “blockCV: an R package for generating spatially or environmentally separated folds for k-fold cross-validation of species distribution models.” bioRxiv. \Sexpr[results=rd]{tools:::Rd_expr_doi("10.1101/357798")}.
if (mlr3misc::require_namespaces(c("sf", "blockCV"), quietly = TRUE)) {
library(mlr3)
task = tsk("ecuador")
# Instantiate Resampling
rrcv = rsmp("repeated_spcv_env", folds = 4, repeats = 2)
rrcv$instantiate(task)
# Individual sets:
rrcv$train_set(1)
rrcv$test_set(1)
intersect(rrcv$train_set(1), rrcv$test_set(1))
# Internal storage:
rrcv$instance
}
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.