importance_dtw_legacy_cpp | R Documentation |
Computes the contribution of individual variables to the
similarity/dissimilarity between two irregular multivariate time series.
In opposition to the robust version, least-cost paths for each combination
of variables are computed independently, which makes the results of individual
variables harder to compare. This function should only be used when the objective is
replicating importance scores generated with previous versions of the package distantia
.
This function generates a data frame with the following columns:
variable: name of the individual variable for which the importance
is being computed, from the column names of the arguments x
and y
.
psi: global dissimilarity score psi
of the two time series.
psi_only_with: dissimilarity between x
and y
computed from the given variable alone.
psi_without: dissimilarity between x
and y
computed from all other variables.
psi_difference: difference between psi_only_with
and psi_without
.
importance: contribution of the variable to the similarity/dissimilarity
between x
and y
, computed as ((psi_all - psi_without) * 100) / psi_all
.
Positive scores represent contribution to dissimilarity,
while negative scores represent contribution to similarity.
importance_dtw_legacy_cpp(
y,
x,
distance = "euclidean",
diagonal = FALSE,
weighted = TRUE,
ignore_blocks = FALSE,
bandwidth = 1
)
y |
(required, numeric matrix) multivariate time series with the same number of columns as 'x'. |
x |
(required, numeric matrix) multivariate time series. |
distance |
(optional, character string) distance name from the "names"
column of the dataset |
diagonal |
(optional, logical). If TRUE, diagonals are included in the computation of the cost matrix. Default: TRUE. |
weighted |
(optional, logical). Only relevant when diagonal is TRUE. When TRUE, diagonal cost is weighted by y factor of 1.414214 (square root of 2). Default: TRUE. |
ignore_blocks |
(optional, logical). If TRUE, blocks of consecutive path coordinates are trimmed to avoid inflating the psi distance. Default: FALSE. |
bandwidth |
(required, numeric) Size of the Sakoe-Chiba band at both sides of the diagonal used to constrain the least cost path. Expressed as a fraction of the number of matrix rows and columns. Unrestricted by default. Default: 1 |
data frame
Other Rcpp_importance:
importance_dtw_cpp()
,
importance_ls_cpp()
#simulate two regular time series
x <- zoo_simulate(
seed = 1,
rows = 100
)
y <- zoo_simulate(
seed = 2,
rows = 150
)
#different number of rows
#this is not a requirement though!
nrow(x) == nrow(y)
#compute importance
df <- importance_dtw_legacy_cpp(
x = x,
y = y,
distance = "euclidean"
)
df
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.