This function is a helper that ensures the default values in the split data frame have the correct types when they are left unspecified. It helps reduce type errors, especially when working with dplyr, which is stricter about data types.
number |
Row index of the data frame. |
var |
Whether it is a leaf, or the name of the next split variable. |
cut |
The splitting value, so values (of |
n |
Cluster size. Number of observations in that cluster. |
inertia |
Inertia value of the cluster at that node. |
bipartsplitrow |
Position of the next split row in the data set (that position will belong to the left (smaller) node). |
bipartsplitcol |
Position of the next split variable in the data set. |
inertiadel |
Inertia of the cluster at that node as a proportion of the inertia of the root. |
inertia_explained |
Percent inertia explained, as described in Chavent et al. (2007). |
medoid |
Position of the data point regarded as the medoid of its cluster. |
loc |
y-coordinate of the splitting node to facilitate showing on the tree. See |
split.order |
Order of the splits. Root is 0, and increasing. |
alt |
Indicator of an alternative cut yielding the same reduction in inertia at that split. |
A tibble with exactly one row in which every variable, including any left unspecified, has the correct default data type.
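The idea of a one-row result with type-stable defaults can be sketched as follows. This is a minimal illustration and not the package's implementation: the helper name `make_node_row`, the reduced set of columns, the column types, the use of typed `NA` values as defaults, and the `"<leaf>"` placeholder are all assumptions made for the example. It also uses a base R `data.frame` to stay self-contained, whereas the documented function returns a tibble.

```r
# Hypothetical helper (not the package's function): builds a single row
# whose column types are fixed, whether or not a value is supplied.
make_node_row <- function(number,
                          var = NA_character_,   # assumed: character
                          cut = NA_real_,        # assumed: double
                          n = NA_integer_,       # assumed: integer
                          inertia = NA_real_,    # assumed: double
                          alt = FALSE) {         # assumed: logical
  data.frame(
    number  = as.integer(number),
    var     = as.character(var),
    cut     = as.numeric(cut),
    n       = as.integer(n),
    inertia = as.numeric(inertia),
    alt     = as.logical(alt),
    stringsAsFactors = FALSE
  )
}

# One row with most values supplied, one with most left at their defaults.
# The numbers and the "<leaf>" label are made-up placeholder values.
root <- make_node_row(number = 1, n = 150, inertia = 680.8)
leaf <- make_node_row(number = 2, var = "<leaf>")

# Both rows have identical column types, so they stack without conflicts.
root_types <- vapply(root, function(x) class(x)[1], character(1))
leaf_types <- vapply(leaf, function(x) class(x)[1], character(1))
stopifnot(identical(root_types, leaf_types))

tree <- rbind(root, leaf)
```

Coercing every argument explicitly means a caller passing `1` instead of `1L` still gets an integer column, which is the kind of mismatch that stricter row-binding functions tend to reject.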
Chavent, M., Lechevallier, Y., & Briant, O. (2007). DIVCLUS-T: A monothetic divisive hierarchical clustering method. Computational Statistics & Data Analysis, 52(2), 687–701. https://doi.org/10.1016/j.csda.2007.03.013