goodall4 | R Documentation |
The function calculates a dissimilarity matrix based on the G4 similarity measure.
goodall4(data, var.weights = NULL)
data |
A data.frame or a matrix with cases in rows and variables in columns. |
var.weights |
A numeric vector setting weights to the used variables. One can choose the real numbers from zero to one. |
The Goodall 4 similarity measure was presented in (Boriah et al., 2008). It is a simple modification of the original Goodall measure (Goodall, 1966). It assigns higher weights to the frequent categories matches.
The function returns an object of the class "dist".
Zdenek Sulc.
Contact: zdenek.sulc@vse.cz
Boriah S., Chandola V., Kumar V. (2008). Similarity measures for categorical data: A comparative evaluation.
In: Proceedings of the 8th SIAM International Conference on Data Mining, SIAM, p. 243-254.
Goodall V.D. (1966). A new similarity index based on probability. Biometrics, 22(4), p. 882.
anderberg
,
burnaby
,
eskin
,
gambaryan
,
goodall1
,
goodall2
,
goodall3
,
iof
,
lin
,
lin1
,
of
,
sm
,
smirnov
,
ve
,
vm
.
# sample data
data(data20)
# dissimilarity matrix calculation
prox.goodall4 <- goodall4(data20)
# dissimilarity matrix calculation with variable weights
weights.goodall4 <- goodall4(data20, var.weights = c(0.7, 1, 0.9, 0.5, 0))
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.