cosine | R Documentation |
Calculates the cosine measure between two vectors or between all column vectors of a matrix.
cosine(x, y = NULL)
x |
A vector or a matrix (e.g., a document-term matrix). |
y |
Optional: a vector with compatible dimensions to x. |
cosine() calculates a similarity matrix between all column vectors of a matrix x. This matrix might be a document-term matrix, so columns would be expected to be documents and rows to be terms.
When executed on two vectors x and y, cosine() calculates the cosine similarity between them.
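For two vectors, the cosine similarity is the dot product divided by the product of the Euclidean norms. A minimal base-R sketch of that definition follows; `cosine_sketch` is a hypothetical helper name used only for illustration and is not claimed to be the package's implementation.

```r
# Hypothetical helper, for illustration only: cosine of the angle between
# two numeric vectors of equal length.
cosine_sketch <- function(x, y) {
  stopifnot(length(x) == length(y))
  sum(x * y) / (sqrt(sum(x^2)) * sqrt(sum(y^2)))
}

a <- c(1, 1, 1, 0, 0, 0)
b <- c(0, 0, 1, 1, 1, 1)
cosine_sketch(a, b)  # 1 / (sqrt(3) * sqrt(4))
```

The two vectors share a single non-zero position, so the numerator is 1 and the result depends only on the norms.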
Returns an n*n similarity matrix of cosine values, comparing all n column vectors against each other. When executed on two vectors, their cosine similarity value is returned.
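The shape of the matrix result can be illustrated with base R alone. The sketch below assumes a small made-up term-by-document matrix; `cosine_matrix_sketch` is a hypothetical name and only approximates what the function is described as returning.

```r
# Hypothetical helper, for illustration only: pairwise cosine between
# all columns of a numeric matrix.
cosine_matrix_sketch <- function(m) {
  norms <- sqrt(colSums(m^2))          # Euclidean norm of each column
  crossprod(m) / outer(norms, norms)   # dot products scaled by norm products
}

# Made-up data: rows are terms, columns are documents.
m <- cbind(d1 = c(1, 1, 0, 0),
           d2 = c(0, 1, 1, 0),
           d3 = c(0, 0, 1, 1))
s <- cosine_matrix_sketch(m)
dim(s)          # one row and one column per document
isSymmetric(s)  # cosine(a, b) equals cosine(b, a)
diag(s)         # each document compared with itself
```

With n columns the result is n by n and symmetric, and every diagonal entry is 1 as long as no column is all zeros.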
The cosine measure is closely related to the Pearson correlation
coefficient, cor(method="pearson"): Pearson's r equals the cosine of the
two vectors after each has been centered on its mean.
For an investigation of the differences in the context of text mining, see
Leydesdorff (2005).
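The relationship to the Pearson correlation can be checked numerically: Pearson's r is the cosine of the two vectors after each has been centered on its mean. The sketch below uses base R only; `cosine_sketch` is a hypothetical helper defined here for illustration.

```r
# Hypothetical helper, for illustration only.
cosine_sketch <- function(x, y) sum(x * y) / sqrt(sum(x^2) * sum(y^2))

x <- c(1, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0)
y <- c(0, 0, 1, 1, 1, 1, 1, 0, 1, 0, 0, 0)

cosine_sketch(x, y)                      # cosine on the raw vectors
cosine_sketch(x - mean(x), y - mean(y))  # cosine after mean-centering
cor(x, y, method = "pearson")            # agrees with the centered cosine
```

For these sparse, non-negative vectors the raw cosine is positive while the Pearson coefficient is negative, so the two measures can differ in sign as well as in magnitude.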
Fridolin Wild f.wild@open.ac.uk
Leydesdorff, L. (2005) Similarity Measures, Author Cocitation Analysis, and Information Theory. In: JASIST 56(7), pp. 769-772.
cor
## the cosine measure between two vectors
vec1 = c( 1, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0 )
vec2 = c( 0, 0, 1, 1, 1, 1, 1, 0, 1, 0, 0, 0 )
cosine(vec1,vec2)

## the cosine measure for all document vectors of a matrix
vec3 = c( 0, 1, 0, 1, 1, 0, 0, 1, 0, 0, 0, 0 )
matrix = cbind(vec1,vec2, vec3)
cosine(matrix)