simhash | R Documentation |
Simhash worker uses the keyword extraction worker to find the keywords
and uses simhash algorithm to compute simhash. dict
hmm
, idf
and stop_word
should be provided when initializing
jiebaR worker.
simhash(code, jiebar)
vector_simhash(code, jiebar)
code |
For |
jiebar |
jiebaR Worker. |
There is a symbol <=
for this function.
Qin Wenfeng
MS Charikar - Similarity Estimation Techniques from Rounding Algorithms
<=.simhash
worker
## Not run:
### Simhash
words = "hello world"
simhasher = worker("simhash",topn=1)
simhasher <= words
distance("hello world" , "hello world!" , simhasher)
## End(Not run)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.