jaccard_distance: Optimized Computation of Jaccard Distance

Description Usage Arguments Value Author(s) References Examples

Description

Utilizes the slam package to efficiently calculate jaccard distance on large sparse matrices.

Usage

1
2
3
4
5
6
7
jaccard_distance(x, ...)

## S3 method for class 'DocumentTermMatrix'
jaccard_distance(x, ...)

## S3 method for class 'TermDocumentMatrix'
jaccard_distance(x, ...)

Arguments

x

A data type (e.g., DocumentTermMatrix or TermDocumentMatrix).

...

ignored.

Value

Returns a jaccard distance object of class "dist".

Author(s)

user41844 of StackOverflow, Dmitriy Selivanov, and Tyler Rinker <tyler.rinker@gmail.com>.

References

http://stackoverflow.com/a/36373333/1000343 http://stats.stackexchange.com/a/89947/7482

Examples

1
2
3
4
5
6
library(gofastr)
library(dplyr)

out <- presidential_debates_2012 %>%
    with(q_dtm(dialogue)) %>%
    jaccard_distance()

trinker/clustext documentation built on May 31, 2019, 8:41 p.m.