text_similarity_id: Text Similarity Id
In dessyamirudin/similaRText: Measure text similarity score


View source: R/text_similarity_id.R

Generates a unique ID for each group of text judged to be identical, so that matching entries share one ID.

text_similarity_id(
  input_text,
  space = FALSE,
  ignore_case = TRUE,
  score = 80,
  eps = 0.15
)

`input_text`	A vector of text to be cleaned and assigned IDs
`space`	A logical indicating whether to include spaces in the calculation. Default FALSE
`ignore_case`	A logical indicating whether to ignore case. Default TRUE
`score`	A numeric value giving the score at which texts are accepted as similar. Default 80
`eps`	A numeric value that controls how close texts must be to count as similar; a larger value accepts similarity more loosely. Default 0.15
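To make the `space`, `ignore_case` and `score` arguments more concrete, here is a minimal base R sketch. It is not the package's implementation: `normalise_text()` and `similarity_score()` are hypothetical helpers, and the 0 to 100 score is assumed to come from Levenshtein edit distance (`utils::adist`), which may differ from the measure similaRText actually uses.

```r
# Hypothetical helper, not part of similaRText: normalise text in the way the
# `space` and `ignore_case` arguments are described.
normalise_text <- function(x, space = FALSE, ignore_case = TRUE) {
  if (!space) x <- gsub("[[:space:]]+", "", x)  # drop whitespace before comparing
  if (ignore_case) x <- tolower(x)              # compare case-insensitively
  x
}

# Hypothetical 0-100 similarity score built on Levenshtein edit distance;
# this is an assumption, the package may use a different measure.
similarity_score <- function(a, b) {
  d <- utils::adist(a, b)                       # matrix of edit distances
  longest <- outer(nchar(a), nchar(b), pmax)    # longer length of each pair
  100 * (1 - d / pmax(longest, 1))              # guard against empty strings
}

texts <- c("South Korea", "germany", "southkorea", "Germany")
cleaned <- normalise_text(texts)                # defaults: no spaces, lower case
scores <- similarity_score(cleaned, cleaned)
dimnames(scores) <- list(texts, texts)
round(scores)
```

Under these assumptions, "South Korea" and "southkorea" normalise to the same string and score 100. Compared without any normalisation, the same pair scores about 73, which would fall below the default `score` of 80.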

A data frame containing the original text and the unique ID assigned to each text

Dessy Amirudin

text_similarity_id(c("South Korea","germany","southkorea","Germany"))

dessyamirudin/similaRText documentation built on Oct. 23, 2020, 12:21 a.m.
