text_similarity_id: Text Similarity Id


View source: R/text_similarity_id.R

Description

Generate a unique ID shared by identical or similar text

Usage

text_similarity_id(
  input_text,
  space = FALSE,
  ignore_case = TRUE,
  score = 80,
  eps = 0.15
)

Arguments

input_text

A vector of text that needs to be cleaned

space

A logical indicating whether to include spaces in the calculation. Default FALSE

ignore_case

A logical indicating whether to ignore case. Default TRUE

score

A numeric value indicating the score accepted as similar. Default 80

eps

A numeric value that sets how close two texts must be to count as similar. A larger value accepts similarity more loosely. Default 0.15

Value

A data frame containing the original text and the unique ID assigned to each text

Author(s)

Dessy Amirudin

Examples

text_similarity_id(c("South Korea","germany","southkorea","Germany"))
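To make the idea of "one ID per group of similar text" concrete, here is a minimal base R sketch. It is not the similaRText implementation: the helper name sketch_similarity_id, the length-scaled Levenshtein distance, and the single-linkage clustering cut at eps are all assumptions chosen for illustration, and the score argument is left out entirely.

```r
# Illustrative sketch only, NOT the package's code.
# sketch_similarity_id is a hypothetical helper; its distance measure and
# clustering method are assumptions made for this example.
sketch_similarity_id <- function(input_text, space = FALSE,
                                 ignore_case = TRUE, eps = 0.15) {
  key <- input_text
  if (ignore_case) key <- tolower(key)          # compare case-insensitively
  if (!space) key <- gsub("\\s+", "", key)      # drop whitespace before comparing

  # Levenshtein distance scaled by the longer string's length (0 = identical)
  d <- utils::adist(key)
  len <- nchar(key)
  d <- d / pmax(outer(len, len, pmax), 1)

  # Single-linkage clustering cut at eps: strings chained together by
  # scaled distances <= eps receive the same ID
  if (length(key) < 2) {
    id <- seq_along(key)
  } else {
    id <- stats::cutree(stats::hclust(stats::as.dist(d), method = "single"),
                        h = eps)
  }
  data.frame(text = input_text, id = unname(id), stringsAsFactors = FALSE)
}

res <- sketch_similarity_id(c("South Korea", "germany", "southkorea", "Germany"))
print(res)
```

With the defaults in this sketch, "South Korea" and "southkorea" reduce to the same key and so share one ID, as do "germany" and "Germany". Raising eps would also merge strings that differ by a few characters.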

dessyamirudin/similaRText documentation built on Oct. 23, 2020, 12:21 a.m.