Delete rows in a text.table where the number of identical records within a group exceeds a given threshold.
| x | A text.table created by as.text.table(). | 
| text | A string, the name of the column in x used to determine deletion of rows based on the term frequency. | 
| count_col_name | A string, the name to assign to the new column containing the count of each word. If NULL, does not return the counts. | 
| group_by | A vector of column names to group by. List columns are not supported as grouping columns. | 
| max_count | A number, the maximum number of times a word can occur and still be kept. | 
| max_count_is_ratio | TRUE/FALSE. If TRUE, the value passed to max_count is treated as a ratio rather than an absolute count. | 
| total_count_col | A string, the name of the column containing the denominator (likely the total count of records within a group) used to calculate the ratio of a word's count to the total when max_count_is_ratio is TRUE. | 
A text.table with the rows whose duplicate count exceeds the threshold removed.
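The arguments above describe a count-then-filter step. The following is a minimal sketch of that logic in plain data.table, not the package's implementation: `drop_frequent` is a hypothetical helper, its argument names simply mirror the documented ones, and the choice to keep rows whose count is less than or equal to `max_count` is an assumption based on the wording of `max_count`.

```r
library(data.table)

# Hypothetical helper (not the package's code): drop rows whose value in
# `text` occurs more than `max_count` times within its group.
drop_frequent <- function(dt, text, count_col_name = NULL, group_by = NULL,
                          max_count = 1, max_count_is_ratio = FALSE,
                          total_count_col = NULL) {
  out <- copy(dt)                      # leave the caller's table untouched
  by_cols <- c(group_by, text)         # count each word within each group
  out[, word_n := .N, by = by_cols]
  if (max_count_is_ratio) {
    # Assumption: the ratio is word count divided by the supplied total.
    keep <- out[["word_n"]] / out[[total_count_col]] <= max_count
  } else {
    keep <- out[["word_n"]] <= max_count
  }
  out <- out[keep]
  if (is.null(count_col_name)) {
    out[, word_n := NULL]              # counts were not requested
  } else {
    setnames(out, "word_n", count_col_name)
  }
  out[]
}

words <- data.table(
  doc  = c("a", "a", "a", "b", "b", "b"),
  word = c("the", "dog", "the", "the", "cat", "mat")
)

# Absolute threshold, counted within each document.
by_doc <- drop_frequent(words, text = "word", group_by = "doc", max_count = 1)

# Absolute threshold, counted across the whole table.
overall <- drop_frequent(words, text = "word", max_count = 1)

# Ratio threshold: drop words that make up more than half of their document.
words_total <- copy(words)[, total := .N, by = "doc"]
by_ratio <- drop_frequent(words_total, text = "word", group_by = "doc",
                          max_count = 0.5, max_count_is_ratio = TRUE,
                          total_count_col = "total")
```

In this sketch, grouping by `doc` removes "the" only from document "a", where it appears twice, while the ungrouped call removes every occurrence of "the" because it appears three times overall.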
rm_frequent_words(
  as.text.table(
    x = as.data.table(
      list(
        col1 = c(
          "a",
          "b"
        ),
        col2 = c(
          tolower("The dog is nice because it picked up the newspaper."),
          tolower("The dog is extremely nice because it does the dishes.")
        )
      )
    ),
    text = "col2",
    split = " "
  ),
  text = "col2",
  count_col_name = "count",
  max_count = 1
)