add_collocation_label: Choose and add collocation strings based on collocation...

Description Usage Arguments

View source: R/collocations.r

Description

Given a collocation category (e.g., named entity ids), this function finds the most frequently occuring string in this category and adds it as a label for the category

Usage

1
2
add_collocation_label(tc, colloc_id, feature = "token",
  new_feature = sprintf("%s_l", colloc_id), pref_subset = NULL)

Arguments

tc

a tcorpus object

colloc_id

the data column containing the unique id for collocation tokens

feature

the name of the feature column

new_feature

the name of the new feature column

pref_subset

Optionally, a subset call, to specify a subset that has priority for finding the most frequently occuring string


kasperwelbers/corpustools documentation built on Sept. 1, 2018, 1:03 p.m.