tokenize_ations_m_u_prob: Joins two objects together that come back from the...

tokenize_ations_m_u_probR Documentation

Joins two objects together that come back from the tokenize_ations function, joins the token counts together, and calculates the m and u probs for each token.

Description

Joins two objects together that come back from the tokenize_ations function, joins the token counts together, and calculates the m and u probs for each token.

Usage

tokenize_ations_m_u_prob(x, y, ..., suffix = TOKEN_SUFFIX_DEFAULT)

Arguments

x

list returned from tokenize_ations

y

list returned from tokenize_ations

...

ignored

suffix

String vector of length 2, identifies which original dataframe a column in the result comes from. Default TOKEN_SUFFIX_DEFAULT

Examples

dat_ceo <- readr::read_csv('https://tinyurl.com/2p8etjr6')
dat_alb <- readr::read_csv('https://tinyurl.com/2p8ap4ad')
tokenize_ations_m_u_prob(
  tokenize_ations(dat_ceo, col_nms = 'coname', token_types = 'TT'),
  tokenize_ations(dat_alb, col_nms = 'companyName', token_types = 'TT')
)



csps-efpc/TokenLink documentation built on Feb. 10, 2023, 3:30 a.m.