find_posterior_positive_evidence_only: Creates a subset of pairs to check in more detail.
In csps-efpc/TokenLink: Joins two dataframes using tokens or like words

find_posterior_positive_evidence_only

R Documentation

Creates a subset of pairs to check in more detail.

Description

Creates a subset of pairs to check in more detail.

Usage

find_posterior_positive_evidence_only(
  t_dat,
  min_posterior = MIN_POSTERIOR_POSITIVE_EVIDENCE_ONLY,
  token_join_by = TOKEN_TOKEN_TYPE_VEC,
  tokens_to_keep = NULL,
  return_all = TRUE,
  priori_func = calculate_priori,
  remove_identical_row_index = FALSE,
  x_rows_filter = NULL,
  y_rows_filter = NULL,
  ...
)

Arguments

`t_dat`	t_dat object
`min_posterior`	filter posterior results above this value. Default 0.2.
`token_join_by`	vector column names that joins the tokens. Default TOKEN_TOKEN_TYPE_VEC
`tokens_to_keep`	NULL or dataframe with a list of tokens, and m_prob and u_prob for each token, in the case where it is NULL we use tokenized$tokens_all filtered by min_token_u_prob
`return_all`	if TRUE it returns the whole object if FALSE it just returns the dataframe. Default TRUE
`priori_func`	A function that will calculate the priori. Default calculate_priori
`...`	passed to keep_tokens() if tokens_to_keep is NULL

Examples

token_links(
 dat_x = readr::read_csv('https://tinyurl.com/2p8etjr6'),
 dat_y = readr::read_csv('https://tinyurl.com/2p8ap4ad' ),
 args_x = list(col_nms = 'coname'),
 args_y = list(col_nms = 'companyName'),
 token_types = 'company_name',
 token_index = '',
 suffix = c('ceo', 'alb')
) |> find_posterior_positive_evidence_only(return_all = FALSE)

csps-efpc/TokenLink documentation built on Feb. 10, 2023, 3:30 a.m.