calc_m_prob | Calculates what m_prob should be, takes in a dataframe and... |
calculate_priori | Generates vector of priori values one for each row in... |
clean_str | Replaces tokens, and cleans a string using regex stuff... |
clean_str_2 | Cleans a string using after it has been tokenized as a like... |
find_posterior | Appends dataframes with posteriors and returns it |
find_posterior_all_evidence | t_dat should have been run through... |
find_posterior_chunked | Like "find_posterior" but it uses less memory at any one time... |
find_posterior_positive_evidence_only | Creates a subset of pairs to check in more detail. |
find_posterior_subset | Like "find_posterior" but this will always set "return_all =... |
generate_all_tokens | Generates a dataframe with the total counts of each tokens... |
get_paired_row_names | Returns a dataframe with two columns indicating the rows of... |
joined_results | Returns a joined dataframe |
keep_tokens | Given a dataframe of all tokens, object will return a... |
maybe_add | adds value to lst with the key nm if nm is not already in lst |
maybe_do | will apply func to x if bool is TRUE. Saves us from an ugly... |
read_replacements_token_type | Reads in a replacement token file |
read_replacements_token_type_get_fn | get the name of the token replacement file |
reclin_pair_blocking | creates a pair blocking based on columns passed into... |
refine_posterior | After generating probabilities for a list of pairs this will... |
scale_to_prob | Scales a vector from 1-priori_delta to priori_delta |
token_count | Takes a dataframe with columns from cols and counts the... |
tokenize_ations | Takes a dataframe and tokenizes the columns indicated and... |
tokenize_ations_m_u_prob | Joins two objects together that come back from the... |
tokenize_col | turns a column of strings into a tokenized dataframe this... |
tokenize_df | Tokenize a dataframe and multiple columns in the dataframe |
tokenizer_basic | tokenizes a column in a dataframe |
token_links | Returns the required information about the joint probability... |
token_most_common | Returns a dataframe of common ngrams |
token_replacement_generator | returns a vector of replacement tokens |
write_token_replacement | Given a vector of strings this will create or append a file... |
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.