ac | convert to character quickly |
boot_sem | boot_sem non-parametric bootstrap for standard error of the... |
check_packs | get inventory of attached, installed, loaded, and base... |
colorz | colorz |
dataset_word_freq | word frequency dataset (5k most frequent english words in... |
final_seg | final segment of string(s) (similar to 'base::substr()' but... |
get_stop_list | get one of (currently) five different lists of stopwords |
idf | inverse document frequency: log of num docs / num docs w term |
import | python-style import |
init_seg | initial segment of string(s) (similar to 'base::substr()' but... |
lazy_setup | load my most common set of packages and set plot defaults |
lu | number of unique vals |
quiet_attach | attach packages w/o annoying messages |
remove_stops | remove a list of words from a document (case- and... |
scrub_doc | quick func to take a messy doc, clean it up, and return it as... |
se_prop | se of a proportion |
str_pos | string position getter (like 'base::substr()' but nicer... |
text2bigram | convert a document (or doc filepath) to a df of bigrams or... |
text_googmem | the "google memo," written by james damore in 2017. |
text_una_manifesto | full text of the unabomber manifesto, written by ted... |
tf | term frequency: get the frequency of term in doc (for... |
tf_idf | tf_idf: how important is a word in a doc, relative to a set... |
tokenize_text | tokenize text |
word_count_est | estimate the number of words in a document (total or unique) |
zscore | quick zscore func |
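
Several entries above (tf, idf, tf_idf, se_prop, zscore) name standard formulas. The sketch below shows one conventional way to compute them in base R. It is not the package's implementation: the `_sketch` function names, their arguments, and the toy `docs` list are hypothetical, and it assumes documents are already tokenized into character vectors and that term frequency means relative frequency.

```r
# Hypothetical helpers for illustration only; the package's own
# tf(), idf(), tf_idf(), se_prop() and zscore() may differ in
# signature and in details such as normalization.

# Term frequency: share of tokens in `doc` that equal `term`
# (assumes relative rather than raw frequency).
tf_sketch <- function(term, doc) {
  sum(doc == term) / length(doc)
}

# Inverse document frequency, following the index description:
# log of (number of docs / number of docs containing the term).
# Returns Inf if no document contains the term.
idf_sketch <- function(term, docs) {
  n_with_term <- sum(vapply(docs, function(d) term %in% d, logical(1)))
  log(length(docs) / n_with_term)
}

# tf-idf: term frequency in one doc, weighted by idf across all docs.
tf_idf_sketch <- function(term, doc, docs) {
  tf_sketch(term, doc) * idf_sketch(term, docs)
}

# Standard error of a proportion p estimated from n observations.
se_prop_sketch <- function(p, n) {
  sqrt(p * (1 - p) / n)
}

# z-score: center on the mean, scale by the sample standard deviation.
zscore_sketch <- function(x) {
  (x - mean(x)) / sd(x)
}

# Toy corpus of three pre-tokenized documents (made up for this example).
docs <- list(
  c("the", "cat", "sat"),
  c("the", "dog", "ran"),
  c("a", "cat", "ran", "far")
)

tf_sketch("cat", docs[[1]])              # 1/3
idf_sketch("cat", docs)                  # log(3/2)
tf_idf_sketch("cat", docs[[1]], docs)    # (1/3) * log(3/2)
se_prop_sketch(0.5, 100)                 # 0.05
zscore_sketch(c(1, 2, 3))                # -1 0 1
```

A term that appears in every document gets an idf of `log(1) = 0` under this definition, so its tf-idf is zero regardless of how often it occurs.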