| ac | convert to character quickly |
| boot_sem | boot_sem non-parametric bootstrap for standard error of the... |
| check_packs | get inventory of attached, installed, loaded, and base... |
| colorz | colorz |
| dataset_word_freq | word frequency dataset (5k most frequent english words in... |
| final_seg | final segment of string(s) (similar to 'base::substr()' but... |
| get_stop_list | get one of (currently) five different lists of stopwords |
| idf | inverse document frequency: log of num docs / num docs w term |
| import | python-style import |
| init_seg | initial segment of string(s) (similar to 'base::substr()' but... |
| lazy_setup | load my most common set of packages and set plot defaults |
| lu | number of unique vals |
| quiet_attach | attach packages w/o annoying messages |
| remove_stops | remove a list of words from a document (case- and... |
| scrub_doc | quick func to take a messy doc, clean it up, and return it as... |
| se_prop | se of a proportion |
| str_pos | string position getter (like 'base::substr()' but nicer... |
| text2bigram | convert a document (or doc filepath) to a df of bigrams or... |
| text_googmem | the "google memo," written by james damore in 2017. |
| text_una_manifesto | full text of the unabomber manifesto, written by ted... |
| tf | term frequency: get the frequency of term in doc (for... |
| tf_idf | tf_idf: how important is a word in a doc, relative to a set... |
| tokenize_text | Tokenize text |
| word_count_est | estimate the number of words in a document (total or unique) |
| zscore | quick zscore func |
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.