| ACMI_contribution | Calculate Average Conditional Mutual Information (ACMI)... |
| calculate_document_pair_distances | Document Distances |
| check_directory_name | A function to ensure that a directory name is in the proper... |
| clean_document_text | A function which cleans the raw text of a document provided... |
| color_words_by_frequency | A function to generate LaTeX output from a dataframe... |
| color_word_table | A function to generate LaTeX output from a dataframe... |
| combine_document_term_matrices | A function to combine multiple document term matrices into a... |
| compare_tf_idf_scalings | A function that performs a bunch of different forms of TF-IDF... |
| congress_bills | All versions of the first 20 bills introduced in the House... |
| contingency_table | Generates a contingency table from user-specified document... |
| convert_quanteda_to_slam | A function to convert a quanteda dfm object to a... |
| corenlp | Runs Stanford CoreNLP on a collection of documents |
| corenlp_blocked | Runs Stanford CoreNLP on a collection of .txt files and... |
| count_ngrams | An experimental function to efficiently generate a vocabulary... |
| count_words | A function to efficiently form aggregate word counts and a... |
| dice_coefficient_diff_table | Lines In Both Documents via Dice Coefficients |
| dice_coefficient_line_matching | Lines In Both Documents via Dice Coefficients |
| distinct_words | A function to find (semi)-distinct words in a list of term... |
| document_similarities | Calculate sequence based document similarities |
| document_term_count_list | Document Term Count List: Congressional Bills |
| document_term_vector_list | Document Term Vector List: Congressional Bills |
| download_corenlp | Checks the Java version on your computer and downloads... |
| download_mallet | Checks the Java version on your computer and downloads MALLET... |
| edit_metrics | Calculate Edit Metrics Between Two Document Versions |
| estimate_plots | A function to generate parameter estimate plots with 95 percent... |
| feature_selection | A function that implements a number of feature selection... |
| fightin_words_plot | A function that generates plots similar to those in Monroe et... |
| frequency_threshold | A function to frequency threshold a vector of strings. |
| generate_blocked_document_term_vectors | A function to generate and save blocks of document term... |
| generate_document_term_matrix | A function to generate a document term matrix from a list of... |
| generate_document_term_vectors | A function to generate document term vectors from a variety... |
| generate_sparse_large_document_term_matrix | A function to generate a sparse large document term matrix in... |
| get_file_paths | A function that returns the file paths to two example raw... |
| get_unique_values_and_counts | Find unique values and the counts of those variables for a... |
| kill_zombies | A function which takes no arguments and kills zombie R... |
| mallet_lda | A wrapper function for LDA using the MALLET machine learning... |
| multi_dice_coefficient_matching | Multiple N-Gram Length Dice Coefficient Document Matching |
| multi_plot | An implementation of matplot with nice coloring and automatic... |
| mutual_information | Mutual Information |
| ngrams | Extracts N-Grams and phrases from a collection of documents... |
| ngram_sequence_matching | N-Gram Sequence Matching |
| ngram_sequnce_plot | N-Gram Sequence Matching |
| order_by_counts | A function to generate an ordered word count dataframe from a... |
| pmi | A function to calculate a number of information-theoretic... |
| Processed_Text | Twenty bills tokenized and tagged by CoreNLP |
| reference_distribution_distance | Reference distribution distances |
| sparse_doc_term_parallel | Only to be used internally. A function to generate a sparse... |
| sparse_to_dense_matrix | A function to convert a slam::simple_triplet_matrix sparse... |
| SpeedReader | SpeedReader: functions to facilitate high performance text... |
| speed_set_vocabulary | A function that reorganizes vocabulary to speed up document... |
| tfidf | A function to calculate TF-IDF and other related statistics... |
| topic_coherence | A function to calculate topic coherence for a given topic... |
| unlist_and_concatenate | A function to unlist and concatenate a subset of a... |