Man pages for textTools
Functions for Text Cleansing and Text Analysis

as.text.table	Convert a data.table column of character vectors into a...
flag_words	Flag rows in a text.table with specific words
label_parts_of_speech	Add a column with the parts of speech for each word in a...
l_pos	Parts of speech for English words from the Moby Project.
ngrams	Create n-grams
pos	Parts of speech for English words from the Moby Project.
regex_paragraph	Regular expression that might be used to split strings of...
regex_sentence	Regular expression that might be used to split strings of...
regex_word	Regular expression that might be used to split strings of...
rm_frequent_words	Delete rows in a text.table where the number of identical...
rm_infrequent_words	Delete rows in a text.table where the number of identical...
rm_long_words	Delete rows in a text.table where the word has more than a...
rm_no_overlap	Delete rows in a text.table where the records within a group...
rm_overlap	Delete rows in a text.table where the records within a group...
rm_parts_of_speech	Delete rows in a text.table where the word has a certain part...
rm_regexp_match	Delete rows in a text.table where the record has a certain...
rm_short_words	Delete rows in a text.table where the word has fewer than a...
rm_words	Remove rows from a text.table with specific words
sampleStr	Generates (pseudo)random strings of the specified character length
stopwords	Vector of lowercase English stop words.
str_any_match	Detect if there are any words in a vector also found in...
str_count_intersect	Count the intersecting words in a vector that are found in...
str_count_jaccard_similarity	Calculates the intersect divided by union of two vectors of...
str_count_match	Count the words in a vector that are found in another vector.
str_count_nomatch	Count the words in a vector that are not found in another...
str_count_positional_match	Count words from a vector that are found in the same position...
str_count_positional_nomatch	Count words from a vector that are not found in the same...
str_counts	Create a list of a vector of unique words found in x and a...
str_count_setdiff	Count the words in a vector that don't intersect with another...
str_dt_col_combine	Combine columns of a data.table into a list in a new column,...
str_extract_match	Extract words from a vector that are found in another vector.
str_extract_nomatch	Extract words from a vector that are not found in another...
str_extract_positional_match	Extract words from a vector that are found in the same...
str_extract_positional_nomatch	Extract words from a vector that are not found in the same...
str_rm_blank_space	Remove and replace excess white space from strings.
str_rm_long_words	Remove words from a vector that have more than a maximum...
str_rm_non_alphanumeric	Remove and replace non-alphanumeric characters from strings.
str_rm_non_printable	Remove and replace non-printable characters from strings.
str_rm_numbers	Remove and replace numbers from strings.
str_rm_punctuation	Remove and replace punctuation from strings.
str_rm_regexp_match	Remove words from a vector that match a regular expression.
str_rm_short_words	Remove words from a vector that don't have a minimum number...
str_rm_words	Remove words from a vector of words found in another vector...
str_rm_words_by_length	Remove words from a vector based on the number of characters...
str_stopwords_by_part_of_speech	Create a vector of English words associated with particular...
str_tolower	Calls base::tolower(), which converts letters to lowercase....
str_weighted_count_match	Weighted count of the words in a vector that are found in...