speech_tokens: Speech Tokenizer
In JCRascal/speechiespeech: What the Package Does (One Line, Title Case)


View source: R/speech_tokens.R

speech_tokens takes the full text of a speech, together with its title and author, and tokenizes that text into individual words; it returns a tibble containing each word in that speech. speech_tokens_dfr does the same for every document in a data frame.


speech_tokens(text, title, author)

speech_tokens_dfr(.data)

`text`	String containing the full text to be tokenized
`title`	String giving the title of the speech
`author`	String giving the full name of the speech's author
`.data`	A data frame or tibble with text, title, and author columns for the documents to be tokenized

A tibble containing the words of a given document, with common stopwords removed. It covers a single document if the call is to speech_tokens, and several documents if the call is to speech_tokens_dfr.
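To make the return value more concrete, here is a minimal sketch of how a tokenizer with this behaviour could be written. It is not the package's actual source: the function names sketch_speech_tokens and sketch_speech_tokens_dfr are hypothetical, the use of tidytext::unnest_tokens and the tidytext stop_words list is an assumption, and the sample sentence, title, and author are made up for illustration.

```r
library(dplyr)
library(tibble)
library(tidytext)

# Hypothetical stand-in for speech_tokens_dfr; the real function may differ.
# Expects a data frame with text, title and author columns.
sketch_speech_tokens_dfr <- function(.data) {
  .data %>%
    # one row per lowercased word; title and author are carried along
    unnest_tokens(word, text) %>%
    # assumption: "common stopwords" means tidytext's stop_words list
    anti_join(stop_words, by = "word")
}

# Hypothetical stand-in for speech_tokens: wrap one document in a tibble
# and reuse the data-frame version.
sketch_speech_tokens <- function(text, title, author) {
  sketch_speech_tokens_dfr(
    tibble(text = text, title = title, author = author)
  )
}

# Made-up input, for illustration only.
demo <- sketch_speech_tokens(
  "The quick brown fox jumps over the lazy dog",
  "Example Speech",
  "Jane Doe"
)
print(demo)
```

Under these assumptions the result has one row per remaining word, plus the title and author columns, so it can be passed straight to something like dplyr::count(demo, word, sort = TRUE) to get word frequencies.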

text <- sample_speech_data$text[5]
title <- sample_speech_data$title[5]
author <- sample_speech_data$author[5]

speech_tokens(text, title, author)

speech_tokens_dfr(sample_speech_data)