speech_tokens: Speech Tokenizer

Description Usage Arguments Value Examples

View source: R/speech_tokens.R

Description

speech_tokens function receives a url input and uses CSS selectors to parse and tokenize the portion of the target url believed to contain the txt of a speech; returns a tibble containing each word in that speech

Usage

1
2
3

Arguments

text

string containing full text to be tokenized

title

string value of speech title

author

string value of speech author's full name

.data

A data frame or tibble containing columns of text, title, and author of the desired documents

Value

A tibble where the elements represent all words in a given document, filtered for common stopwords. Contains the documents of a single document if the call is to speech_tokens, several documents if the call is to speech_tokens_dfr

Examples

1
2
3
4
5
6
7

JCRascal/speechiespeech documentation built on Nov. 27, 2020, 5:18 p.m.