load_corpus: Load text corpus

Description Usage Arguments Value

View source: R/load_corpus.R

Description

Decides upon a text corpus.

Usage

1
load_corpus(source, type, sub_token)

Arguments

source

A string denoting the data source. Possible values are imdb_sentences, amazon_sentences, yelp_sentences and english_words.

type

Type of texts to be sampled. Possible values are texts, paragraphs, sentences, words, and characters.

sub_token

A string specifying the text unit for filtering texts by length via min_length and max_length.

Value

Text corpus.


nproellochs/textsampler documentation built on Nov. 4, 2019, 10:10 p.m.