vocab_docs: Create Vocab for Docs

Description Usage Arguments Value Examples

View source: R/cleaning.R

Description

collect all unique tokens in the full set of docs which serve as the base vocabulary for the corpus. Returns a character vector of the unique tokens across all entries in the docs.

Usage

1
vocab_docs(docs)

Arguments

docs

a vector or list of text samples

Value

a character vector containing each unique token entry that appears in the docs.

Examples

1
vocab_docs( c('First entry of the doc,', 'Second entry of the doc.') )

duncankmckinnon/ezRnlp documentation built on Aug. 6, 2020, 1:28 a.m.