tokens: Contents of Speeches Tokenized

Description Usage Arguments Author(s) References

Description

The Python NLTK software is used to produce a list of tokens from all the speeches supplied in the argument. The list corresponding to a particular speaker is unlisted and the token "$" used as a line separator is replaced with new line characters.

Usage

1
tokens(speeches, .get = NA)

Arguments

speeches

Proxy for a Python object containing the relevant speeches. This may be a list of speeches or and object for which getSpeeches() returns such a list. Possiblities include a Play, Act or Scene object.

.get

The standard option for converting the result to R. Supplying as TRUE converts to R as a named list.

Author(s)

John M. Chambers

References

Chambers, John M. (2016) Extending R, Chapman & Hall/CRC.


johnmchambers/shakespeare documentation built on May 19, 2019, 5:16 p.m.