plays_tokens: Tokens from Shakespeare's plays

Description Usage Format Source

Description

A data frame containing individual words from Shakespeare' plays along with their frequency and character count. We only include words with frequency n such that n >= 75th percentile, n <= 99.95th percentile. Number of observations: 3,180 tokens.

Usage

1

Format

A data frame with the following variables:

word

Individual token, may include apostrophes and proper names.

n

This is the word frequency across all 37 plays.

nchar

Number of characters in token.

Source

http://www.ibiblio.org/xml/examples/shakespeare/


rubenhm/shksprwrdl documentation built on Jan. 26, 2022, 6:08 a.m.