dataset_word_freq: word frequency dataset (5k most frequent english words in...

Description Usage Format Source

Description

a dataset containing corpus freqency, pos, freq rank, and dispersion for the 5k most frequent words in the corpus of contemporary american english (COCA)

Usage

1

Format

a data frame with 5000 rows and five columns

Rank

a word's frequency rank in COCA

Word

an english word

PartOfSpeech

a part of speech (see COCA documentation for codes)

Frequency

number of occurrences in COCA

Dispersion

dispersion score for the word (see COCA documentation)

Source

https://www.wordfrequency.info/


lefft/lefftpack documentation built on May 8, 2019, 1:13 p.m.