list_ngsl_all: Full NGSL

Description Usage Format Details Source

Description

A dataset containing the full frequency data provided by the New General Service List researchers. #' For more information and rationale, see the source documentation.

Usage

1

Format

A data frame with 31241 observations and 4 variables

lemma

base form of the word, English

group

"difficulty grouping"

on_list

general/sup/nawl/NA - not particularly useful here, but preserved for consistency

rank

word frequency rank according to the researchers

Details

Difficulty groupings have been arbitrarily set by me, as follows:

- Group 1: first 500 words of NGSL by frequency & "supplementary" words - months/numbers etc - Group 2: next 500 words of NGSL by frequency - Group 3: next 1000 words of NGSL by frequency - Group 4: remaining NGSL words by frequency (about 800 words) + NAWL (about 900 words) - Groups 5-13: frequency groupings by first significant digit of rank (9,000-9,999, 10,000-19,999, 20,000, 29,999 etc)

Source

https://www.newgeneralservicelist.org/


antdurrant/live.analytics documentation built on Dec. 13, 2021, 12:20 p.m.