list_ngsl_all: Full NGSL

list_ngsl_allR Documentation

Full NGSL

Description

A dataset containing the full frequency data provided by the New General Service List researchers. #' For more information and rationale, see the source documentation.

Usage

list_ngsl_all

Format

A data frame with 31241 observations and 4 variables

lemma

base form of the word, English

group

"difficulty grouping"

on_list

general/sup/nawl/NA - not particularly useful here, but preserved for consistency

rank

word frequency rank according to the researchers

Details

Difficulty groupings have been arbitrarily set by me, as follows:

  • Group 1: first 500 words of NGSL by frequency & "supplementary" words - months/numbers etc

  • Group 2: next 500 words of NGSL by frequency

  • Group 3: next 1000 words of NGSL by frequency

  • Group 4: remaining NGSL words by frequency (about 800 words) + NAWL (about 900 words)

  • Groups 5-13: frequency groupings by first significant digit of rank (9,000-9,999, 10,000-19,999, 20,000-29,999 etc)

Note

LICENSE: Creative Commons Attribution-ShareAlike 4.0 International License.

Source

https://www.newgeneralservicelist.org/


antdurrant/word.lists documentation built on July 20, 2023, 3:57 p.m.