get_wordlist: Make wordlists from udpiped dataframes
In antdurrant/word.lists: A Collection Of Word Lists For ESL

get_wordlist

R Documentation

Make wordlists from udpiped dataframes

Description

This will add the first four unique translations that match the words and pos provided. Anything that is not a verb, adjective, noun, or adverb will return blanks, as will anything that does not have a translation in OMW for the language requested. It will not look for proper nouns; udpipe does a great job of parsing pos, with the exception of Title Case. Consider pre-processing if you have a lot of Title Case. This is intended to be used in conjunction with the word frequency lists to aid in building wordlists for ESL teachers.

Usage

get_wordlist(data, language, def = TRUE)

Arguments

`data`	a dataframe containing doc_id, sentence_id, token_id, token, lemma, upos. Should be piped from udpipe.
`language`	three-character abbreviation of language to find translations for. Refer to `wordlists::nltk_languages`.
`def`	TRUE will add a definition column, anything else will leave it out. Defaults to TRUE