dot-process_word_recursive: Split a Word into Pieces

.process_word_recursiveR Documentation

Split a Word into Pieces

Description

Recursively splits a word into constituent pieces, based on Wiktionary annotations. There are two main categories of word pieces used: inflections (standard verb/noun/comparative adjective forms, defined in practice here as endings identified in Wiktionary by inflectional function without reference to the actual form of the ending) and morphemes (typically denoted in Wiktionary by etymology templates).

Usage

.process_word_recursive(
  word,
  sight_words = default_sight_words(),
  use_lookup = TRUE,
  current_depth = 1L,
  max_depth = 30L,
  stop_at = NULL
)

Arguments

word

Character; a word to process.

sight_words

Character vector; words to not break down further. Defaults to the included sight_words list; to include no sight words, pass in an empty character vector to this parameter.

use_lookup

Logical; whether to use a cached lookup table (if available) or always process the word from scratch. If the word is not available in the lookup, processing (and likely a call to the Wiktionary API) will still occur. You might want to set this value to FALSE if you've made recent edits to Wiktionary or otherwise want to see if something has changed recently.

current_depth

Integer; current recursion depth.

max_depth

Integer; maximum recursion depth.

stop_at

Character; a word to stop processing at. Used to help prevent loops.

Value

Character; the word split into pieces.


jonthegeek/wikimorphemes documentation built on Sept. 1, 2022, 10:32 p.m.