tests/testthat/_snaps/R4.4/tokenize_bpe.md

Errors if vocabulary size is set to low.

Code
  prep(step_tokenize_bpe(recipe(~text1, data = test_data), text1,
  vocabulary_size = 10))
Condition
  Error in `step_tokenize_bpe()`:
  Caused by error in `prep()`:
  ! `vocabulary_size` of 10 is too small for column `text1` which has a unique character count of 23


EmilHvitfeldt/textrecipes documentation built on June 10, 2025, 1:21 a.m.