tests/testthat/_snaps/sequence_onehot.md

padding and truncating works correctly

Code
  rec %>% step_tokenize(text) %>% step_sequence_onehot(text, padding = "not pre")
Condition
  Error in `step_sequence_onehot()`:
  ! `padding` should be one of: 'pre', 'post'
Code
  rec %>% step_tokenize(text) %>% step_sequence_onehot(text, truncating = "Wrong")
Condition
  Error in `step_sequence_onehot()`:
  ! `truncating` should be one of: 'pre', 'post'
Code
  rec %>% step_tokenize(text) %>% step_sequence_onehot(text, padding = c("pre",
    "pre"))
Condition
  Error in `step_sequence_onehot()`:
  ! `padding` should be one of: 'pre', 'post'
Code
  rec %>% step_tokenize(text) %>% step_sequence_onehot(text, truncating = "Wrong")
Condition
  Error in `step_sequence_onehot()`:
  ! `truncating` should be one of: 'pre', 'post'

check_name() is used

Code
  prep(rec, training = dat)
Condition
  Error in `step_sequence_onehot()`:
  Caused by error in `bake()`:
  ! Name collision occured. The following variable names already exists:
  i  seq1hot_text_1

empty printing

Code
  rec
Message

  -- Recipe ----------------------------------------------------------------------

  -- Inputs 
  Number of variables by role
  outcome:    1
  predictor: 10

  -- Operations 
  * Sequence 1 hot encoding for: <none>
Code
  rec
Message

  -- Recipe ----------------------------------------------------------------------

  -- Inputs 
  Number of variables by role
  outcome:    1
  predictor: 10

  -- Training information 
  Training data contained 32 data points and no incomplete rows.

  -- Operations 
  * Sequence 1 hot encoding for: <none> | Trained

keep_original_cols - can prep recipes with it missing

Code
  rec <- prep(rec)
Condition
  Warning:
  'keep_original_cols' was added to `step_sequence_onehot()` after this recipe was created.
  Regenerate your recipe to avoid this warning.

printing

Code
  print(rec)
Message

  -- Recipe ----------------------------------------------------------------------

  -- Inputs 
  Number of variables by role
  predictor: 1

  -- Operations 
  * Tokenization for: text
  * Sequence 1 hot encoding for: text
Code
  prep(rec)
Message

  -- Recipe ----------------------------------------------------------------------

  -- Inputs 
  Number of variables by role
  predictor: 1

  -- Training information 
  Training data contained 4 data points and no incomplete rows.

  -- Operations 
  * Tokenization for: text | Trained
  * Sequence 1 hot encoding for: text | Trained


Try the textrecipes package in your browser

Any scripts or data that you put into this service are public.

textrecipes documentation built on Nov. 16, 2023, 5:06 p.m.