tests/testthat/_snaps/sequence_onehot.md

padding and truncating works correctly

Code
  rec %>% step_tokenize(text) %>% step_sequence_onehot(text, padding = "not pre")
Condition
  Error in `step_sequence_onehot()`:
  ! `padding` should be one of: 'pre', 'post'
Code
  rec %>% step_tokenize(text) %>% step_sequence_onehot(text, truncating = "Wrong")
Condition
  Error in `step_sequence_onehot()`:
  ! `truncating` should be one of: 'pre', 'post'
Code
  rec %>% step_tokenize(text) %>% step_sequence_onehot(text, padding = c("pre",
    "pre"))
Condition
  Error in `step_sequence_onehot()`:
  ! `padding` should be one of: 'pre', 'post'
Code
  rec %>% step_tokenize(text) %>% step_sequence_onehot(text, truncating = "Wrong")
Condition
  Error in `step_sequence_onehot()`:
  ! `truncating` should be one of: 'pre', 'post'

check_name() is used

Code
  prep(rec, training = dat)
Condition
  Error in `step_sequence_onehot()`:
  Caused by error in `bake()`:
  ! Name collision occurred. The following variable names already exist:
  * `seq1hot_text_1`

empty printing

Code
  rec
Message

  -- Recipe ----------------------------------------------------------------------

  -- Inputs 
  Number of variables by role
  outcome:    1
  predictor: 10

  -- Operations 
  * Sequence 1 hot encoding for: <none>
Code
  rec
Message

  -- Recipe ----------------------------------------------------------------------

  -- Inputs 
  Number of variables by role
  outcome:    1
  predictor: 10

  -- Training information 
  Training data contained 32 data points and no incomplete rows.

  -- Operations 
  * Sequence 1 hot encoding for: <none> | Trained

keep_original_cols - can prep recipes with it missing

Code
  rec <- prep(rec)
Condition
  Warning:
  `keep_original_cols` was added to `step_sequence_onehot()` after this recipe was created.
  i Regenerate your recipe to avoid this warning.

printing

Code
  print(rec)
Message

  -- Recipe ----------------------------------------------------------------------

  -- Inputs 
  Number of variables by role
  predictor: 1

  -- Operations 
  * Tokenization for: text
  * Sequence 1 hot encoding for: text
Code
  prep(rec)
Message

  -- Recipe ----------------------------------------------------------------------

  -- Inputs 
  Number of variables by role
  predictor: 1

  -- Training information 
  Training data contained 4 data points and no incomplete rows.

  -- Operations 
  * Tokenization for: text | Trained
  * Sequence 1 hot encoding for: text | Trained


EmilHvitfeldt/textrecipes documentation built on April 7, 2024, 5:02 a.m.