model_wordpiece: An implementation of the WordPiece algorithm
In tok: Fast Text Tokenization

model_wordpiece

R Documentation

An implementation of the WordPiece algorithm

Description

An implementation of the WordPiece algorithm

Super class

tok::tok_model -> tok_model_wordpiece

Methods

Public methods

model_wordpiece$new()
model_wordpiece$clone()

Method `new()`

Constructor for the wordpiece tokenizer

Usage

model_wordpiece$new(
  vocab = NULL,
  unk_token = NULL,
  max_input_chars_per_word = NULL
)

Arguments

vocab: A dictionary of string keys and their corresponding ids. Default: NULL.
unk_token: The unknown token to be used by the model. Default: NULL.
max_input_chars_per_word: The maximum number of characters to allow in a single word. Default: NULL.

Method `clone()`

The objects of this class are cloneable with this method.

Usage

model_wordpiece$clone(deep = FALSE)

Arguments

deep: Whether to make a deep clone.

tok
Fast Text Tokenization

model_wordpiece: An implementation of the WordPiece algorithm
In tok: Fast Text Tokenization

An implementation of the WordPiece algorithm

Description

Super class

Methods

Public methods

Method `new()`

Usage

Arguments

Method `clone()`

Usage

Arguments

See Also

Related to model_wordpiece in tok...

R Package Documentation

Browse R Packages

We want your feedback!

tok Fast Text Tokenization

model_wordpiece: An implementation of the WordPiece algorithm In tok: Fast Text Tokenization

An implementation of the WordPiece algorithm

Description

Super class

Methods

Public methods

Method new()

Usage

Arguments

Method clone()

Usage

Arguments

See Also

Related to model_wordpiece in tok...

R Package Documentation

Browse R Packages

We want your feedback!

tok
Fast Text Tokenization

model_wordpiece: An implementation of the WordPiece algorithm
In tok: Fast Text Tokenization

Method `new()`

Method `clone()`