transformer_vocab: Returns the vocabulary of a model

View source: R/tr_utils.R

transformer_vocab    R Documentation

Returns the vocabulary of a model

Description

Returns the (decoded) vocabulary of a model.

Usage

transformer_vocab(
  model = getOption("pangoling.causal.default"),
  add_special_tokens = NULL,
  decode = FALSE,
  config_tokenizer = NULL
)

Arguments

model

Name of a pre-trained model or a path to a local folder. Causal models such as "gpt2" should work. See the Hugging Face website for available models.

add_special_tokens

Logical. Whether to include special tokens. Uses the same default as the AutoTokenizer class in Python's transformers library.

decode

Logical. If TRUE, decodes the tokens into human-readable strings, handling special characters and diacritics. Default is FALSE.

config_tokenizer

List with other arguments that control how the tokenizer from Hugging Face is accessed.

Value

A character vector containing the vocabulary of the model.

See Also

Other token-related functions: ntokens(), tokenize_lst()

Examples


transformer_vocab(model = "gpt2") |>
  head()
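
## The decode argument matters for byte-level tokenizers such as GPT-2's, where
## raw vocabulary entries carry markers (e.g. "Ġ" for a leading space) rather
## than plain strings. A sketch of the contrast, assuming the "gpt2" model can
## be downloaded:

## Raw entries: may include byte-level markers
transformer_vocab(model = "gpt2") |>
  head()

## Decoded entries: human-readable strings
transformer_vocab(model = "gpt2", decode = TRUE) |>
  head()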


pangoling documentation built on April 11, 2025, 6:16 p.m.