ntokens: The number of tokens in a string or vector of strings
In pangoling: Access to Large Language Model Predictions

ntokens

R Documentation

The number of tokens in a string or vector of strings

Description

The number of tokens in a string or vector of strings

Usage

ntokens(
  x,
  model = getOption("pangoling.causal.default"),
  add_special_tokens = NULL,
  config_tokenizer = NULL
)

Arguments

`x`	character input
`model`	Name of a pre-trained model or folder. One should be able to use models based on "gpt2". See hugging face website.
`add_special_tokens`	Whether to include special tokens. It has the same default as the AutoTokenizer method in Python.
`config_tokenizer`	List with other arguments that control how the tokenizer from Hugging Face is accessed.

Value

The number of tokens in a string or vector of words.

Examples


ntokens(x = c("The apple doesn't fall far from the tree."), model = "gpt2")

pangoling documentation built on April 11, 2025, 6:16 p.m.

pangoling index

Package overview Troubleshooting the use of Python in R Using a Bert model to get the predictability of words in their context Using a GPT2 transformer model to get word predictability Worked-out example: Surprisal from a causal (GPT) model as a cognitive processing bottleneck in reading

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

pangoling
Access to Large Language Model Predictions

ntokens: The number of tokens in a string or vector of strings
In pangoling: Access to Large Language Model Predictions

The number of tokens in a string or vector of strings

Description

Usage

Arguments

Value

See Also

Examples

Related to ntokens in pangoling...

R Package Documentation

Browse R Packages

We want your feedback!

pangoling Access to Large Language Model Predictions

ntokens: The number of tokens in a string or vector of strings In pangoling: Access to Large Language Model Predictions

The number of tokens in a string or vector of strings

Description

Usage

Arguments

Value

See Also

Examples

Related to ntokens in pangoling...

R Package Documentation

Browse R Packages

We want your feedback!

pangoling
Access to Large Language Model Predictions

ntokens: The number of tokens in a string or vector of strings
In pangoling: Access to Large Language Model Predictions