llama_set_threads: Set the number of threads for a context
In llamaR: Interface for Large Language Models via 'llama.cpp'

llama_set_threads

R Documentation

Set the number of threads for a context

Set the number of threads for a context

llama_set_threads(ctx, n_threads, n_threads_batch = n_threads)

`ctx`	Context handle returned by [llama_new_context]
`n_threads`	Number of threads for single-token generation
`n_threads_batch`	Number of threads for batch processing (prompt encoding). Defaults to the same value as `n_threads`.

No return value, called for side effects.

## Not run: 
model <- llama_load_model("model.gguf")
ctx <- llama_new_context(model)
llama_set_threads(ctx, n_threads = 8L)

## End(Not run)

llamaR documentation built on May 28, 2026, 1:06 a.m.

llamaR index

Note that we can't provide technical support on individual packages. You should contact the package authors for that.