| llama_numa_init | R Documentation |
Initializes NUMA (non-uniform memory access) handling. Call once, before loading any models, for better performance on NUMA systems.
llama_numa_init(strategy = "disabled")
| strategy | NUMA strategy, passed as a string. The default is "disabled"; the example on this page uses "distribute". |
No return value, called for side effects.
## Not run:
# On multi-socket servers, distribute memory across NUMA nodes
# for better memory bandwidth during inference
llama_numa_init("distribute")
# Call llama_numa_init() before loading any models; it affects all subsequent allocations
model <- llama_load_model("model.gguf", n_gpu_layers = 0L)
## End(Not run)
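The example above hard-codes "distribute". As a minimal sketch, assuming a Linux host that exposes its NUMA topology under /sys/devices/system/node, the strategy could instead be chosen from the number of nodes detected. The helpers `count_numa_nodes()` and `choose_numa_strategy()` are hypothetical names introduced here for illustration and are not part of the package; only the two strategy values shown on this page are used.

```r
# Hypothetical helper: count NUMA nodes via Linux sysfs.
# Returns 1 when the directory is absent (non-Linux systems, or kernels
# built without NUMA support), because list.files() then returns character(0).
count_numa_nodes <- function(sysfs_dir = "/sys/devices/system/node") {
  nodes <- list.files(sysfs_dir, pattern = "^node[0-9]+$")
  max(1L, length(nodes))
}

# Hypothetical helper: pick one of the two strategies shown on this page.
choose_numa_strategy <- function(n_nodes) {
  if (n_nodes > 1L) "distribute" else "disabled"
}

strategy <- choose_numa_strategy(count_numa_nodes())

# Call once, before any model is loaded. The exists() guard only keeps this
# sketch runnable when the package is not attached.
if (exists("llama_numa_init", mode = "function")) {
  llama_numa_init(strategy)
}
```

Whether "distribute" actually improves throughput on a given multi-node machine depends on the workload, so it is worth measuring inference speed with both settings.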