| llama_set_warmup | R Documentation |
When 'warmup = TRUE', the context runs in warmup mode, which pre-caches model weights in GPU memory without producing meaningful output. Call again with 'warmup = FALSE' to return to normal inference mode.
llama_set_warmup(ctx, warmup)
| ctx | Context handle returned by [llama_new_context] |
| warmup | Logical; 'TRUE' to enable warmup mode, 'FALSE' to disable. |
No return value, called for side effects.
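Because the function only toggles state on the context, a caller has to remember to switch warmup mode off again. The following is a minimal sketch of one way to do that, assuming only the 'llama_set_warmup(ctx, warmup)' signature documented above. The helper 'with_warmup' and its 'set_warmup' parameter are hypothetical names introduced for illustration and are not part of the package.

```r
# Hypothetical helper (not part of the package): evaluate `code` while `ctx`
# is in warmup mode, then switch back to normal inference mode, even if
# `code` raises an error.
# `set_warmup` defaults to the documented llama_set_warmup(); the parameter
# exists only so the helper can be exercised with a stand-in function.
with_warmup <- function(ctx, code, set_warmup = llama_set_warmup) {
  set_warmup(ctx, TRUE)                        # enable warmup mode
  on.exit(set_warmup(ctx, FALSE), add = TRUE)  # always restore normal mode
  force(code)                                  # evaluate the caller's expression
}
```

With a context created by [llama_new_context], this could be called as 'with_warmup(ctx, { ... })', where the braces hold whatever throwaway call is used to touch the weights. Using 'on.exit' means the context is not left in warmup mode if that call fails.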