| llama_synchronize | R Documentation |
Blocks until all pending GPU/async operations for this context are complete. Normally not needed — 'llama_decode' and 'llama_generate' are synchronous — but useful when using low-level batch APIs in async mode.
llama_synchronize(ctx)
| ctx | Context handle returned by [llama_new_context]. |
No return value, called for side effects.
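One situation where an explicit synchronization point matters is timing: if work has been queued asynchronously, a timer stopped before the backend finishes will under-report. The sketch below is an illustration under stated assumptions, not part of the package. `time_synced` is a hypothetical helper, and its `sync` argument defaults to calling `llama_synchronize(ctx)` but can be swapped for a stand-in, so the demo runs without a model or GPU. The low-level batch call itself is not shown, because its signature is not given in this topic.

```r
# Hypothetical helper (not part of the package): time `work()` and make sure
# any still-pending backend operations are counted by synchronizing before
# the clock is stopped.
# `sync` defaults to llama_synchronize(ctx); the default is only evaluated
# when it is actually called, so a stand-in can be passed for testing.
time_synced <- function(ctx, work, sync = function(ctx) llama_synchronize(ctx)) {
  stopifnot(is.function(work), is.function(sync))
  start <- proc.time()[["elapsed"]]
  result <- work()   # e.g. a function that submits a batch in async mode
  sync(ctx)          # block until queued operations for this context finish
  elapsed <- proc.time()[["elapsed"]] - start
  list(result = result, elapsed = elapsed)
}

# Demo with stand-ins only: a fake context and a fake sync that records the
# order of calls. No model is loaded here.
calls <- character()
out <- time_synced(
  ctx  = "fake-ctx",
  work = function() { calls <<- c(calls, "work"); 42 },
  sync = function(ctx) { calls <<- c(calls, "sync"); invisible(NULL) }
)
```

With a real context, the call would look like `time_synced(ctx, function() ...)`, where `ctx` comes from `llama_new_context` and the function body submits the asynchronous work. For 'llama_decode' and 'llama_generate' the extra synchronization should be redundant, since both are described above as synchronous.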