llama_synchronize: Synchronize asynchronous computation

View source: R/llama.R

llama_synchronizeR Documentation

Synchronize asynchronous computation

Description

Blocks until all pending GPU/async operations for this context are complete. Normally not needed — 'llama_decode' and 'llama_generate' are synchronous — but useful when using low-level batch APIs in async mode.

Usage

llama_synchronize(ctx)

Arguments

ctx

Context handle returned by [llama_new_context]

Value

No return value, called for side effects.


llamaR documentation built on May 28, 2026, 1:06 a.m.