llama_memory_breakdown_print: Print memory breakdown by device

llama_memory_breakdown_print    R Documentation

Print memory breakdown by device

Description

Prints a debug summary of how model weights are distributed across compute devices (CPU and GPU). Useful for diagnosing memory allocation when only part of the model is offloaded to the GPU.

Usage

llama_memory_breakdown_print(ctx)

Arguments

ctx

Context handle returned by llama_new_context().

Value

No return value, called for side effects.
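
Examples

A minimal sketch of a typical call, assuming a context handle named ctx already exists. The name ctx and the way it was created are assumptions for illustration; the arguments to llama_new_context() are not documented on this page and are therefore not shown.

```r
## Not run:
library(llamaR)

# Assumption: `ctx` is a context handle obtained earlier from
# llama_new_context(); model loading and context creation are omitted here.

# Print the per-device memory breakdown. The function is called for its
# side effect only, so its result is not assigned.
llama_memory_breakdown_print(ctx)
## End(Not run)
```

Because the function is called only for its printed output, it is most useful right after the context is created, before any generation runs.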


llamaR documentation built on May 28, 2026, 1:06 a.m.