llama_get_embeddings_ith: Get embeddings for the i-th token in the batch
In llamaR: Interface for Large Language Models via 'llama.cpp'

llama_get_embeddings_ith

R Documentation

Description

Returns the embedding vector for a single token position in the batch. Call it after a decode call on a context that has embeddings enabled. Indices are 0-based; negative indices count from the end, so -1 refers to the last token.

Usage

llama_get_embeddings_ith(ctx, i)

Arguments

`ctx`	Context handle returned by `llama_new_context()`
`i`	Integer index of the token (0-based, or negative for reverse indexing)
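
The indexing convention for `i` can be illustrated with a small stand-alone sketch. The helper `resolve_token_index()` and its `n_tokens` argument are hypothetical and not part of llamaR; the sketch only assumes that a negative index is resolved by adding it to the number of available token positions, which matches the "-1 = last token" rule stated in the Description.

```r
# Hypothetical helper, not part of llamaR: maps an index given in the
# "0-based or negative" convention to a plain 0-based position.
resolve_token_index <- function(i, n_tokens) {
  # Negative indices count back from the end: -1 is the last token.
  pos <- if (i < 0L) n_tokens + i else i
  if (pos < 0L || pos >= n_tokens) {
    stop("index ", i, " is out of range for ", n_tokens, " tokens")
  }
  as.integer(pos)
}

resolve_token_index(-1L, 5L)  # position 4, the last of five tokens
resolve_token_index(0L, 5L)   # position 0, the first token
```

Because R itself is 1-based, passing `1L` here selects the second token, not the first.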

Value

A numeric vector of length n_embd, the embedding dimension of the model.
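
Since the result is an ordinary numeric vector, it can be passed to base R arithmetic directly. As a minimal sketch, the hypothetical helper `cosine_similarity()` below (not part of llamaR) compares two vectors of equal length; the short vectors `emb_a` and `emb_b` are made-up stand-ins for real embeddings, which would have length n_embd.

```r
# Hypothetical helper, not part of llamaR: cosine similarity of two
# numeric vectors of the same length.
cosine_similarity <- function(a, b) {
  stopifnot(length(a) == length(b))
  sum(a * b) / (sqrt(sum(a^2)) * sqrt(sum(b^2)))
}

# Made-up stand-ins for two embedding vectors.
emb_a <- c(1, 0, 2)
emb_b <- c(2, 0, 4)

# Parallel vectors give a similarity close to 1.
cosine_similarity(emb_a, emb_b)
```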

Examples

## Not run: 
model <- llama_load_model("model.gguf")
ctx <- llama_new_context(model)
llama_generate(ctx, "Hello world", max_new_tokens = 1L)

# Get the embedding of the last decoded token
# (the context must have been created with embeddings enabled)
emb <- llama_get_embeddings_ith(ctx, -1L)
cat("Embedding dim:", length(emb), "\n")

## End(Not run)

llamaR documentation built on May 28, 2026, 1:06 a.m.