agent_evals: Performance & Benchmarking: Agent Evals

agent_evalsR Documentation

Performance & Benchmarking: Agent Evals

Description

Testing infrastructure for LLM-powered code. Provides testthat integration with custom expectations for evaluating AI agent performance, tool accuracy, and hallucination rates.


aisdk documentation built on May 29, 2026, 9:07 a.m.