agent_evals: Performance & Benchmarking: Agent Evals
In aisdk: Unified Interface for AI Model Providers

agent_evals

R Documentation

Performance & Benchmarking: Agent Evals

Description

Testing infrastructure for LLM-powered code. Provides testthat integration with custom expectations for evaluating AI agent performance, tool accuracy, and hallucination rates.
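
As a rough illustration of what a testthat custom expectation for tool accuracy could look like, here is a minimal sketch built only on `testthat::expect()`. The names `tool_accuracy()` and `expect_tool_accuracy()`, the `min_accuracy` argument, and the tool names are hypothetical and chosen for this example; they are not taken from aisdk, whose actual expectations may differ.

```r
library(testthat)

# Hypothetical helper (not part of aisdk): fraction of the expected tool
# names that appear among the tools the agent actually called.
tool_accuracy <- function(called, expected) {
  if (length(expected) == 0) return(1)
  mean(expected %in% called)
}

# Hypothetical custom expectation: passes when the accuracy reaches the
# threshold, otherwise reports a failure through testthat::expect().
expect_tool_accuracy <- function(called, expected, min_accuracy = 1) {
  acc <- tool_accuracy(called, expected)
  expect(
    acc >= min_accuracy,
    sprintf("Tool accuracy %.2f is below the required %.2f.", acc, min_accuracy)
  )
  invisible(acc)
}

test_that("agent calls the expected tools", {
  # Stand-in for a recorded agent run; a real test would obtain the
  # called tools from the agent under evaluation.
  called <- c("search_docs", "run_query")
  expect_tool_accuracy(called, expected = c("search_docs", "run_query"))
})
```

Keeping the metric in a plain function separate from the expectation lets the same score be reused for benchmarking outside a test run.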

aisdk documentation built on May 29, 2026, 9:07 a.m.
