
agent-evals

Design and implement evaluation frameworks for AI agents. Use when testing agent reasoning quality, building graders, doing error analysis, or establishing regression protection. Framework-agnostic concepts that apply to any SDK.

Maintainer: panaversity
Updated: 1/19/2026
Stars: 109
Forks: 95
quick start

Installation and usage

Installation
$ install --globalskills.sh
Usage

Once installed, you can use this skill by running the following command in your terminal:

skills use agent-evals
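
To make the skill's description concrete, here is a minimal, framework-agnostic sketch of a grader plus a regression check. It is not taken from the skill itself: `EvalCase`, `keyword_grader`, `run_evals`, `check_regression`, and `stub_agent` are hypothetical names chosen for illustration, and the keyword-matching grader is only one simple option among many (rubric-based or model-based graders are common alternatives).

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class EvalCase:
    """One test case: a prompt and keywords a good answer should mention (hypothetical schema)."""
    prompt: str
    expected_keywords: List[str]


def keyword_grader(output: str, case: EvalCase) -> float:
    """Return the fraction of expected keywords found in the output (case-insensitive)."""
    if not case.expected_keywords:
        return 1.0
    text = output.lower()
    hits = sum(1 for kw in case.expected_keywords if kw.lower() in text)
    return hits / len(case.expected_keywords)


def run_evals(
    agent: Callable[[str], str],
    cases: List[EvalCase],
    grader: Callable[[str, EvalCase], float],
) -> float:
    """Run the agent on every case and return the mean grader score."""
    scores = [grader(agent(case.prompt), case) for case in cases]
    return sum(scores) / len(scores) if scores else 0.0


def check_regression(score: float, baseline: float, tolerance: float = 0.05) -> bool:
    """Pass if the new score has not dropped more than `tolerance` below the baseline."""
    return score >= baseline - tolerance


def stub_agent(prompt: str) -> str:
    """Stand-in for a real agent call; any SDK's agent could be wrapped the same way."""
    return "Refunds are accepted within 30 days with a receipt."


cases = [
    EvalCase("What is the refund policy?", ["refund", "30 days"]),
    EvalCase("What shipping options exist?", ["shipping", "express"]),
]
mean_score = run_evals(stub_agent, cases, keyword_grader)
```

Because the agent is passed in as a plain callable, the same harness can wrap any SDK. Storing the baseline score alongside the eval cases lets `check_regression` act as a gate in CI.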