home/categories/machine-learning/panaversity-agentfactory-docs-skills-archive-rare-agent-evals-skill-md
machine-learningdata-ai

agent-evals

Design and implement evaluation frameworks for AI agents. Use when testing agent reasoning quality, building graders, doing error analysis, or establishing regression protection. Framework-agnostic concepts that apply to any SDK.

panaversity
maintainer
panaversity
Обновлено 1/19/2026
Звёзды
109
Форки
95
quick start

Installation and usage

Design and implement evaluation frameworks for AI agents. Use when testing agent reasoning quality, building graders, doing error analysis, or establishing regression protection. Framework-agnostic concepts that apply to any SDK.

Установка
$ install --globalskills.sh
Использование

После установки вы можете использовать этот skill, выполнив следующую команду в терминале:

skills use agent-evals