machine-learning, data-ai
agent-evals
Design and implement evaluation frameworks for AI agents. Use when testing agent reasoning quality, building graders, doing error analysis, or establishing regression protection. Framework-agnostic concepts that apply to any SDK.
maintainer
panaversity
Updated 1/19/2026
Stars
109
Forks
95
quick start
Installation and usage
Installation
$ install --globalskills.sh
Usage
After installation, you can use this skill by running the following command in your terminal:
skills use agent-evals