home/categories/machine-learning/panaversity-agentfactory-docs-skills-archive-rare-agent-evals-skill-md
machine-learningdata-ai
agent-evals
Design and implement evaluation frameworks for AI agents. Use when testing agent reasoning quality, building graders, doing error analysis, or establishing regression protection. Framework-agnostic concepts that apply to any SDK.
maintainer
panaversity
Mis à jour 1/19/2026
Étoiles
109
Forks
95
quick start
Installation and usage
Design and implement evaluation frameworks for AI agents. Use when testing agent reasoning quality, building graders, doing error analysis, or establishing regression protection. Framework-agnostic concepts that apply to any SDK.
Installation
$ install --globalskills.sh
Utilisation
Après l'installation, vous pouvez utiliser ce skill en exécutant la commande suivante dans votre terminal :
skills use agent-evals