home/categories/machine-learning/panaversity-agentfactory-docs-skills-archive-rare-agent-evals-skill-md
machine-learningdata-ai
agent-evals
Design and implement evaluation frameworks for AI agents. Use when testing agent reasoning quality, building graders, doing error analysis, or establishing regression protection. Framework-agnostic concepts that apply to any SDK.
maintainer
panaversity
Atualizado 1/19/2026
Estrelas
109
Forks
95
quick start
Installation and usage
Design and implement evaluation frameworks for AI agents. Use when testing agent reasoning quality, building graders, doing error analysis, or establishing regression protection. Framework-agnostic concepts that apply to any SDK.
Instalação
$ install --globalskills.sh
Uso
Depois de instalar, você pode usar esta skill executando o seguinte comando no terminal:
skills use agent-evals