home/categories/llm-ai/muratcankoylan-agent-skills-for-context-engineering-skills-evaluation-skill-md
llm-aidata-ai
evaluation
This skill should be used when the user asks to "evaluate agent performance", "build test framework", "measure agent quality", "create evaluation rubrics", or mentions LLM-as-judge, multi-dimensional evaluation, agent testing, or quality gates for agent pipelines.
maintainer
muratcankoylan
Atualizado 3/18/2026
Estrelas
14945
Forks
1173
quick start
Installation and usage
This skill should be used when the user asks to "evaluate agent performance", "build test framework", "measure agent quality", "create evaluation rubrics", or mentions LLM-as-judge, multi-dimensional evaluation, agent testing, or quality gates for agent pipelines.
Instalação
$ install --globalskills.sh
Uso
Depois de instalar, você pode usar esta skill executando o seguinte comando no terminal:
skills use evaluation