
evaluation

This skill should be used when the user asks to "evaluate agent performance", "build test framework", "measure agent quality", "create evaluation rubrics", or mentions LLM-as-judge, multi-dimensional evaluation, agent testing, or quality gates for agent pipelines.

Maintainer: muratcankoylan
Updated 3/18/2026
Stars: 14945
Forks: 1173
Quick start

Installation and usage

Installation
$ install --globalskills.sh
Usage

After installation, you can use this skill by running the following command in your terminal:

skills use evaluation