home/categories/llm-ai/muratcankoylan-agent-skills-for-context-engineering-skills-evaluation-skill-md
llm-aidata-ai
evaluation
This skill should be used when the user asks to "evaluate agent performance", "build test framework", "measure agent quality", "create evaluation rubrics", or mentions LLM-as-judge, multi-dimensional evaluation, agent testing, or quality gates for agent pipelines.
maintainer
muratcankoylan
Обновлено 3/18/2026
Звёзды
14945
Форки
1173
quick start
Installation and usage
This skill should be used when the user asks to "evaluate agent performance", "build test framework", "measure agent quality", "create evaluation rubrics", or mentions LLM-as-judge, multi-dimensional evaluation, agent testing, or quality gates for agent pipelines.
Установка
$ install --globalskills.sh
Использование
После установки вы можете использовать этот skill, выполнив следующую команду в терминале:
skills use evaluation