home/categories/llm-ai/muratcankoylan-agent-skills-for-context-engineering-skills-evaluation-skill-md
llm-aidata-ai

evaluation

This skill should be used when the user asks to "evaluate agent performance", "build test framework", "measure agent quality", "create evaluation rubrics", or mentions LLM-as-judge, multi-dimensional evaluation, agent testing, or quality gates for agent pipelines.

muratcankoylan
maintainer
muratcankoylan
آخر تحديث 3/18/2026
النجوم
14945
التفرعات
1173
quick start

Installation and usage

This skill should be used when the user asks to "evaluate agent performance", "build test framework", "measure agent quality", "create evaluation rubrics", or mentions LLM-as-judge, multi-dimensional evaluation, agent testing, or quality gates for agent pipelines.

التثبيت
$ install --globalskills.sh
الاستخدام

بعد التثبيت، يمكنك استخدام هذه المهارة بتشغيل الأمر التالي في الطرفية:

skills use evaluation