home/categories/academic/muratcankoylan-agent-skills-for-context-engineering-skills-advanced-evaluation-skill-md
academicresearch

advanced-evaluation

This skill should be used when the user asks to "implement LLM-as-judge", "compare model outputs", "create evaluation rubrics", "mitigate evaluation bias", or mentions direct scoring, pairwise comparison, position bias, evaluation pipelines, or automated quality assessment.

muratcankoylan
maintainer
muratcankoylan
اپ ڈیٹ ہوا 3/18/2026
اسٹارز
14945
فورکس
1173
quick start

Installation and usage

This skill should be used when the user asks to "implement LLM-as-judge", "compare model outputs", "create evaluation rubrics", "mitigate evaluation bias", or mentions direct scoring, pairwise comparison, position bias, evaluation pipelines, or automated quality assessment.

انسٹالیشن
$ install --globalskills.sh
استعمال

انسٹال کرنے کے بعد، آپ یہ اسکل ٹرمینل میں درج ذیل کمانڈ چلا کر استعمال کر سکتے ہیں:

skills use advanced-evaluation