home/categories/machine-learning/sickn33-antigravity-awesome-skills-skills-advanced-evaluation-skill-md
machine-learningdata-ai

advanced-evaluation

This skill should be used when the user asks to "implement LLM-as-judge", "compare model outputs", "create evaluation rubrics", "mitigate evaluation bias", or mentions direct scoring, pairwise comparison, position bias, evaluation pipelines, or automated quality assessment.

sickn33
maintainer
sickn33
Обновлено 3/20/2026
Звёзды
32093
Форки
5340
quick start

Installation and usage

This skill should be used when the user asks to "implement LLM-as-judge", "compare model outputs", "create evaluation rubrics", "mitigate evaluation bias", or mentions direct scoring, pairwise comparison, position bias, evaluation pipelines, or automated quality assessment.

Установка
$ install --globalskills.sh
Использование

После установки вы можете использовать этот skill, выполнив следующую команду в терминале:

skills use advanced-evaluation