evaluation

Name: evaluation
Author: Treytucker05

Evaluate agent and LLM outputs. Use when asked to evaluate agent performance, build evaluation frameworks, implement LLM-as-judge, compare model outputs, create rubrics, mitigate evaluation bias, or design evaluation pipelines and quality gates.

ソースを表示 productivity-tools

maintainer

Treytucker05

更新日 1/20/2026

スター

フォーク

quick start

Installation and usage

インストール

$ install --globalskills.sh

使い方

インストール後、ターミナルで以下のコマンドを実行してこのスキルを使用できます：

skills use evaluation