eval-framework
Framework for capturing, storing, and comparing AI evaluations to measure consistency and completeness. Use when: comparing reviews, measuring evaluation quality, running reproducibility tests, auditing AI outputs, validating findings across runs. Triggers: "compare evaluations", "measure consistency", "evaluation framework", "reproducible review", "compare reviews", "validate findings", "audit evaluation".
Installation and usage
Framework for capturing, storing, and comparing AI evaluations to measure consistency and completeness. Use when: comparing reviews, measuring evaluation quality, running reproducibility tests, auditing AI outputs, validating findings across runs. Triggers: "compare evaluations", "measure consistency", "evaluation framework", "reproducible review", "compare reviews", "validate findings", "audit evaluation".
インストール後、ターミナルで以下のコマンドを実行してこのスキルを使用できます:
skills use eval-framework