eval-framework

Name: eval-framework
Author: maxxentropy

Framework for capturing, storing, and comparing AI evaluations to measure consistency and completeness. Use when: comparing reviews, measuring evaluation quality, running reproducibility tests, auditing AI outputs, validating findings across runs. Triggers: "compare evaluations", "measure consistency", "evaluation framework", "reproducible review", "compare reviews", "validate findings", "audit evaluation".
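
As a rough illustration of what "measuring consistency and completeness" across runs can mean, here is a minimal Python sketch. It does not show this skill's actual implementation or storage format. It assumes each evaluation run can be reduced to a set of finding identifiers, and it swaps in mean pairwise Jaccard overlap as a stand-in consistency metric. The names `jaccard`, `consistency`, `completeness`, and the sample finding labels are all hypothetical.

```python
from itertools import combinations


def jaccard(a: set[str], b: set[str]) -> float:
    """Overlap of two finding sets: |a & b| / |a | b| (1.0 if both are empty)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def consistency(runs: list[set[str]]) -> float:
    """Mean pairwise Jaccard overlap across runs (assumed metric, not the skill's)."""
    pairs = list(combinations(runs, 2))
    if not pairs:
        return 1.0  # zero or one run is trivially consistent with itself
    return sum(jaccard(a, b) for a, b in pairs) / len(pairs)


def completeness(run: set[str], reference: set[str]) -> float:
    """Fraction of the reference findings that a single run recovered."""
    if not reference:
        return 1.0
    return len(run & reference) / len(reference)


# Hypothetical findings from three runs of the same review.
runs = [
    {"sql-injection", "missing-auth", "n+1-query"},
    {"sql-injection", "missing-auth"},
    {"sql-injection", "n+1-query", "dead-code"},
]

# Use the union of all runs as a stand-in reference set.
reference = set().union(*runs)

print(f"consistency: {consistency(runs):.3f}")
for i, run in enumerate(runs):
    print(f"run {i} completeness: {completeness(run, reference):.2f}")
```

Taking the union of all runs as the reference set is only one possible choice. A hand-curated list of expected findings would give a stricter completeness score.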



Updated 12/23/2025


Quick start

Installation and usage

Installation

$ install --globalskills.sh

Usage

Once installed, you can use this skill by running the following command in your terminal:

$ skills use eval-framework