compare-attempts

This SOP compares evaluated brazil-bench attempts across multiple dimensions to produce a ranked leaderboard and detailed comparison summary. It supports up to 10 attempts in the "Top 10" format, automatically pruning lower-ranked entries when more are added.
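
The ranking and pruning behavior described above could be sketched roughly as follows. This is only an illustration, not the skill's actual implementation: the `Attempt` type, the dimension names, the unweighted-mean scoring, and the `add_attempt` helper are all assumptions introduced here.

```python
from dataclasses import dataclass

MAX_ENTRIES = 10  # the "Top 10" cap described above


@dataclass
class Attempt:
    name: str
    scores: dict[str, float]  # hypothetical mapping: dimension name -> score

    @property
    def total(self) -> float:
        # Assumption: overall rank uses an unweighted mean across dimensions.
        if not self.scores:
            return 0.0
        return sum(self.scores.values()) / len(self.scores)


def add_attempt(board: list[Attempt], new: Attempt) -> list[Attempt]:
    """Return a new leaderboard with `new` added, ranked, and pruned to the cap."""
    ranked = sorted(board + [new], key=lambda a: a.total, reverse=True)
    return ranked[:MAX_ENTRIES]  # lower-ranked entries beyond the cap are dropped


# Usage: adding 12 attempts leaves only the 10 highest-scoring ones.
board: list[Attempt] = []
for i in range(12):
    scores = {"tests": float(i), "docs": float(i)}  # made-up dimensions
    board = add_attempt(board, Attempt(f"attempt-{i}", scores))

print([a.name for a in board])
```

In this sketch the two lowest-scoring attempts fall off the board once the cap is exceeded. A real comparison would likely weight dimensions differently and break ties explicitly.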

Maintainer: brazil-bench
Updated: 1/13/2026
Installation and usage

Installation
$ install --globalskills.sh
Usage

Once installed, you can use this skill by running the following command in your terminal:

skills use compare-attempts