home/categories/debugging/brazil-bench-pourpoise-skills-compare-attempts-skill-md
debuggingtools

compare-attempts

This SOP compares evaluated brazil-bench attempts across multiple dimensions to produce a ranked leaderboard and detailed comparison summary. It supports up to 10 attempts in the "Top 10" format, automatically pruning lower-ranked entries when more are added.

brazil-bench
maintainer
brazil-bench
更新于 1/13/2026
星标
9
分支
0
quick start

Installation and usage

This SOP compares evaluated brazil-bench attempts across multiple dimensions to produce a ranked leaderboard and detailed comparison summary. It supports up to 10 attempts in the "Top 10" format, automatically pruning lower-ranked entries when more are added.

安装
$ install --globalskills.sh
使用

安装后,您可以通过在终端运行以下命令来使用此技能:

skills use compare-attempts