ml-model-eval-benchmark

Compare model candidates using weighted metrics and deterministic ranking outputs. Use for benchmark leaderboards and model promotion decisions.
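
As an illustration of what weighted-metric comparison with deterministic ranking can look like, here is a minimal Python sketch. It is not the skill's actual implementation: the function name `rank_candidates`, the metric names, the weights, and the sample scores are all hypothetical, and it assumes that higher is better for every metric. Ties are broken by model name so that the same inputs always produce the same leaderboard.

```python
from typing import Dict, List, Tuple


def rank_candidates(
    metrics: Dict[str, Dict[str, float]],
    weights: Dict[str, float],
) -> List[Tuple[str, float]]:
    """Return (model, weighted score) pairs sorted best-first.

    Hypothetical helper: assumes every metric is higher-is-better and
    that each model reports every metric named in `weights`.
    """
    total = sum(weights.values())
    if total <= 0:
        raise ValueError("weights must sum to a positive value")

    scored = []
    for model, values in metrics.items():
        # Weighted average of the model's metrics, normalized by total weight.
        score = sum(weights[name] * values[name] for name in weights) / total
        # Round so floating-point noise cannot reorder tied models.
        scored.append((model, round(score, 6)))

    # Score descending, then model name ascending as a deterministic tie-break.
    return sorted(scored, key=lambda pair: (-pair[1], pair[0]))


# Made-up sample data, for illustration only.
candidates = {
    "model-b": {"accuracy": 0.90, "f1": 0.80},
    "model-a": {"accuracy": 0.80, "f1": 0.90},
    "model-c": {"accuracy": 0.70, "f1": 0.75},
}
weights = {"accuracy": 0.5, "f1": 0.5}

leaderboard = rank_candidates(candidates, weights)
for position, (model, score) in enumerate(leaderboard, start=1):
    print(position, model, score)
```

In this sample, model-a and model-b tie on weighted score, so the name-based tie-break decides their order regardless of the order in which the candidates were supplied.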

Maintainer: openclaw
Updated: 2/28/2026
Stars: 4001
Forks: 1095
Quick start

Installation and usage

Installation
$ install --globalskills.sh
Usage

Once installed, you can use this skill by running the following command in your terminal:

$ skills use ml-model-eval-benchmark