ml-model-eval-benchmark

Name: ml-model-eval-benchmark
Author: openclaw

Compare model candidates using weighted metrics and deterministic ranking outputs. Use for benchmark leaderboards and model promotion decisions.

Maintainer: openclaw
Updated: 2/28/2026
Stars: 4001
Forks: 1095

Quick start

Installation and usage

Installation

$ install --globalskills.sh

Usage

Once installed, run the following command in your terminal to use the skill:

skills use ml-model-eval-benchmark
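
The skill's description mentions weighted metrics and deterministic ranking, but this page does not show how either is computed. The sketch below illustrates one common way to do it and is not the skill's actual implementation: the names `Candidate`, `weighted_score`, and `rank`, the metric names, and the weights are all hypothetical. It assumes every metric is already scaled so that higher is better, and it breaks ties by candidate name so repeated runs produce the same order.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    """Hypothetical container for one model candidate and its metric values."""
    name: str
    metrics: dict[str, float]


def weighted_score(metrics: dict[str, float], weights: dict[str, float]) -> float:
    """Weighted average of the metrics named in `weights` (higher is better)."""
    total_weight = sum(weights.values())
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive value")
    missing = [key for key in weights if key not in metrics]
    if missing:
        raise KeyError(f"candidate is missing metrics: {missing}")
    return sum(weights[key] * metrics[key] for key in weights) / total_weight


def rank(candidates: list[Candidate], weights: dict[str, float]) -> list[tuple[str, float]]:
    """Return (name, score) pairs, best first.

    Scores are rounded to 6 decimals so floating-point noise cannot reorder
    near-identical candidates; ties are broken alphabetically by name.
    """
    scored = [(c.name, round(weighted_score(c.metrics, weights), 6)) for c in candidates]
    return sorted(scored, key=lambda pair: (-pair[1], pair[0]))


# Illustrative inputs only; these numbers are made up for the example.
example_weights = {"accuracy": 0.5, "f1": 0.3, "latency_score": 0.2}
example_candidates = [
    Candidate("model-b", {"accuracy": 0.90, "f1": 0.80, "latency_score": 0.70}),
    Candidate("model-c", {"accuracy": 0.95, "f1": 0.70, "latency_score": 0.60}),
    Candidate("model-a", {"accuracy": 0.90, "f1": 0.80, "latency_score": 0.70}),
]

for position, (name, score) in enumerate(rank(example_candidates, example_weights), start=1):
    print(position, name, score)
```

With these inputs, `model-a` and `model-b` tie on score and are ordered by name, and `model-c` ranks last despite its higher accuracy because of the weights placed on the other two metrics. A sort key that combines the score with a stable secondary field is what keeps the leaderboard order independent of input order.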