home/categories/llm-ai/alirezarezvani-claude-skills-engineering-agenthub-skills-eval-skill-md
llm-aidata-ai

eval

Evaluate and rank agent results by metric or LLM judge for an AgentHub session.

alirezarezvani
maintainer
alirezarezvani
Updated 3/17/2026
Stars
10408
Forks
1307
quick start

Installation and usage

Evaluate and rank agent results by metric or LLM judge for an AgentHub session.

Installation
$ install --globalskills.sh
Usage

Once installed, you can use this skill by running the following command in your terminal:

skills use eval