home/categories/testing/elastic-kibana-agents-skills-evals-write-spec-skill-md
testingtesting-security

evals-write-spec

Write LLM evaluation spec files with datasets, tasks, and evaluators using the @kbn/evals Playwright fixture. Use when authoring new eval specs, adding datasets or evaluators, or debugging evaluation test failures.

elastic
maintainer
elastic
Updated 3/3/2026
Stars
21033
Forks
8549
quick start

Installation and usage

Write LLM evaluation spec files with datasets, tasks, and evaluators using the @kbn/evals Playwright fixture. Use when authoring new eval specs, adding datasets or evaluators, or debugging evaluation test failures.

Installation
$ install --globalskills.sh
Usage

Once installed, you can use this skill by running the following command in your terminal:

skills use evals-write-spec