home/categories/lab-tools/comeonoliver-skillshub-skills-github-awesome-copilot-eval-driven-dev-skill-md

lab-toolsresearch

eval-driven-dev

Name: eval-driven-dev
Author: ComeOnOliver

Add instrumentation, build golden datasets, write eval-based tests, run them, root-cause failures, and iterate — Ensure your Python LLM application works correctly. Make sure to use this skill whenever a user is developing, testing, QA-ing, evaluating, or benchmarking a Python project that calls an LLM. Use for making sure an LLM application works correctly, catching regressions after prompt changes, fixing unexpected behavior, or validating output quality before shipping.

查看源码 lab-tools

maintainer

ComeOnOliver

更新于 3/23/2026

星标

分支

quick start

Installation and usage

安装

$ install --globalskills.sh

使用

安装后，您可以通过在终端运行以下命令来使用此技能：

skills use eval-driven-dev