

eval-driven-dev

Name: eval-driven-dev
Author: ComeOnOliver

Add instrumentation, build golden datasets, write eval-based tests, run them, root-cause failures, and iterate to ensure your Python LLM application works correctly. Use this skill whenever a user is developing, testing, QA-ing, evaluating, or benchmarking a Python project that calls an LLM. Typical uses: confirming that an LLM application works correctly, catching regressions after prompt changes, fixing unexpected behavior, and validating output quality before shipping.
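As a rough illustration of what "golden dataset" and "eval-based test" can mean in practice, here is a minimal sketch. It is not taken from the skill itself: `fake_llm_app`, `GoldenCase`, `GOLDEN_DATASET`, and `run_evals` are hypothetical names, and the stub function stands in for whatever code in your project actually calls an LLM. The check used here (a case-insensitive substring match) is only one simple grading strategy among many.

```python
from dataclasses import dataclass


@dataclass
class GoldenCase:
    """One golden example: an input prompt and a string the output must contain."""
    prompt: str
    must_contain: str


def fake_llm_app(prompt: str) -> str:
    """Hypothetical stand-in for the application code that calls an LLM."""
    canned = {
        "What is the capital of France?": "The capital of France is Paris.",
        "What is 2 + 2?": "2 + 2 equals 4.",
    }
    return canned.get(prompt, "I don't know.")


# A tiny golden dataset; a real one would be built from reviewed examples.
GOLDEN_DATASET = [
    GoldenCase("What is the capital of France?", "Paris"),
    GoldenCase("What is 2 + 2?", "4"),
]


def run_evals(app, dataset):
    """Run every golden case through `app`; return (pass_rate, failures)."""
    failures = []
    for case in dataset:
        output = app(case.prompt)
        # Simple grader: case-insensitive substring match.
        if case.must_contain.lower() not in output.lower():
            failures.append((case, output))
    pass_rate = 1 - len(failures) / len(dataset)
    return pass_rate, failures


pass_rate, failures = run_evals(fake_llm_app, GOLDEN_DATASET)
print(f"pass rate: {pass_rate:.0%}, failures: {len(failures)}")
```

Rerunning `run_evals` after each prompt or model change gives a quick regression signal, and the returned `failures` list keeps the failing prompt and output together for root-cause analysis.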


Maintainer: ComeOnOliver

Updated: 3/23/2026


quick start

Installation and usage

Installation

$ install --globalskills.sh

Usage

After installation, run the following command in your terminal to use this skill:

skills use eval-driven-dev