eval-driven-dev

Name: eval-driven-dev
Author: github

Set up eval-based QA for Python LLM applications: instrument the app, build golden datasets, write and run eval tests, and iterate on failures. ALWAYS USE THIS SKILL when the user asks to set up QA, add tests, add evals, evaluate, benchmark, fix wrong behaviors, improve quality, or do quality assurance for any Python project that calls an LLM model.

سورس دیکھیں sales-marketing

maintainer

github

اپ ڈیٹ ہوا 4/10/2026

اسٹارز

29277

فورکس

3455

quick start

Installation and usage

انسٹالیشن

$ install --globalskills.sh

استعمال

انسٹال کرنے کے بعد، آپ یہ اسکل ٹرمینل میں درج ذیل کمانڈ چلا کر استعمال کر سکتے ہیں:

skills use eval-driven-dev