home/categories/debugging/posthog-posthog-products-llm-analytics-skills-exploring-llm-evaluations-skill-md

debuggingtools

exploring-llm-evaluations

Name: exploring-llm-evaluations
Author: PostHog

Investigate LLM analytics evaluations of both types — `hog` (deterministic code-based) and `llm_judge` (LLM-prompt-based). Find existing evaluations, inspect their configuration, run them against specific generations, query individual pass/fail results, and generate AI-powered summaries of patterns across many runs. Use when the user asks to debug why an evaluation is failing, surface common failure modes, compare results across filters, dry-run a Hog evaluator, prototype a new LLM-judge prompt, or manage the evaluation lifecycle (create, update, enable/disable, delete).

সোর্স দেখুন debugging

maintainer

PostHog

আপডেট হয়েছে 4/8/2026

স্টার

32541

ফর্ক

2505

quick start

Installation and usage

ইনস্টলেশন

$ install --globalskills.sh

ব্যবহার

ইনস্টল করার পর, টার্মিনালে নিচের কমান্ড চালিয়ে আপনি এই স্কিল ব্যবহার করতে পারবেন:

skills use exploring-llm-evaluations