home/categories/testing/dwmkerr-claude-toolkit-plugins-toolkit-skills-anthropic-evaluations-skill-md
testingtesting-security
anthropic-evaluations
This skill should be used when the user asks to "create evals", "evaluate an agent", "build evaluation suite", or mentions agent testing, graders, or benchmarks. Also suggest when building coding agents, conversational agents, or research agents that need quality assurance.
maintainer
dwmkerr
अपडेट किया गया 1/19/2026
स्टार
1
फोर्क
0
quick start
Installation and usage
This skill should be used when the user asks to "create evals", "evaluate an agent", "build evaluation suite", or mentions agent testing, graders, or benchmarks. Also suggest when building coding agents, conversational agents, or research agents that need quality assurance.
इंस्टॉलेशन
$ install --globalskills.sh
उपयोग
इंस्टॉल करने के बाद, आप टर्मिनल में यह कमांड चलाकर इस स्किल का उपयोग कर सकते हैं:
skills use anthropic-evaluations