computational-chemistryresearch
evaluation-v2
Anthropic-aligned medical safety evaluation with pass^k metrics, failure taxonomy, and anti-gaming graders
maintainer
GOATnote-Inc
更新於 1/17/2026
星標
3
分支
1
quick start
Installation and usage
Anthropic-aligned medical safety evaluation with pass^k metrics, failure taxonomy, and anti-gaming graders
安裝
$ install --globalskills.sh
使用
安裝後,您可以通過在終端運行以下命令來使用此技能:
skills use evaluation-v2