computational-chemistry
research
evaluation-v2
Anthropic-aligned medical safety evaluation with pass^k metrics, failure taxonomy, and anti-gaming graders
maintainer
GOATnote-Inc
Updated 1/17/2026
Stars
3
Forks
1
quick start
Installation and usage
Installation
$ install --globalskills.sh
Usage
After installing, you can use this skill by running the following command in your terminal:
skills use evaluation-v2