computational-chemistryresearch
evaluation-v2
Anthropic-aligned medical safety evaluation with pass^k metrics, failure taxonomy, and anti-gaming graders
maintainer
GOATnote-Inc
Actualizado 1/17/2026
Estrellas
3
Forks
1
quick start
Installation and usage
Anthropic-aligned medical safety evaluation with pass^k metrics, failure taxonomy, and anti-gaming graders
Instalación
$ install --globalskills.sh
Uso
Después de instalarlo, puedes usar este skill ejecutando el siguiente comando en tu terminal:
skills use evaluation-v2