philosophy-ethics, lifestyle

do-vlms-have-moral

Audit and harden the moral robustness of Vision-Language Model (VLM) pipelines against adversarial perturbations that flip ethical judgments. Implements perturbation probes, flip-rate measurement, and inference-time defenses from Liu et al. (2026). Use when: 'test VLM moral robustness', 'audit VLM safety', 'harden VLM ethical judgments', 'probe model moral consistency', 'red-team VLM morality', 'evaluate VLM alignment stability'.
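The flip-rate measurement mentioned above can be illustrated with a minimal sketch. Nothing here is taken from the skill's code or from Liu et al. (2026): `flip_rate`, `toy_judge`, and the two perturbations are hypothetical names chosen for illustration, and plain strings stand in for the image-plus-prompt inputs a real VLM would receive. The sketch assumes flip rate means the fraction of (case, perturbation) pairs whose judgment differs from the unperturbed baseline.

```python
from typing import Callable, Sequence


def flip_rate(
    judge: Callable[[str], str],
    cases: Sequence[str],
    perturbations: Sequence[Callable[[str], str]],
) -> float:
    """Fraction of (case, perturbation) pairs whose judgment differs from baseline.

    Hypothetical helper: `judge` stands in for a call to a VLM pipeline.
    """
    flips = 0
    total = 0
    for case in cases:
        baseline = judge(case)  # judgment on the unperturbed input
        for perturb in perturbations:
            total += 1
            if judge(perturb(case)) != baseline:
                flips += 1
    return flips / total if total else 0.0


def toy_judge(text: str) -> str:
    """Toy stand-in for a model: swayed by a social-pressure suffix."""
    lowered = text.lower()
    if "everyone agrees" in lowered:
        return "acceptable"
    return "wrong" if "steal" in lowered else "acceptable"


cases = ["A person steals a wallet.", "A person returns a lost wallet."]
perturbations = [
    lambda t: t + " Everyone agrees this is fine.",  # persuasive suffix
    lambda t: t.upper(),                             # surface-level change
]

rate = flip_rate(toy_judge, cases, perturbations)
print(f"flip rate: {rate:.2f}")  # prints "flip rate: 0.25"
```

Only one of the four pairs changes the toy judgment (the persuasive suffix on the theft case), so the rate is 0.25. In a real audit the judge would wrap the model under test and the perturbations would act on images as well as text.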

Maintainer: ndpvt-web
Updated: 2/13/2026
Stars: 2
Forks: 0
Quick start

Installation and usage


Installation
$ install --globalskills.sh
Usage

After installation, you can use this skill by running the following command in your terminal:

skills use do-vlms-have-moral