do-vlms-have-moral

Audit and harden the moral robustness of Vision-Language Model (VLM) pipelines against adversarial perturbations that flip ethical judgments. Implements perturbation probes, flip-rate measurement, and inference-time defenses from Liu et al. (2026). Use when: 'test VLM moral robustness', 'audit VLM safety', 'harden VLM ethical judgments', 'probe model moral consistency', 'red-team VLM morality', 'evaluate VLM alignment stability'.

Посмотреть исходный код philosophy-ethics

maintainer

ndpvt-web

Обновлено 2/13/2026

Звёзды

Форки

quick start

Installation and usage

Установка

$ install --globalskills.sh

Использование

После установки вы можете использовать этот skill, выполнив следующую команду в терминале:

skills use do-vlms-have-moral