testingtesting-security
aiml-moderation-content
Content moderation benchmark with 3 variants: guard toxic text (CSV), guard user input prompts (TXT), guard model output responses (JSONL). Use when: testing ISC on content moderation, generating toxic text samples, attack prompts, or unsafe model responses. Keywords: moderation, toxic, hate speech, jailbreak, harmful compliance.
maintainer
wuyoscar
更新日 4/10/2026
スター
785
フォーク
125
quick start
Installation and usage
Content moderation benchmark with 3 variants: guard toxic text (CSV), guard user input prompts (TXT), guard model output responses (JSONL). Use when: testing ISC on content moderation, generating toxic text samples, attack prompts, or unsafe model responses. Keywords: moderation, toxic, hate speech, jailbreak, harmful compliance.
インストール
$ install --globalskills.sh
使い方
インストール後、ターミナルで以下のコマンドを実行してこのスキルを使用できます:
skills use aiml-moderation-content