Testing & Security

QA, penetration testing, and code quality.

9326 skills
code-quality
9.4K

go-review

Go code review guidelines for the Cog codebase

replicate
testing-security
code-quality
9.4K

python-review

Python code review guidelines for the Cog SDK

replicate
testing-security
code-quality
9.4K

rust-review

Rust code review guidelines for Coglet

replicate
testing-security
code-quality
9.3K

python-development

Coding standards, conventions, and patterns for developing Python code in the Agent Framework repository. Use this when writing or modifying Python source files in the python/ directory.

microsoft
testing-security
testing
9.3K

verify-dotnet-samples

How to build, run and verify the .NET sample projects in the Agent Framework repository. Use this when a user wants to verify that the samples still function as expected.

microsoft
testing-security
testing
9.3K

python-testing

Guidelines for writing and running tests in the Agent Framework Python codebase. Use this when creating, modifying, or running tests.

microsoft
testing-security
testing
9.2K

phoenix-playwright-tests

Write Playwright E2E tests for the Phoenix AI observability platform. Use when creating, updating, or debugging Playwright tests, or when the user asks about testing UI features, writing E2E tests, or automating browser interactions for Phoenix.

Arize-ai
testing-security
testing
9.2K

phoenix-client-development

Guide for the phoenix-client TypeScript package — experiment lifecycle, tracer provider management, and test conventions.

Arize-ai
testing-security
testing
9.2K

grouped-tools-test

Test skill for groupedTools. When executing this skill, use the record_result tool to record the result value.

alibaba
testing-security
testing
9.2K

sample-skill

Sample skill fixture for classpath registry enhancement tests.

alibaba
testing-security
code-quality
8.9K

risingwave-rust-analyzer

Use rust-analyzer CLI and editor/LSP settings to inspect, diagnose, and refactor RisingWave Rust code. Use when working in the RisingWave workspace and you need fast semantic analysis, unresolved-reference checks, macro-aware navigation, structured search/replace, or guidance on choosing the correct crate root and feature flags before heavier cargo or risedev commands.

risingwavelabs
testing-security
code-quality
8.9K

code-review

Review changed code against project standards. Checks for missing tests, dead code, type safety, lint issues, and coding conventions. Run after completing any implementation work.

vectorize-io
testing-security
security
8.8K

create-new-gosec-rule

Propose and implement a new generic gosec rule from a Go security issue description.

securego
testing-security
testing
8.7K

testing-android-code

This skill should be used when writing or reviewing tests for Android code in Bitwarden. Triggered by "BaseViewModelTest", "BitwardenComposeTest", "BaseServiceTest", "stateEventFlow", "bufferedMutableSharedFlow", "FakeDispatcherManager", "expectNoEvents", "assertCoroutineThrows", "createMockCipher", "createMockSend", "asSuccess", "Why is my Bitwarden test failing?", or testing questions about ViewModels, repositories, Compose screens, or data sources in Bitwarden.

bitwarden
testing-security
code-quality
8.7K

perform-android-preflight-checklist

Quality gate checklist to run before committing or creating a PR. Use when finishing implementation, checking work quality, or preparing to commit. Triggered by "self review", "check my work", "ready to commit", "done implementing", "review checklist", "quality check".

bitwarden
testing-security
testing
8.7K

build-test-verify

Build, test, lint, and deploy commands for the Bitwarden Android project. Use when running tests, building APKs/AABs, running lint/detekt, deploying, using fastlane, or discovering codebase structure. Triggered by "run tests", "build", "gradle", "lint", "detekt", "deploy", "fastlane", "assemble", "verify", "coverage".

bitwarden
testing-security
testing
8.7K

harness-eval

This skill should be used when the user asks to "test the harness", "run integration tests", "validate features with real API", "test with real model calls", "run agent loop tests", "verify end-to-end", or needs to verify OpenHarness features on a real codebase with actual LLM calls.

HKUDS
testing-security
code-quality
8.6K

upgrade-dep

Upgrade a dependency in the Sentry JavaScript SDK. Use when upgrading packages, bumping versions, or fixing security vulnerabilities via dependency updates.

getsentry
testing-security
security
8.6K

skill-scanner

Scan agent skills for security issues. Use when asked to "scan a skill", "audit a skill", "review skill security", "check skill for injection", "validate SKILL.md", or assess whether an agent skill is safe to install. Checks for prompt injection, malicious scripts, excessive permissions, secret exposure, and supply chain risks.

getsentry
testing-security
testing
8.6K

bump-size-limit

Bump size limits in .size-limit.js when the size-limit CI check is failing. Use when the user mentions size limit failures, bundle size checks failing, CI size check errors, or needs to update size-limit thresholds. Also use when the user says "bumpSizeLimit", "fix size limit", "size check failing", or "update bundle size limits".

getsentry
testing-security
testing
8.6K

e2e

Run E2E tests for Sentry JavaScript SDK test applications

getsentry
testing-security
code-quality
8.6K

validate-prompts

Validate extracted Claude Code prompt data by reading files and checking rules directly — no external scripts or API calls needed. Checks JSON structure (30+ rules), generated markdown files, README consistency, and semantic variable name correctness. Use whenever asked to validate prompt JSON files, check generated output, run pre-release checks, debug validation errors, or analyze variable naming. Trigger phrases: "validate", "check prompts", "run validation", "verify prompts", "structural checks", "semantic check", "release prep". Also use when investigating a specific validation rule (A1–A21, B1–B6, C1–C7, A23) or when encountering errors in prompt data.

Piebald-AI
testing-security
code-quality
8.6K

verify-changelog

Verify changelog entries against actual prompt diffs by reading both JSON files and evaluating accuracy directly. Compares two prompt JSON versions (old → new), identifies added/removed/changed prompts, and checks that a human-written changelog accurately describes the changes. Use whenever writing, reviewing, or verifying a changelog entry for a new Claude Code version, when comparing prompt versions, when preparing a release, or when asked to "verify changelog", "check changelog", "changelog accuracy", or "diff vs changelog". Also use when asked whether a changelog is correct, complete, or well-worded, or when asked to help write a changelog for a version.

Piebald-AI
testing-security
code-quality
8.6K

code-review

Performs an architectural and quality code review on a specified file or set of files. Checks for coding standard compliance, architectural pattern adherence, SOLID principles, testability, and performance concerns.

Donchitos
testing-security