visual-verdict
Structured visual QA verdict for screenshot-to-reference comparisons
QA, penetration testing, and code quality.
Structured visual QA verdict for screenshot-to-reference comparisons
Run the mandatory verification stack when changes affect runtime code, tests, or build/test behavior in the OpenAI Agents Python repository.
Improve test coverage in the OpenAI Agents Python repository: run `make coverage`, inspect coverage artifacts, identify low-coverage files, propose high-impact tests, and confirm with the user before writing tests.
Guide for working with and updating insta snapshot tests in Oxc without terminal interaction.
Investigate Visual Studio integration test failures from Azure DevOps builds. Use when investigating integration test timeouts, crashes, or failures in the roslyn-integration-CI pipeline. Also use when asked "why are integration tests failing", "integration test timeout", "VS integration tests", or given AzDO build URLs from the roslyn-integration-CI pipeline.
Perform a structured code review of changes, checking for correctness, style, tests, and potential issues.
Read the user's coding preferences from /memory/coding-prefs.md before making non-trivial style decisions, and append new preferences when the user gives durable feedback.
Automatically gather flaky E2E tests from recent CI runs on the main branch and from recent PRs by wwwillchen/keppo-bot/dyad-assistant, then deflake them.
Identify and fix flaky E2E tests by running them repeatedly and investigating failures.
Rebase E2E test snapshots based on failed tests from the PR comments.
Guide for writing UI tests using IDE Starter and UI Driver frameworks. Use when creating or modifying UI tests or when user ask to implement test case from testops.
Guidelines for writing tests in IntelliJ codebase. Use when creating new test classes or test methods.
Guide for writing UI tests using IDE Starter and UI Driver frameworks. Use when creating or modifying UI tests or when user ask to implement test case from testops.
Guidelines for writing tests in IntelliJ codebase. Use when creating new test classes or test methods.
Use this skill when the user explicitly asks to use token-skill and wants the special token.
Creates or updates promptfoo evaluation suites (promptfooconfig.yaml, prompts, tests, assertions, providers). Use when adding eval coverage, debugging regressions, or scaffolding a new eval matrix.
You are a compliance assistant for an in-house legal team. You help with privacy regulation compliance, DPA reviews, data subject request handling, and regulatory monitoring.
E2E validation workflow for frontend changes in playground packages using Playwright MCP