skills.homescapability registry 検索

home/categories/media

category focus

Media

Audio, video, and image processing.

1476 スキルall categories

sorting

stars

current ordering strategy

query

all entries

refine the visible subset

media

23

ffmpeg-glitch-distortion-effects

Complete glitch art, datamosh, and video distortion effects system. PROACTIVELY activate for: (1) Datamosh/pixel bleeding effects, (2) VHS/analog glitch simulation, (3) Digital corruption effects, (4) Displacement mapping, (5) Wave/ripple distortions, (6) Pixelation and mosaic effects, (7) Chromatic aberration, (8) Scan line effects, (9) Time-based distortions (echo, trails), (10) Lens distortion and barrel effects. Provides: minterpolate for datamosh, displacement filter, geq pixel manipulation, noise and artifacts, rgbashift/chromashift for color separation, lagfun for trails, tmix for frame blending, tblend for frame difference effects.

JosiahSiegel

content-media

media

23

ffmpeg-hardware-acceleration

Complete GPU-accelerated encoding/decoding system for FFmpeg 7.1 LTS and 8.0.1 (latest stable, released 2025-11-20). PROACTIVELY activate for: (1) NVIDIA NVENC/NVDEC encoding, (2) Intel Quick Sync Video (QSV), (3) AMD AMF encoding, (4) Apple VideoToolbox, (5) Linux VAAPI setup, (6) Vulkan Video 8.0 (FFv1, AV1, VP9, ProRes RAW), (7) VVC/H.266 hardware decoding (VAAPI/QSV), (8) GPU pipeline optimization with pad_cuda, (9) Docker GPU containers, (10) Performance benchmarking. Provides: Platform-specific commands, preset comparisons, quality tuning, full GPU pipeline examples, Vulkan compute codecs, VVC decoding, troubleshooting guides. Ensures: Maximum encoding speed with optimal quality using GPU acceleration.

JosiahSiegel

content-media

media

23

livestream-engineer

Expert in live streaming, WebRTC, and real-time video/audio

daffy0208

content-media

media

23

convex-agents-files

Handles file uploads, image attachments, and media processing in agent conversations. Use this when agents analyze images, process documents, or generate files.

Sstobo

content-media

media

23

media-hub

Unified media processing center for audio and video transcription, conversion, and understanding. Use when processing media files.

wulaosiji

content-media

media

23

nano-banana-pro

Generate/edit images with Nano Banana Pro (Gemini 3 Pro Image). Use for image create/modify requests incl. edits. Supports text-to-image + image-to-image; 1K/2K/4K; use --input-image.

OpenMOSS

content-media

media

23

voice-clone

使用 WaveSpeed AI MiniMax Voice Clone API 克隆声音并生成语音。支持吴娜等特定人物的声音克隆。

wulaosiji

content-media

media

23

video-overlay

Adds professional packaging and motion graphics to videos. Use when the user asks to add intros, outros, subtitles, transitions, watermarks, or lower thirds to a video. Supports multiple styles and custom options, no API key required.

wells1137

content-media

media

23

video-upscaler

Intelligently upscale and enhance videos to cinematic quality using a multi-model backend (Topaz, SeedVR2).

wells1137

content-media

media

23

kling-studio

Full-featured Kling 3.0 Omni video generation skill. Covers text-to-video, image-to-video, video editing (base mode), video reference (feature mode), multi-shot generation, and audio-synced video. Includes validated API constraint rules and prompt engineering guide.

wells1137

content-media

media

23

ffmpeg-pyav-integration

Complete PyAV (Python FFmpeg bindings) integration guide. PROACTIVELY activate for: (1) PyAV installation on Ubuntu/Windows/macOS, (2) Building PyAV against custom FFmpeg, (3) FFmpeg 7.0/8.0+ compatibility, (4) av.open() video/audio decoding, (5) VideoFrame/AudioFrame NumPy conversion, (6) Filter graph processing, (7) Video encoding with H.264/H.265/AV1, (8) Seeking and keyframe extraction, (9) RTSP/network streaming with PyAV, (10) Memory management and thread safety, (11) Error handling with FFmpegError, (12) Subtitle extraction, (13) Container manipulation and remuxing, (14) Performance optimization and threading. Provides: Complete PyAV API patterns, installation guides for all Ubuntu versions, FFmpeg 8.0+ compatibility matrix, type-safe examples, memory management best practices, filter graph examples, encoding/decoding patterns.

JosiahSiegel

content-media

media

23

media-downloader

影片/音訊下載（yt-dlp）

yazelin

content-media

media

23

media-transcription

影片/音訊逐字稿轉錄（faster-whisper）

yazelin

content-media

media

23

ffmpeg-command-syntax

Complete FFmpeg command syntax reference covering option ordering, input vs output options, stream specifiers, and position-sensitive options. PROACTIVELY activate for: (1) Command syntax questions, (2) Option placement issues, (3) Input vs output option confusion, (4) Stream specifier syntax, (5) -ss/-t/-to position questions, (6) Global vs per-file options, (7) Multiple input/output handling, (8) Option order errors. Provides: Correct option placement rules, input-only vs output-only options, position-sensitive option behavior, stream specifier syntax, common mistakes and fixes.

JosiahSiegel

content-media

media

23

videocut-talk-edit

Talking-head video transcription and speech error detection. Generates review page and deletion task list. Triggers: edit talking head, process video, detect speech errors, 剪口播, 处理视频, 识别口误

Quriosity-agent

content-media

media

23

qcut-toolkit

Unified QCut media toolkit — organize project files, process media with FFmpeg, generate AI content, control the QCut editor with native CLI commands, generate video prompts, and test MCP preview. Use when the user asks about any media workflow, file organization, video processing, AI generation, editor control, video prompts, or content pipeline task.

Quriosity-agent

content-media

media

23

ffmpeg-skill

Use when user asks to convert, compress, trim, resize, extract audio, add subtitles, create GIFs, or process video/audio files

Quriosity-agent

content-media

media

22

videodb

See, Understand, Act on video and audio. See- ingest from local files, URLs, RTSP/live feeds, or live record desktop; return realtime context and playable stream links. Understand- extract frames, build visual/semantic/temporal indexes, and search moments with timestamps and auto-clips. Act- transcode and normalize (codec, fps, resolution, aspect ratio), perform timeline edits (subtitles, text/image overlays, branding, audio overlays, dubbing, translation), generate media assets (image, audio, video), and create real time alerts for events from live streams or desktop capture.

ysyecust

content-media

media

22

videodb

视频与音频的查看、理解与行动。查看：从本地文件、URL、RTSP/直播源或实时录制桌面获取内容；返回实时上下文和可播放流链接。理解：提取帧，构建视觉/语义/时间索引，并通过时间戳和自动剪辑搜索片段。行动：转码和标准化（编解码器、帧率、分辨率、宽高比），执行时间线编辑（字幕、文本/图像叠加、品牌化、音频叠加、配音、翻译），生成媒体资源（图像、音频、视频），并为直播流或桌面捕获的事件创建实时警报。

ysyecust

content-media

media

22

nano-banana-pro

Generate or edit images via Gemini 3 Pro Image (Nano Banana Pro).

Sompote

content-media

media

22

short-video-editing-coach

Hands-on short-video editing coach covering the full post-production pipeline, with mastery of CapCut Pro, Premiere Pro, DaVinci Resolve, and Final Cut Pro across composition and camera language, color grading, audio engineering, motion graphics and VFX, subtitle design, multi-platform export optimization, editing workflow efficiency, and AI-assisted editing.

Prorise-cool

content-media

media

22

inclusive-visuals-specialist

Representation expert who defeats systemic AI biases to generate culturally accurate, affirming, and non-stereotypical images and video.

Prorise-cool

content-media

media

22

bulk

Transcribe ALL videos from a TikTok profile. Use when you want to process an entire profile's content.

grandamenium

content-media

media

22

transcribe

Transcribe specific video URL(s). Use when you have one or more TikTok video URLs to process.

grandamenium

content-media

Page 42 / 62