category focus

Media

Audio, video, and image processing.

1476 スキルall categories
sorting
stars
current ordering strategy
query
all entries
refine the visible subset
media
23

ffmpeg-glitch-distortion-effects

Complete glitch art, datamosh, and video distortion effects system. PROACTIVELY activate for: (1) Datamosh/pixel bleeding effects, (2) VHS/analog glitch simulation, (3) Digital corruption effects, (4) Displacement mapping, (5) Wave/ripple distortions, (6) Pixelation and mosaic effects, (7) Chromatic aberration, (8) Scan line effects, (9) Time-based distortions (echo, trails), (10) Lens distortion and barrel effects. Provides: minterpolate for datamosh, displacement filter, geq pixel manipulation, noise and artifacts, rgbashift/chromashift for color separation, lagfun for trails, tmix for frame blending, tblend for frame difference effects.

JosiahSiegel
JosiahSiegel
content-media
open
media
23

ffmpeg-hardware-acceleration

Complete GPU-accelerated encoding/decoding system for FFmpeg 7.1 LTS and 8.0.1 (latest stable, released 2025-11-20). PROACTIVELY activate for: (1) NVIDIA NVENC/NVDEC encoding, (2) Intel Quick Sync Video (QSV), (3) AMD AMF encoding, (4) Apple VideoToolbox, (5) Linux VAAPI setup, (6) Vulkan Video 8.0 (FFv1, AV1, VP9, ProRes RAW), (7) VVC/H.266 hardware decoding (VAAPI/QSV), (8) GPU pipeline optimization with pad_cuda, (9) Docker GPU containers, (10) Performance benchmarking. Provides: Platform-specific commands, preset comparisons, quality tuning, full GPU pipeline examples, Vulkan compute codecs, VVC decoding, troubleshooting guides. Ensures: Maximum encoding speed with optimal quality using GPU acceleration.

JosiahSiegel
JosiahSiegel
content-media
open
media
23

convex-agents-files

Handles file uploads, image attachments, and media processing in agent conversations. Use this when agents analyze images, process documents, or generate files.

Sstobo
Sstobo
content-media
open
media
23

media-hub

Unified media processing center for audio and video transcription, conversion, and understanding. Use when processing media files.

wulaosiji
wulaosiji
content-media
open
media
23

nano-banana-pro

Generate/edit images with Nano Banana Pro (Gemini 3 Pro Image). Use for image create/modify requests incl. edits. Supports text-to-image + image-to-image; 1K/2K/4K; use --input-image.

OpenMOSS
OpenMOSS
content-media
open
media
23

voice-clone

使用 WaveSpeed AI MiniMax Voice Clone API 克隆声音并生成语音。支持吴娜等特定人物的声音克隆。

wulaosiji
wulaosiji
content-media
open
media
23

video-overlay

Adds professional packaging and motion graphics to videos. Use when the user asks to add intros, outros, subtitles, transitions, watermarks, or lower thirds to a video. Supports multiple styles and custom options, no API key required.

wells1137
wells1137
content-media
open
media
23

video-upscaler

Intelligently upscale and enhance videos to cinematic quality using a multi-model backend (Topaz, SeedVR2).

wells1137
wells1137
content-media
open
media
23

kling-studio

Full-featured Kling 3.0 Omni video generation skill. Covers text-to-video, image-to-video, video editing (base mode), video reference (feature mode), multi-shot generation, and audio-synced video. Includes validated API constraint rules and prompt engineering guide.

wells1137
wells1137
content-media
open
media
23

ffmpeg-pyav-integration

Complete PyAV (Python FFmpeg bindings) integration guide. PROACTIVELY activate for: (1) PyAV installation on Ubuntu/Windows/macOS, (2) Building PyAV against custom FFmpeg, (3) FFmpeg 7.0/8.0+ compatibility, (4) av.open() video/audio decoding, (5) VideoFrame/AudioFrame NumPy conversion, (6) Filter graph processing, (7) Video encoding with H.264/H.265/AV1, (8) Seeking and keyframe extraction, (9) RTSP/network streaming with PyAV, (10) Memory management and thread safety, (11) Error handling with FFmpegError, (12) Subtitle extraction, (13) Container manipulation and remuxing, (14) Performance optimization and threading. Provides: Complete PyAV API patterns, installation guides for all Ubuntu versions, FFmpeg 8.0+ compatibility matrix, type-safe examples, memory management best practices, filter graph examples, encoding/decoding patterns.

JosiahSiegel
JosiahSiegel
content-media
open
media
23

ffmpeg-command-syntax

Complete FFmpeg command syntax reference covering option ordering, input vs output options, stream specifiers, and position-sensitive options. PROACTIVELY activate for: (1) Command syntax questions, (2) Option placement issues, (3) Input vs output option confusion, (4) Stream specifier syntax, (5) -ss/-t/-to position questions, (6) Global vs per-file options, (7) Multiple input/output handling, (8) Option order errors. Provides: Correct option placement rules, input-only vs output-only options, position-sensitive option behavior, stream specifier syntax, common mistakes and fixes.

JosiahSiegel
JosiahSiegel
content-media
open
media
23

videocut-talk-edit

Talking-head video transcription and speech error detection. Generates review page and deletion task list. Triggers: edit talking head, process video, detect speech errors, 剪口播, 处理视频, 识别口误

Quriosity-agent
Quriosity-agent
content-media
open
media
23

qcut-toolkit

Unified QCut media toolkit — organize project files, process media with FFmpeg, generate AI content, control the QCut editor with native CLI commands, generate video prompts, and test MCP preview. Use when the user asks about any media workflow, file organization, video processing, AI generation, editor control, video prompts, or content pipeline task.

Quriosity-agent
Quriosity-agent
content-media
open
media
23

ffmpeg-skill

Use when user asks to convert, compress, trim, resize, extract audio, add subtitles, create GIFs, or process video/audio files

Quriosity-agent
Quriosity-agent
content-media
open
media
22

videodb

See, Understand, Act on video and audio. See- ingest from local files, URLs, RTSP/live feeds, or live record desktop; return realtime context and playable stream links. Understand- extract frames, build visual/semantic/temporal indexes, and search moments with timestamps and auto-clips. Act- transcode and normalize (codec, fps, resolution, aspect ratio), perform timeline edits (subtitles, text/image overlays, branding, audio overlays, dubbing, translation), generate media assets (image, audio, video), and create real time alerts for events from live streams or desktop capture.

ysyecust
ysyecust
content-media
open
media
22

videodb

视频与音频的查看、理解与行动。查看:从本地文件、URL、RTSP/直播源或实时录制桌面获取内容;返回实时上下文和可播放流链接。理解:提取帧,构建视觉/语义/时间索引,并通过时间戳和自动剪辑搜索片段。行动:转码和标准化(编解码器、帧率、分辨率、宽高比),执行时间线编辑(字幕、文本/图像叠加、品牌化、音频叠加、配音、翻译),生成媒体资源(图像、音频、视频),并为直播流或桌面捕获的事件创建实时警报。

ysyecust
ysyecust
content-media
open
media
22

nano-banana-pro

Generate or edit images via Gemini 3 Pro Image (Nano Banana Pro).

Sompote
Sompote
content-media
open
media
22

short-video-editing-coach

Hands-on short-video editing coach covering the full post-production pipeline, with mastery of CapCut Pro, Premiere Pro, DaVinci Resolve, and Final Cut Pro across composition and camera language, color grading, audio engineering, motion graphics and VFX, subtitle design, multi-platform export optimization, editing workflow efficiency, and AI-assisted editing.

Prorise-cool
Prorise-cool
content-media
open
media
22

inclusive-visuals-specialist

Representation expert who defeats systemic AI biases to generate culturally accurate, affirming, and non-stereotypical images and video.

Prorise-cool
Prorise-cool
content-media
open
media
22

bulk

Transcribe ALL videos from a TikTok profile. Use when you want to process an entire profile's content.

grandamenium
grandamenium
content-media
open
media
22

transcribe

Transcribe specific video URL(s). Use when you have one or more TikTok video URLs to process.

grandamenium
grandamenium
content-media
open
Previous
Page 42 / 62
Next