category focus

Media

Audio, video, and image processing.

1476 스킬all categories
sorting
stars
current ordering strategy
query
all entries
refine the visible subset
media
54

ffmpeg-process

Process, transcode, and manipulate media files using FFmpeg available in the sandbox. Input/output through the shared volume at {{SHARED_VOLUME}}.

bidewio
bidewio
content-media
open
media
53

text-to-video

Expert patterns for AI video generation including text-to-video, image-to-video, video editing, and API integration with Runway, Kling, Luma, Wan, and ReplicateUse when "text to video, video generation, image to video, runway api, kling video, luma dream machine, wan video, animate image, ai video, video-generation, text-to-video, image-to-video, runway, kling, luma, wan, replicate, ai-video" mentioned.

omer-metin
omer-metin
content-media
open
media
53

demoscene-coding

Specialist in creating size-optimized real-time audio-visual demos and procedural artUse when "demoscene, size coding, 64k intro, 4k intro, 1k intro, tiny code, shader golf, shader minification, procedural demo, bytebeat, pouet, demo party, real-time procedural, demoscene, size-coding, 64k-intro, 4k-intro, shader-optimization, procedural, webgl, graphics" mentioned.

omer-metin
omer-metin
content-media
open
media
53

add-to-leaderboard

Use this skill when the user wants to add a new codec entry to the leaderboard, update leaderboard rankings, or mentions adding someone's compression results.

agavra
agavra
content-media
open
media
53

view-attachment

View image or file attachments that you can't directly see. Use this skill when you receive a message with attachments listed as file paths and you need to understand their contents. Especially useful for text-only models that cannot process images natively.

tkellogg
tkellogg
content-media
open
media
53

videohub-douyin

下载抖音单视频,复用 src/douyin_cli.py。适合处理抖音分享链接、短链接和标准视频链接。

cacity
cacity
content-media
open
media
53

videohub-subtitles

处理字幕生成后的烧录和合成流程。适用于把 ASS 字幕烧录进视频,或指导用户使用现有字幕工具。

cacity
cacity
content-media
open
media
53

videohub

VideoHub 总入口。用于识别用户要处理的平台或功能,并路由到更具体的 VideoHub skills,如 YouTube、抖音、蔻享、闲时队列、FFmpeg、字幕和直播录制。

cacity
cacity
content-media
open
media
53

videohub-ffmpeg

管理 FFmpeg 配置、模式、路径、下载和可用性测试。直接复用 src/ffmpeg_config_cli.py。

cacity
cacity
content-media
open
media
53

video-directing

World-class video directing mastery drawing from cinematic legends like Spielberg, Cameron, Coppola, and Nolan. This skill translates directorial intent into AI video generation, focusing on visual storytelling, emotional pacing, and shot composition. It guides the use of AI tools to orchestrate emotion, not just generate footage. Covers pre-visualization, shot sequencing, and the application of classic techniques—like the Spielberg Face or Cameron Scale—to synthetic media. Emphasizes that in an era of unlimited generative possibility, the director's vision, craft, and understanding of "why" a shot works are more critical than ever.Use when "direct, directing, director, shot, camera angle, camera movement, cinematic, scene, blocking, coverage, composition, visual storytelling, like Spielberg, like Cameron, like Nolan, like Tarantino, like Scorsese, film style, movie style, directing, cinematography, camera, shot-composition, storytelling, film, cinematic, visual-language, blocking, spielberg, cameron, nolan,

omer-metin
omer-metin
content-media
open
media
53

vfx-realtime

Expert real-time VFX artist specializing in particle systems, shader effects, and the invisible craft that makes games feel satisfying. Masters Niagara, VFX Graph, Godot GPU particles, and understands the AAA principles that make effects read clearly at 60fps. Use when "particle system, visual effects, vfx, particles, niagara, vfx graph, flipbook, sprite sheet, explosion effect, magic effect, trail effect, beam effect, dissolve, distortion, force field, hit effect, muzzle flash, impact effect, smoke particles, fire effect, soft particles, game juice, screen shake, particle overdraw, effect optimization, vfx, particles, effects, niagara, vfx-graph, game-juice, visual-effects, shaders, flipbook, trails, beams, explosions, optimization, gpu-particles" mentioned.

omer-metin
omer-metin
content-media
open
media
53

ai-image-editing

Expert patterns for AI-powered image editing including inpainting, outpainting, ControlNet, image-to-image, and API integration with Replicate, Stability AI, and FalUse when "ai image editing, inpainting, outpainting, controlnet, image to image, remove object from image, extend image, flux inpaint, sdxl editing, image-editing, inpainting, outpainting, controlnet, stable-diffusion, flux, replicate, stability-ai, comfyui" mentioned.

omer-metin
omer-metin
content-media
open
media
53

ai-visual-effects

The enhancement layer for AI-generated content. This skill covers AI-powered visual effects, compositing, upscaling, restoration, and post-production magic—turning raw AI output into polished, professional content. AI generation gets you 80% of the way. Visual effects get you the remaining 20% that separates "clearly AI" from "how did they do that?" This skill covers ComfyUI workflows, Runway's AI tools, intelligent upscaling, rotoscoping, color grading, and the integration of AI elements into traditional footage. The practitioners of this skill are technical artists who understand both traditional VFX workflows and the new AI-native approaches that are revolutionizing post-production. Use when "AI visual effects, VFX, upscale, upscaling, composite, rotoscope, background removal, color grade, inpaint, outpaint, style transfer, enhance, ComfyUI, post-production AI, vfx, visual-effects, compositing, upscaling, post-production, comfyui, enhancement, color-grading" mentioned.

omer-metin
omer-metin
content-media
open
media
52

cloudinary

Cloudinary API for image/video management. Use when user mentions "Cloudinary", "upload image", "transform image", or media assets.

vm0-ai
vm0-ai
content-media
open
media
52

syncfusion-maui-image-editor

Implements and customize Syncfusion .NET MAUI ImageEditor (SfImageEditor) for editing, annotating, and transforming images. Use when working with MAUI image editing, photo editing, image cropping, or image transformations. Covers shape/text overlays, freehand drawing, image filters, toolbar customization, and saving edited images.

syncfusion
syncfusion
content-media
open
media
52

exiftool-immich

Write EXIF/IPTC/XMP metadata to photos and videos for Immich sync, including albums (keywords), favorites (rating), description, GPS location, and date/time. Handles diff-based updates with proper removal of metadata.

jmathai
jmathai
content-media
open
media
52

imagemagick

Use this skill whenever scientific image assets need deterministic preprocessing (resize, crop, convert, DPI normalization, montage/contact sheets) using ImageMagick 7 `magick`.

drpedapati
drpedapati
content-media
open
media
51

youtube-transcript

Download YouTube video transcripts with automatic frame extraction for visual references. Use when analyzing YouTube videos, tutorials, or conference talks.

b33eep
b33eep
content-media
open
media
51

video-ingest

Download, transcribe, and summarize videos via the Inngest pipeline. Use when the user asks to grab/download/transcribe/ingest a video, save a YouTube video, or process any video URL. Also handles batch ingest of multiple URLs. This skill triggers the durable Inngest workflow — do NOT run yt-dlp, mlx-whisper, or scp manually.

joelhooks
joelhooks
content-media
open
media
51

mux-video

Upload, manage, and embed videos via Mux. Covers direct uploads, API asset management, webhook event flow, playback embedding, and the Mux CLI. Use when uploading video, creating assets, checking encoding status, embedding playback, or handling Mux webhook events.

joelhooks
joelhooks
content-media
open
media
51

image-utils

Classic image manipulation with Python Pillow - resize, crop, composite, format conversion, watermarks, brightness/contrast adjustments, and web optimization. Use this skill when post-processing AI-generated images, preparing images for web delivery, batch processing image directories, creating responsive image variants, or performing any deterministic pixel-level image operation. Works standalone or alongside bria-ai for post-processing generated images.

Bria-AI
Bria-AI
content-media
open
media
51

vision

Analyze images from local files, URLs, or base64 data. For user-uploaded images, the path is provided in the [image attached] annotation.

ionclaw-org
ionclaw-org
content-media
open
media
50

image-optimization

Use when implementing responsive images, format conversion, focal point cropping, or image processing pipelines. Covers srcset generation, WebP/AVIF conversion, lazy loading, and image transformation APIs for headless CMS.

melodic-software
melodic-software
content-media
open
media
50

vllm-omni-video-gen

Generate videos with vLLM-Omni using Wan2.2 and other video generation models. Use when generating videos from text, creating videos from images, configuring video generation parameters, or working with text-to-video or image-to-video models.

hsliuustc0106
hsliuustc0106
content-media
open
Previous
Page 29 / 62
Next