Media

Audio, video, and image processing.

1476 skills

media

ffmpeg-process

Process, transcode, and manipulate media files using FFmpeg available in the sandbox. Input/output through the shared volume at {{SHARED_VOLUME}}.

bidewio

content-media

media

Expert patterns for AI video generation including text-to-video, image-to-video, video editing, and API integration with Runway, Kling, Luma, Wan, and Replicate. Use when "text to video, video generation, image to video, runway api, kling video, luma dream machine, wan video, animate image, ai video, video-generation, text-to-video, image-to-video, runway, kling, luma, wan, replicate, ai-video" mentioned.

omer-metin

content-media

media

demoscene-coding

Specialist in creating size-optimized real-time audio-visual demos and procedural art. Use when "demoscene, size coding, 64k intro, 4k intro, 1k intro, tiny code, shader golf, shader minification, procedural demo, bytebeat, pouet, demo party, real-time procedural, demoscene, size-coding, 64k-intro, 4k-intro, shader-optimization, procedural, webgl, graphics" mentioned.

omer-metin

content-media

media

add-to-leaderboard

Use this skill when the user wants to add a new codec entry to the leaderboard, update leaderboard rankings, or mentions adding someone's compression results.

agavra

content-media

media

view-attachment

View image or file attachments that you can't directly see. Use this skill when you receive a message with attachments listed as file paths and you need to understand their contents. Especially useful for text-only models that cannot process images natively.

tkellogg

content-media

media

videohub-douyin

Downloads a single Douyin video, reusing src/douyin_cli.py. Suited to Douyin share links, short links, and standard video links.

cacity

content-media

media

videohub-subtitles

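Handles the burn-in and compositing workflow that follows subtitle generation. Suited to burning ASS subtitles into a video, or guiding users through existing subtitle tools.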

cacity

content-media

media

videohub

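Main entry point for VideoHub. Identifies the platform or feature the user wants to work with and routes to more specific VideoHub skills, such as YouTube, Douyin, 蔻享 (Koushare), the idle-time queue, FFmpeg, subtitles, and live-stream recording.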

cacity

content-media

media

videohub-ffmpeg

Manages FFmpeg configuration, modes, paths, downloads, and availability tests. Directly reuses src/ffmpeg_config_cli.py.

cacity

content-media

media

video-directing

World-class video directing mastery drawing from cinematic legends like Spielberg, Cameron, Coppola, and Nolan. This skill translates directorial intent into AI video generation, focusing on visual storytelling, emotional pacing, and shot composition. It guides the use of AI tools to orchestrate emotion, not just generate footage. Covers pre-visualization, shot sequencing, and the application of classic techniques, like the Spielberg Face or Cameron Scale, to synthetic media. Emphasizes that in an era of unlimited generative possibility, the director's vision, craft, and understanding of "why" a shot works are more critical than ever. Use when "direct, directing, director, shot, camera angle, camera movement, cinematic, scene, blocking, coverage, composition, visual storytelling, like Spielberg, like Cameron, like Nolan, like Tarantino, like Scorsese, film style, movie style, directing, cinematography, camera, shot-composition, storytelling, film, cinematic, visual-language, blocking, spielberg, cameron, nolan,

omer-metin

content-media

media

vfx-realtime

Expert real-time VFX artist specializing in particle systems, shader effects, and the invisible craft that makes games feel satisfying. Masters Niagara, VFX Graph, Godot GPU particles, and understands the AAA principles that make effects read clearly at 60fps. Use when "particle system, visual effects, vfx, particles, niagara, vfx graph, flipbook, sprite sheet, explosion effect, magic effect, trail effect, beam effect, dissolve, distortion, force field, hit effect, muzzle flash, impact effect, smoke particles, fire effect, soft particles, game juice, screen shake, particle overdraw, effect optimization, vfx, particles, effects, niagara, vfx-graph, game-juice, visual-effects, shaders, flipbook, trails, beams, explosions, optimization, gpu-particles" mentioned.

omer-metin

content-media

media

ai-image-editing

Expert patterns for AI-powered image editing including inpainting, outpainting, ControlNet, image-to-image, and API integration with Replicate, Stability AI, and Fal. Use when "ai image editing, inpainting, outpainting, controlnet, image to image, remove object from image, extend image, flux inpaint, sdxl editing, image-editing, inpainting, outpainting, controlnet, stable-diffusion, flux, replicate, stability-ai, comfyui" mentioned.

omer-metin

content-media

media

ai-visual-effects

The enhancement layer for AI-generated content. This skill covers AI-powered visual effects, compositing, upscaling, restoration, and post-production magic—turning raw AI output into polished, professional content. AI generation gets you 80% of the way. Visual effects get you the remaining 20% that separates "clearly AI" from "how did they do that?" This skill covers ComfyUI workflows, Runway's AI tools, intelligent upscaling, rotoscoping, color grading, and the integration of AI elements into traditional footage. The practitioners of this skill are technical artists who understand both traditional VFX workflows and the new AI-native approaches that are revolutionizing post-production. Use when "AI visual effects, VFX, upscale, upscaling, composite, rotoscope, background removal, color grade, inpaint, outpaint, style transfer, enhance, ComfyUI, post-production AI, vfx, visual-effects, compositing, upscaling, post-production, comfyui, enhancement, color-grading" mentioned.

omer-metin

content-media

media

cloudinary

Cloudinary API for image/video management. Use when user mentions "Cloudinary", "upload image", "transform image", or media assets.

vm0-ai

content-media

media

syncfusion-maui-image-editor

Implements and customizes the Syncfusion .NET MAUI ImageEditor (SfImageEditor) for editing, annotating, and transforming images. Use when working with MAUI image editing, photo editing, image cropping, or image transformations. Covers shape/text overlays, freehand drawing, image filters, toolbar customization, and saving edited images.

syncfusion

content-media

media

exiftool-immich

Write EXIF/IPTC/XMP metadata to photos and videos for Immich sync, including albums (keywords), favorites (rating), description, GPS location, and date/time. Handles diff-based updates with proper removal of metadata.

jmathai

content-media

media

imagemagick

Use this skill whenever scientific image assets need deterministic preprocessing (resize, crop, convert, DPI normalization, montage/contact sheets) using ImageMagick 7 `magick`.

drpedapati

content-media

media

youtube-transcript

Download YouTube video transcripts with automatic frame extraction for visual references. Use when analyzing YouTube videos, tutorials, or conference talks.

b33eep

content-media

media

video-ingest

Download, transcribe, and summarize videos via the Inngest pipeline. Use when the user asks to grab/download/transcribe/ingest a video, save a YouTube video, or process any video URL. Also handles batch ingest of multiple URLs. This skill triggers the durable Inngest workflow — do NOT run yt-dlp, mlx-whisper, or scp manually.

joelhooks

content-media

media

mux-video

Upload, manage, and embed videos via Mux. Covers direct uploads, API asset management, webhook event flow, playback embedding, and the Mux CLI. Use when uploading video, creating assets, checking encoding status, embedding playback, or handling Mux webhook events.

joelhooks

content-media

media

image-utils

Classic image manipulation with Python Pillow - resize, crop, composite, format conversion, watermarks, brightness/contrast adjustments, and web optimization. Use this skill when post-processing AI-generated images, preparing images for web delivery, batch processing image directories, creating responsive image variants, or performing any deterministic pixel-level image operation. Works standalone or alongside bria-ai for post-processing generated images.

Bria-AI

content-media

media

vision

Analyze images from local files, URLs, or base64 data. For user-uploaded images, the path is provided in the [image attached] annotation.

ionclaw-org

content-media

media

image-optimization

Use when implementing responsive images, format conversion, focal point cropping, or image processing pipelines. Covers srcset generation, WebP/AVIF conversion, lazy loading, and image transformation APIs for headless CMS.

melodic-software

content-media

media

vllm-omni-video-gen

Generate videos with vLLM-Omni using Wan2.2 and other video generation models. Use when generating videos from text, creating videos from images, configuring video generation parameters, or working with text-to-video or image-to-video models.

hsliuustc0106

content-media
