ffmpeg-process
Process, transcode, and manipulate media files using FFmpeg available in the sandbox. Input/output through the shared volume at {{SHARED_VOLUME}}.
Process, transcode, and manipulate media files using FFmpeg available in the sandbox. Input/output through the shared volume at {{SHARED_VOLUME}}.
Expert patterns for AI video generation including text-to-video, image-to-video, video editing, and API integration with Runway, Kling, Luma, Wan, and ReplicateUse when "text to video, video generation, image to video, runway api, kling video, luma dream machine, wan video, animate image, ai video, video-generation, text-to-video, image-to-video, runway, kling, luma, wan, replicate, ai-video" mentioned.
Specialist in creating size-optimized real-time audio-visual demos and procedural artUse when "demoscene, size coding, 64k intro, 4k intro, 1k intro, tiny code, shader golf, shader minification, procedural demo, bytebeat, pouet, demo party, real-time procedural, demoscene, size-coding, 64k-intro, 4k-intro, shader-optimization, procedural, webgl, graphics" mentioned.
Use this skill when the user wants to add a new codec entry to the leaderboard, update leaderboard rankings, or mentions adding someone's compression results.
View image or file attachments that you can't directly see. Use this skill when you receive a message with attachments listed as file paths and you need to understand their contents. Especially useful for text-only models that cannot process images natively.
管理 FFmpeg 配置、模式、路径、下载和可用性测试。直接复用 src/ffmpeg_config_cli.py。
World-class video directing mastery drawing from cinematic legends like Spielberg, Cameron, Coppola, and Nolan. This skill translates directorial intent into AI video generation, focusing on visual storytelling, emotional pacing, and shot composition. It guides the use of AI tools to orchestrate emotion, not just generate footage. Covers pre-visualization, shot sequencing, and the application of classic techniques—like the Spielberg Face or Cameron Scale—to synthetic media. Emphasizes that in an era of unlimited generative possibility, the director's vision, craft, and understanding of "why" a shot works are more critical than ever.Use when "direct, directing, director, shot, camera angle, camera movement, cinematic, scene, blocking, coverage, composition, visual storytelling, like Spielberg, like Cameron, like Nolan, like Tarantino, like Scorsese, film style, movie style, directing, cinematography, camera, shot-composition, storytelling, film, cinematic, visual-language, blocking, spielberg, cameron, nolan,
Expert real-time VFX artist specializing in particle systems, shader effects, and the invisible craft that makes games feel satisfying. Masters Niagara, VFX Graph, Godot GPU particles, and understands the AAA principles that make effects read clearly at 60fps. Use when "particle system, visual effects, vfx, particles, niagara, vfx graph, flipbook, sprite sheet, explosion effect, magic effect, trail effect, beam effect, dissolve, distortion, force field, hit effect, muzzle flash, impact effect, smoke particles, fire effect, soft particles, game juice, screen shake, particle overdraw, effect optimization, vfx, particles, effects, niagara, vfx-graph, game-juice, visual-effects, shaders, flipbook, trails, beams, explosions, optimization, gpu-particles" mentioned.
Expert patterns for AI-powered image editing including inpainting, outpainting, ControlNet, image-to-image, and API integration with Replicate, Stability AI, and FalUse when "ai image editing, inpainting, outpainting, controlnet, image to image, remove object from image, extend image, flux inpaint, sdxl editing, image-editing, inpainting, outpainting, controlnet, stable-diffusion, flux, replicate, stability-ai, comfyui" mentioned.
The enhancement layer for AI-generated content. This skill covers AI-powered visual effects, compositing, upscaling, restoration, and post-production magic—turning raw AI output into polished, professional content. AI generation gets you 80% of the way. Visual effects get you the remaining 20% that separates "clearly AI" from "how did they do that?" This skill covers ComfyUI workflows, Runway's AI tools, intelligent upscaling, rotoscoping, color grading, and the integration of AI elements into traditional footage. The practitioners of this skill are technical artists who understand both traditional VFX workflows and the new AI-native approaches that are revolutionizing post-production. Use when "AI visual effects, VFX, upscale, upscaling, composite, rotoscope, background removal, color grade, inpaint, outpaint, style transfer, enhance, ComfyUI, post-production AI, vfx, visual-effects, compositing, upscaling, post-production, comfyui, enhancement, color-grading" mentioned.
Cloudinary API for image/video management. Use when user mentions "Cloudinary", "upload image", "transform image", or media assets.
Implements and customize Syncfusion .NET MAUI ImageEditor (SfImageEditor) for editing, annotating, and transforming images. Use when working with MAUI image editing, photo editing, image cropping, or image transformations. Covers shape/text overlays, freehand drawing, image filters, toolbar customization, and saving edited images.
Write EXIF/IPTC/XMP metadata to photos and videos for Immich sync, including albums (keywords), favorites (rating), description, GPS location, and date/time. Handles diff-based updates with proper removal of metadata.
Use this skill whenever scientific image assets need deterministic preprocessing (resize, crop, convert, DPI normalization, montage/contact sheets) using ImageMagick 7 `magick`.
Download YouTube video transcripts with automatic frame extraction for visual references. Use when analyzing YouTube videos, tutorials, or conference talks.
Download, transcribe, and summarize videos via the Inngest pipeline. Use when the user asks to grab/download/transcribe/ingest a video, save a YouTube video, or process any video URL. Also handles batch ingest of multiple URLs. This skill triggers the durable Inngest workflow — do NOT run yt-dlp, mlx-whisper, or scp manually.
Upload, manage, and embed videos via Mux. Covers direct uploads, API asset management, webhook event flow, playback embedding, and the Mux CLI. Use when uploading video, creating assets, checking encoding status, embedding playback, or handling Mux webhook events.
Classic image manipulation with Python Pillow - resize, crop, composite, format conversion, watermarks, brightness/contrast adjustments, and web optimization. Use this skill when post-processing AI-generated images, preparing images for web delivery, batch processing image directories, creating responsive image variants, or performing any deterministic pixel-level image operation. Works standalone or alongside bria-ai for post-processing generated images.
Use when implementing responsive images, format conversion, focal point cropping, or image processing pipelines. Covers srcset generation, WebP/AVIF conversion, lazy loading, and image transformation APIs for headless CMS.
Generate videos with vLLM-Omni using Wan2.2 and other video generation models. Use when generating videos from text, creating videos from images, configuring video generation parameters, or working with text-to-video or image-to-video models.