category focus

Media

Audio, video, and image processing.

1476 skillsall categories
sorting
stars
current ordering strategy
query
all entries
refine the visible subset
media
40

framerate-audit

This skill should be used when the user asks "check frame rate", "is this CFR or VFR", "video has duplicate frames", "video stutters", "frame rate issues", "why does video judder", or wants to analyze frame rate characteristics and detect timing problems.

robbyt
robbyt
content-media
open
media
40

hdr-audit

This skill should be used when the user asks "is this real HDR", "check HDR metadata", "fake HDR", "is this Dolby Vision legitimate", "HDR vs SDR", "check HDR peak brightness", or wants to verify whether HDR content is genuine or inverse tonemapped from SDR.

robbyt
robbyt
content-media
open
media
40

video-add-chapters

Add chapters to videos by transcribing, analyzing, and generating structured markdown documents with YouTube chapter markers. Optionally generate highlight videos.

jykim
jykim
content-media
open
media
40

magic-mirror-demo

魔镜 demo 技能。用户说“魔镜拍照”“帮我拍一张”“魔镜请拍照”时,用 soarmmoce-real-con 的本地摄像头抓拍脚本把照片保存到这个 skill 的工作空间;用户说“谁最美”“选最美的人”“把最美的人变成皇后”“生成皇后视频”时,从工作空间照片里选出最佳候选,先把本地照片上传成 ArtsAPI 可访问的公网图片 URL,再调用 ArtsAPI 图生视频生成皇后变身视频并优先保存到本地。

wanhaoniu
wanhaoniu
content-media
open
media
40

sharp

Process images with the Sharp library for Node.js — resize, convert formats, composite, apply effects, and manage metadata. Use when the user mentions "sharp", "image processing", "resize image", "convert image", "image format", "jpeg quality", "png compression", "webp", "avif", "image thumbnail", "crop image", "watermark", "overlay image", "blur image", "sharpen image", "image metadata", "EXIF", "ICC profile", "colour space", "alpha channel", "animated gif", "image pipeline", or asks how to manipulate images in Node.js/TypeScript. Also use for "sharp constructor", "sharp cache", "sharp concurrency", "toFile", "toBuffer", or any Sharp API method.

clasen
clasen
content-media
open
media
40

video-full-process

Unified workflow combining video-clean and video-add-chapters with transcript reuse and chapter remapping

jykim
jykim
content-media
open
media
40

transcribe

Create clean, subtitle-ready `.srt` subtitles for a local audio or video file using Whisper CLI. Use this when the user asks to transcribe a local media file, generate subtitles, create captions, or make an `.srt` for a specific local file path.

bholmesdev
bholmesdev
content-media
open
media
40

alt-text

Write concise alt text for a local image file. Use this when the user asks for alt text, accessibility text, or a brief image description for a specific local image path. Pass the target image path as `$0`.

bholmesdev
bholmesdev
content-media
open
media
40

fal-upscale

Upscale and enhance image and video resolution using AI

benjaminasterA
benjaminasterA
content-media
open
media
40

fal-image-edit

AI-powered image editing with style transfer and object removal

benjaminasterA
benjaminasterA
content-media
open
media
40

remotion-video

Remotion video production — scenes, transitions, OffthreadVideo, audio mixing, multi-scene timelines, and rendering output

InugamiDev
InugamiDev
content-media
open
media
40

bibi

AI video & audio summarizer. Summarize YouTube videos, Bilibili videos, podcasts, TikTok, Twitter/X, Xiaohongshu, and any online video or audio. Use when the user wants to summarize a video, extract transcripts/subtitles, get chapter-by-chapter summaries, or understand video content quickly. Triggers: "summarize this video", "what's this video about", "extract subtitles", "总结这个视频", "帮我看看这个视频讲了什么", "video summary", "podcast notes", "YouTube summary", "B站总结", "get transcript", "video to notes". Works via bibi CLI (macOS/Windows) or OpenAPI (Linux / any platform without CLI).

JimmyLv
JimmyLv
content-media
open
media
40

media-processing

Image, video, and audio processing using FFmpeg, ImageMagick, Sharp, and web-optimized media pipelines

InugamiDev
InugamiDev
content-media
open
media
40

image-optimization

Image optimization — next/image, responsive sizes, priority loading, blur placeholders, WebP/AVIF, CDN loaders, lazy loading

InugamiDev
InugamiDev
content-media
open
media
39

ffmpeg-media-processing

Use when user asks to convert, compress, trim, resize, extract audio, add subtitles, create GIFs, or process video/audio files

donghaozhang
donghaozhang
content-media
open
media
39

transcribe

Transcribe audio files to text with optional diarization and known-speaker hints. Use when a user asks to transcribe speech from audio/video, extract text from recordings, or label speakers in interviews or meetings.

lingxling
lingxling
content-media
open
media
39

neon

Generate .neon motion definition files and render them to MP4/WebP video. Use when creating motion graphics, visual effects, animation videos, or when user mentions neon, .neon files, motion effects, or video rendering.

S1mpleSonny
S1mpleSonny
content-media
open
media
39

neon-replicate

Replicate motion effects from reference videos. Use when user wants to copy/clone/replicate an existing motion effect, compare generated effects with originals, or iterate on effect accuracy.

S1mpleSonny
S1mpleSonny
content-media
open
media
39

image-enhancer

Improves the quality of images, especially screenshots, by enhancing resolution, sharpness, and clarity. Perfect for preparing images for presentations, documentation, or social media posts.

lingxling
lingxling
content-media
open
media
39

youtube-downloader

Download YouTube videos with customizable quality and format options. Use this skill when the user asks to download, save, or grab YouTube videos. Supports various quality settings (best, 1080p, 720p, 480p, 360p), multiple formats (mp4, webm, mkv), and audio-only downloads as MP3.

lingxling
lingxling
content-media
open
media
38

nano-banana-pro

Generate or edit images via Gemini 3 Pro Image (Nano Banana Pro).

jiangye1314
jiangye1314
content-media
open
media
38

inclusive-visuals

Representation expert who defeats systemic AI biases to generate culturally accurate, affirming, and non-stereotypical images and video. Adapted from msitarzewski/agency-agents.

elophanto
elophanto
content-media
open
Previous
Page 32 / 62
Next