category focus

Media

Audio, video, and image processing.

1476 مهارةall categories
sorting
stars
current ordering strategy
query
all entries
refine the visible subset
media
215

image-handling

Image handling for Claude API constraints (5MB max, 8000px max dimension). Use when working with images, screenshots, or MCP browser tools.

megalithic
megalithic
content-media
open
media
203

video-frames

Extract frames or short clips from videos using ffmpeg.

CoWork-OS
CoWork-OS
content-media
open
media
202

volcengine-video-understanding

火山视频理解 - 使用火山方舟视频理解 API 分析视频内容。通过 Files API 上传视频(推荐),支持大文件(最大512MB),可用于视频内容分析、物体识别、动作理解等。当用户需要分析视频、理解视频内容、提取视频信息时激活此技能。

freestylefly
freestylefly
content-media
open
media
202

canghe-compress-image

Compresses images to WebP (default) or PNG with automatic tool selection. Use when user asks to "compress image", "optimize image", "convert to webp", or reduce image file size.

freestylefly
freestylefly
content-media
open
media
198

media-processor

提供基于 FFmpeg 和 ImageMagick 的多媒体处理能力,支持视频和图像的格式转换、分辨率调整、压缩等操作

anbeime
anbeime
content-media
open
media
198

infinitetalk

音频驱动的稀疏帧视频配音工具,支持音频驱动的 Video-to-Video 和 Image-to-Video 生成,实现精准的唇形、头部、身体姿态同步,支持无限时长视频生成

anbeime
anbeime
content-media
open
media
198

infinitetalk

音频驱动的稀疏帧视频配音工具,支持音频驱动的 Video-to-Video 和 Image-to-Video 生成,实现精准的唇形、头部、身体姿态同步,支持无限时长视频生成

anbeime
anbeime
content-media
open
media
197

share-social

This skill should be used when the user asks to "optimize for Instagram", "YouTube Shorts format", "make it 9:16", "square video", "TikTok format", "Reels format", "prepare for social media", "encode for Twitter", "optimize for Facebook", "LinkedIn video", "crop for portrait", or mentions any platform-specific video format or upload requirements.

gupsammy
gupsammy
content-media
open
media
197

compress-video

This skill should be used when the user asks to "compress this video", "reduce file size", "make this video smaller", "optimize for web", "shrink this video", "compress to under X MB", "reduce bitrate", "make it smaller without losing quality", "encode with H.265", or "re-encode this video".

gupsammy
gupsammy
content-media
open
media
197

convert-video

This skill should be used when the user asks to "convert this video", "change format to mp4", "trim from X to Y", "cut the first X seconds", "speed up this video", "slow motion", "timelapse", "extract frames", "resize video", "scale down", "rotate video", "flip video", "remux", or any general FFmpeg video manipulation not covered by compress-video, make-gif, share-social, or extract-audio.

gupsammy
gupsammy
content-media
open
media
197

tts-node

Atomic reference for @panda-video-generator/tts-node: pnpm tts, cli.ts, processNarrationFile — Edge-TTS narration → audio.mp3 + audio.vtt; env vars TTS_*, EDGE_TTS_*, ffmpeg. Triggers: TTS, Edge-TTS, 口播音频, audio.mp3, public/tts.

szhshp
szhshp
content-media
open
media
197

extract-audio

This skill should be used when the user asks to "extract audio", "get the mp3", "strip audio from video", "rip audio", "save audio from video", "convert to audio", "get the soundtrack", "pull the audio track", "save as mp3", "export audio", or "separate audio from video".

gupsammy
gupsammy
content-media
open
media
191

nikon-photography

Manage Nikon Z5 II photo and video libraries. Batch resize, convert to JPEG XL or optimized JPEG via mozjpeg, create contact sheets, manage EXIF metadata, prepare share-ready albums, and process 4K video. Uses Python scripts via uv run for all batch operations. Use when working with JPG, NEF, HEIF, or MOV photo/video files, or when the user mentions photos, camera, Nikon, resize, sharing pictures, photo library, image optimization, JPEG XL, or contact sheet.

wcygan
wcygan
content-media
open
media
188

douyin-downloader

Download Douyin (抖音) videos from share links. Parse Douyin share text/links, download watermark-free videos, and transcribe audio to text using Volcano Engine ASR (Doubao Speech). Uses Python for iSH compatibility.

OpenMinis
OpenMinis
content-media
open
media
188

twitter-downloader

Download text, images, GIFs, and videos from Twitter/X posts via fxtwitter API. Trigger when users share any twitter.com or x.com link, or ask to download or see media from a tweet (e.g., '下载推特视频', '把这条推文的图存下来', 'what's in this tweet').

OpenMinis
OpenMinis
content-media
open
media
187

video-clip-extractor

Processes videos to identify engaging moments, generate transcripts, and create highlight clips with artistic titles and custom cover images. Use when user needs to: extract highlights from long videos or livestreams, clip or cut best moments from videos, cut video highlights, process Bilibili/YouTube URLs or local video files, generate transcripts via Whisper, analyze content for engaging moments, create short-form clips with styled titles and covers, adjust cover text position and colors, find and export memorable scenes from recordings, burn subtitles into clips (with optional translation), guide clip selection with user intent, or identify speakers in multi-person conversations.

linzzzzzz
linzzzzzz
content-media
open
media
186

image-upload

Use when you need to upload and preview images.

thedaviddias
thedaviddias
content-media
open
media
186

image-gallery

Use when you need to display and browse image collections.

thedaviddias
thedaviddias
content-media
open
media
186

nano-banana-pro

Generate or edit images via Gemini 3 Pro Image (Nano Banana Pro).

TermiX-official
TermiX-official
content-media
open
media
186

video-player

Use when implementing video playback with controls.

thedaviddias
thedaviddias
content-media
open
media
186

video-frames

Extract frames or short clips from videos using ffmpeg.

TermiX-official
TermiX-official
content-media
open
media
185

transcribee

Transcribe YouTube videos and local audio/video files with speaker diarization. Use when user asks to transcribe a YouTube URL, podcast, video, or audio file. Outputs clean speaker-labeled transcripts ready for LLM analysis.

itsfabioroma
itsfabioroma
content-media
open
media
185

gastrohem-media-processor

Automatically process unprocessed audio and image files in Gastrohem daily WhatsApp folders. This skill should be used when the user asks to transcribe audio files, perform OCR on images, or process media in daily folders (e.g., "Process media in today's folder", "Transcribe audio and OCR images in 24.10 folder"). Handles audio transcription using insanely-fast-whisper (parallelized, creates .json) and image OCR using Claude's vision capabilities (creates natural .md summaries with Gastrohem-relevant info).

majiayu000
majiayu000
content-media
open
media
185

ffmpeg-patterns

FFmpeg video and audio processing patterns. Use when transcoding video/audio, extracting clips, adding filters, merging media, creating thumbnails, or batch processing media files.

majiayu000
majiayu000
content-media
open
Previous
Page 19 / 62
Next