
Media

Audio, video, and image processing.

1476 skills, sorted by stars
media
30

youtube

Comprehensive YouTube operations using yt-dlp - download videos/audio, extract transcripts and subtitles, get metadata, work with playlists, download thumbnails, and inspect available formats. Use this for any YouTube content processing task.

ericmjl
content-media
media
30

director

Full production pipeline — story to scenes, Z-Image start frames, Qwen Edit end frames, WAN FLF video clips, ffmpeg concatenation

artokun
content-media
media
30

edit-greek-reel

Edit a raw talking-head video into a polished short-form reel with karaoke subtitles. Trims silence, adds Manrope Bold subtitles, zoom effects, SFX, and image overlays. Supports any language. Usage - /edit-greek-reel <path-to-video> [options]

artemisln
content-media
media
30

weibo-video

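Weibo video upload tool. Activates when the user needs to upload a local video file to Weibo. Supports chunked upload of large files, automatically computes MD5 checksums, and shows upload progress.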

wecode-ai
content-media
media
30

youtube-content-creator

Transforms video concepts into production-ready scripts with exact spoken lines, forward-pulling hooks between every beat, and a demo-first structure. Takes a concepts.md file and applies the ideal-mechanics.md playbook.

harperaa
content-media
media
30

youtube-ingestion

Ingest YouTube videos into the vault. Triggers when user pastes a YouTube URL (youtube.com/watch or youtu.be). Fetches transcript using yt-dlp, extracts metadata, creates transcript note and summary note. User may provide additional context about the video.

ericmjl
content-media
media
30

nano-banana-pro

Generate or edit images via Gemini 3 Pro Image (Nano Banana Pro).

mholovetskyi
content-media
media
30

play-tape

Play a tape file by loading its patterns, effects, and arrangement from the tapes/ directory.

jeremyruppel
content-media
media
29

media-transcoding

FFmpeg-based media transcoding workflows with preset-driven conversions, batch processing, and safe backups for web/mobile/archive outputs.

bobmatnyc
content-media
media
29

snapas

Snap.as API Documentation

rawveg
content-media
media
29

voiceover

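Generate multilingual voiceovers (Chinese/English) with edge-tts. Use when you need to generate voice narration for a video or synchronize a voiceover to a timeline. Supports speech-rate adjustment, multiple voice options, and voiceover validation.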

MatrixReligio
content-media
media
29

nano-banana-pro-zh

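Generate or edit images with Nano Banana Pro (Gemini 3 Pro Image). Use for image creation or modification requests; supports text-to-image and image-to-image; supports 1K/2K/4K resolutions; existing images can be edited with the --input-image parameter.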

L-LesterYu
content-media
media
29

compositing

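Composite the final video with Remotion. Use when the intro, screen recording, voiceover, and outro need to be combined into a complete video. Covers animation effects, timeline management, multi-size templates, and troubleshooting.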

MatrixReligio
content-media
media
29

imagemagick

You are an expert in ImageMagick, the powerful command-line tool for creating, editing, compositing, and converting images. You help developers automate image processing pipelines using ImageMagick's `convert`, `mogrify`, `composite`, and `identify` commands — batch resizing, format conversion, watermarking, thumbnail generation, PDF manipulation, and complex image compositing for web applications, print production, and data visualization.

TerminalSkills
content-media
media
29

cloudinary

Manage images and videos with Cloudinary. Use when a user asks to optimize images, add image transformations, implement responsive images, upload media, or serve optimized assets from a CDN.

TerminalSkills
content-media
media
29

bgm

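Add background music to a video. Supports royalty-free music sources, volume mixing, and fade-in/fade-out effects. Use when you need to add background music to a video or balance the music volume against the voiceover.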

MatrixReligio
content-media
media
29

video-frames-zh

Extract frames or short clips from videos using ffmpeg.

L-LesterYu
content-media
media
29

svgo

Optimize SVG files with SVGO — remove unnecessary metadata, minify paths, merge shapes, configure plugins, and integrate into build pipelines. Use when tasks involve reducing SVG file size, cleaning up exported SVGs from design tools, building icon systems, or automating SVG optimization in CI/CD.

TerminalSkills
content-media
media
29

imgix

Optimize and transform images with imgix. Use when serving responsive images, implementing image CDN, adding real-time transformations, or optimizing Core Web Vitals with image delivery.

TerminalSkills
content-media
media
29

camsnap

Capture frames or clips from RTSP/ONVIF cameras.

mangiapanejohn-dev
content-media
media
29

nano-banana-pro

Generate or edit images via Gemini 3 Pro Image (Nano Banana Pro).

mangiapanejohn-dev
content-media
media
29

video-frames

Extract frames or short clips from videos using ffmpeg.

mangiapanejohn-dev
content-media
media
29

whisper-transcribe-docker

Speech-to-text (verbatim transcript/transcription) in Docker using faster-whisper (local, no API key). Use when you already have an audio file (e.g. from `media-audio-download`) and need a transcript with optional timestamps for summarization.

hc-tec
content-media
media
29

media-audio-download

Download audio tracks from video links for transcription/summarization. Docker-first (no host Python): uses yt-dlp+ffmpeg for Bilibili and Playwright extraction for Xiaohongshu note pages. Use when a platform skill needs an audio file for STT (e.g. Bilibili “No subtitles found”, Xiaohongshu video notes), or when the user asks “把这个视频音频下载下来/做逐字稿” ("download this video's audio / make a verbatim transcript").

hc-tec
content-media