category focus

Media

Audio, video, and image processing.

1476 個技能all categories
sorting
stars
current ordering strategy
query
all entries
refine the visible subset
media
10.6K

transferring-video-styles

Converts live-action videos into artistic styles like comic, anime, manga, 3D cartoon. Use when user wants to transform video style, create anime version, apply artistic effects, convert to cartoon style, make comic video, create manga style video, stylize video. 视频风格转换、动漫风格、漫画风格、卡通风格、视频二次元化、视频风格化、AI风格转换、视频转动漫、视频转漫画。

yikart
yikart
content-media
open
media
10.6K

translating-videos

Translates videos including subtitles, voice cloning, and lip sync. Use when user wants to translate video, dub video to another language, convert video language, localize video content, create multilingual video, add translated voiceover, change video audio language, or produce foreign language version. 视频翻译、配音翻译、字幕翻译、视频本地化、多语言视频、翻译视频、视频配音、语言转换。

yikart
yikart
content-media
open
media
10.4K

demo-video

Use when the user asks to create a demo video, product walkthrough, feature showcase, animated presentation, marketing video, or GIF from screenshots or scene descriptions. Orchestrates playwright, ffmpeg, and edge-tts MCPs to produce polished video content.

alirezarezvani
alirezarezvani
content-media
open
media
10K

mmx-cli

Use mmx to generate text, images, video, speech, and music via the MiniMax AI platform. Use when the user wants to create media content, chat with MiniMax models, perform web search, or manage MiniMax API resources from the terminal.

MiniMax-AI
MiniMax-AI
content-media
open
media
9K

acestep-simplemv

Render music videos from audio files and lyrics using Remotion. Accepts audio + LRC/JSON lyrics + title to produce MP4 videos with waveform visualization and synced lyrics display. Use when users mention MV generation, music video rendering, creating video from audio/lyrics, or visualizing songs.

ace-step
ace-step
content-media
open
media
8.9K

mediago

Download videos from m3u8/HLS streams, Bilibili, and direct URLs using MediaGo. 下载视频、m3u8直播流、B站视频。 Triggers on: download video, 下载视频, 下载这个链接, 帮我下载, m3u8 download, 设置mediago地址, configure mediago, mediago api key. Requires a running MediaGo instance (desktop app or Docker).

caorushizi
caorushizi
content-media
open
media
6.6K

blip-2-vision-language

Vision-language pre-training framework bridging frozen image encoders and LLMs. Use when you need image captioning, visual question answering, image-text retrieval, or multimodal chat with state-of-the-art zero-shot performance.

Orchestra-Research
Orchestra-Research
content-media
open
media
6K

bibi

BibiGPT CLI for summarizing videos, audio, and podcasts directly in the terminal. Use when the user wants to summarize a URL (YouTube, Bilibili, podcast, etc.) or check their BibiGPT authentication status. Requires the BibiGPT desktop app installed with an active login session, or a BIBI_API_TOKEN environment variable set.

JimmyLv
JimmyLv
content-media
open
media
5.3K

video-frames

Extract frames or short clips from videos using ffmpeg.

clawdbot
clawdbot
content-media
open
media
5.3K

songsee

Generate spectrograms and feature-panel visualizations from audio with the songsee CLI.

clawdbot
clawdbot
content-media
open
media
5.3K

gifgrep

Search GIF providers with CLI/TUI, download results, and extract stills/sheets.

clawdbot
clawdbot
content-media
open
media
5.3K

camsnap

Capture frames or clips from RTSP/ONVIF cameras.

clawdbot
clawdbot
content-media
open
media
5K

films-search

Search cloud drives for downloadable film and TV resources (movies, TV series, anime). Use this skill when the user wants to download a specific movie or TV show. Do NOT use for general movie information, schedules, reviews, or recommendations.

netease-youdao
netease-youdao
content-media
open
media
4.9K

adaptive-stem-alignment

Incremental audio production with duration mismatch handling, adaptive stem extension, and pre-mix alignment verification

HKUDS
HKUDS
content-media
open
media
4.9K

aligned-stem-workflow

Incremental audio production with duration alignment handling, per-stem verification, and adaptive extension strategies

HKUDS
HKUDS
content-media
open
media
4.9K

ffmpeg-encoder-check-4855c0

Check available FFmpeg encoders before writing encoding scripts to avoid library version mismatches

HKUDS
HKUDS
content-media
open
media
4.9K

ffmpeg-encoder-check

Check FFmpeg encoder availability before video encoding to avoid library mismatches

HKUDS
HKUDS
content-media
open
media
4.9K

safe-video-concat-workflow

Two-pass video concatenation workflow that avoids encoder compatibility issues by separating concat and audio mixing

HKUDS
HKUDS
content-media
open
media
4.5K

seo-images

Image optimization analysis for SEO and performance. Checks alt text, file sizes, formats, responsive images, lazy loading, CLS prevention, image SERP rankings (via DataForSEO), and image file optimization (WebP/AVIF conversion, IPTC/XMP metadata injection). Use when user says "image optimization", "alt text", "image SEO", "image size", "image audit", "optimize images", "image metadata", "image SERP", "convert to webp", or "image file optimize".

AgriciDaniel
AgriciDaniel
content-media
open
media
4.2K

analyzing-disk-image-with-autopsy

Perform comprehensive forensic analysis of disk images using Autopsy to recover files, examine artifacts, and build investigation timelines.

mukul975
mukul975
content-media
open
media
4.2K

performing-file-carving-with-foremost

Recover files from disk images and unallocated space using Foremost's header-footer signature carving to extract evidence regardless of file system state.

mukul975
mukul975
content-media
open
Previous
Page 3 / 62
Next