category focus

Media

Audio, video, and image processing.

1476 skills
media
4K

muapi-cinema-director

Direct high-fidelity cinematic video with AI — translates creative intent into technical cinematographic directives for Veo3, Kling, and Luma video models via muapi.ai

openclaw
openclaw
content-media
open

muapi-media-editing

Edit and enhance images and videos with AI via muapi.ai — prompt-based editing, upscaling, background removal, face swap, lipsync, video effects, and more

muapi-seedance-2

Expert Cinema Director skill for Seedance 2.0 (ByteDance) — high-fidelity video generation using technical camera grammar and multimodal references. Supports text-to-video, image-to-video, and video extension.

pixel-art-processing

Pixel art sprite sheet processing tool: video frame extraction, GIF/frames conversion, sprite sheet compose/split, image matting, pixelation, resize, crop, and watermark removal. Use when processing pixel art, game assets, RPG Maker sprites, or any sprite sheet workflow. Triggers on: sprite sheet, GIF拆帧 (split GIF into frames), 序列帧 (frame sequence), 像素图片 (pixel image), 抠图 (image matting), 去水印 (watermark removal), 视频转帧 (video to frames), pixel art, sprite, GIF to frames, frames to GIF, background removal, pixelate.

chromecast-with-google-tv

Cast YouTube videos, Tubi TV show episodes, and TV show episodes from other video streaming apps via ADB to Chromecast with Android TV (Chromecast 4K is supported; Google TV Streamer support is unknown).

instagram-reels

Download Instagram Reels, transcribe audio, and extract captions. Share a reel URL and get back a full transcript with the original description.

yt-dlp-downloader

Download videos from YouTube, Bilibili, Twitter, and thousands of other sites using yt-dlp. Use when the user provides a video URL and wants to download it, extract audio (MP3), download subtitles, or select video quality. Triggers on phrases like "下载视频" (download video), "download video", "yt-dlp", "YouTube", "B站" (Bilibili), "抖音" (Douyin), "提取音频" (extract audio), "extract audio".

ffmpeg-master-pro

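FFmpeg Master Pro - an all-in-one video processing skill. Use when the user needs video processing; supports video conversion, video compression, video editing, and more. Use this skill when the user asks for video processing, video conversion, video compression, FFmpeg, video editing, video transcoding, video trimming, subtitle processing, video optimization, batch video processing, GIF conversion, video flipping, speed adjustment, audio extraction, video merging, aspect ratio changes, watermarks, screenshots, or frame extraction.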

gemini-stt

Transcribe audio files using Google's Gemini API or Vertex AI

edit-greek-reel

Edit a raw talking-head video into a polished short-form reel with Greek karaoke subtitles. Trims silence, adds Manrope Bold subtitles, zoom effects, SFX, and image overlays. Usage: /edit-greek-reel <path-to-video> [options]

ffmpeg-cli

Comprehensive video/audio processing with FFmpeg. Use for: (1) Video transcoding and format conversion, (2) Cutting and merging clips, (3) Audio extraction and manipulation, (4) Thumbnail and GIF generation, (5) Resolution scaling and quality adjustment, (6) Adding subtitles or watermarks, (7) Speed adjustment (slow/fast motion), (8) Color correction and filters.

seedance-vs-kling

Get 1080p MP4 files from your text or images using this seedance-vs-kling tool. It runs AI video generation comparison on cloud GPUs, so your machine does zero heavy lifting. Content creators can compare AI video generation quality between Seedance and Kling in roughly 1-2 minutes. Supports MP4, MOV, WebM, and PNG.

record

macOS CLI tool for recording audio (microphone), screen (video/screenshot), and camera (video/photo) from the terminal. Use when the user or an AI agent needs to: (1) record microphone audio, (2) capture screen video or screenshot, (3) capture camera video or photo, (4) list available devices/displays/cameras, or any task involving audio/video/image capture on macOS via the command line. Trigger on keywords like: record, microphone, screen capture, screenshot, screen recording, camera, webcam, photo, audio capture.

sprite-animator

Generate animated pixel art sprites from any image using AI. Send a photo, get a 16-frame animated GIF.

audio-rename

Rename audio files with Chinese/special characters to simple English names for mlx-stt compatibility.

yt-assemblyai-monitor

YouTube channel monitor and video transcription using AssemblyAI cloud API. Pure Python + requests only — no ffmpeg, no Whisper, no extra tools needed. Monitors YouTube channels for new videos, extracts audio URLs via innertube API, submits to AssemblyAI for cloud transcription, and returns text + AI summary. Works on Mac, Linux, Windows. Only dependency: requests (usually pre-installed). Use when: user asks to monitor YouTube channels, transcribe YouTube videos, summarize video content, or set up YouTube content monitoring.

keevx-image-to-video

Use the Keevx API to convert images to videos. Supports multiple models (V/KL), various resolutions (720p/1080p/4K), and audio generation. Use this skill when the user needs to: (1) Convert images to video (2) Generate video with Keevx (3) Create and query image-to-video tasks (4) Batch image-to-video conversion. Keywords: image to video, Keevx, video generation.

llm-video-generator

Generate videos from text descriptions using ZhipuAI CogVideoX-3 model. Supports text-to-video, image-to-video, and first/last frame-to-video generation. Automatically handles long videos (over 5s) by chaining multiple generation calls with last-frame continuation. Use when the user asks to create/generate a video from text, make a video, text-to-video, 文生视频 (text-to-video), 生成视频 (generate a video), 做个视频 (make a video), or any request involving converting text/images into a video. Supports configuring video content, style, resolution (up to 4K), frame rate (30/60fps), audio, and duration.

magnet-search

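Search for movie magnet download links. Connects to real magnet search engine APIs (ThePirateBay, Nyaa, etc.) and returns high-quality torrent links. Intended for lawful personal study and research purposes.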

bilibili-downloader

Download videos, audio, subtitles, and covers from Bilibili using bilibili-api. Use when working with Bilibili content for downloading videos in various qualities, extracting audio, getting subtitles and danmaku, downloading covers, and managing download preferences.

download

Downloads YouTube videos to ~/Downloads. Use when user wants to download a YouTube video to their machine.

video-understanding

Analyze videos with Google Gemini multimodal AI. Download from any URL (Loom, YouTube, TikTok, Vimeo, Twitter/X, Instagram, 1000+ sites) and get transcripts, descriptions, and answers to questions. Use when asked to watch, analyze, summarize, or transcribe a video, or answer questions about video content. Triggers on video URLs or requests involving video understanding.

alibaba-super-resolution

Enhance video resolution using Alibaba Cloud Super Resolution API. Use when the user wants to: (1) upscale low-res videos to higher resolution, (2) improve video quality before publishing, or (3) convert 480p videos to 1080p.

mm-voice-maker

Enables voice synthesis, voice cloning, voice design, and audio post-processing using MiniMax Voice API and FFmpeg. Use when converting text to speech, creating custom voices, or processing/merging audio.
