category focus

Media

Audio, video, and image processing.

1476 اسکلزall categories
sorting
stars
current ordering strategy
query
all entries
refine the visible subset
media
36

downloads-organizer

Automatically organize and clean up downloads folder by categorizing files, removing duplicates, and optimizing storage space

ttmouse
ttmouse
content-media
open
media
36

youtube-to-docs

Comprehensive suite for processing YouTube videos. Use this when the user needs to: (1) Extract transcripts, (2) Generate visual infographics, (3) Create audio summaries (TTS) and videos, or (4) Perform full 'kitchen sink' processing of YouTube content.

DoIT-Artificial-Intelligence
DoIT-Artificial-Intelligence
content-media
open
media
36

youtube-to-docs

Comprehensive suite for processing YouTube videos. Use this when the user needs to: (1) Extract transcripts, (2) Generate visual infographics, (3) Create audio summaries (TTS) and videos, or (4) Perform full 'kitchen sink' processing of YouTube content.

DoIT-Artificial-Intelligence
DoIT-Artificial-Intelligence
content-media
open
media
36

youtube-to-docs

Comprehensive suite for processing YouTube videos. Use this when the user needs to: (1) Extract transcripts, (2) Generate visual infographics, (3) Create audio summaries (TTS) and videos, or (4) Perform full 'kitchen sink' processing of YouTube content.

DoIT-Artificial-Intelligence
DoIT-Artificial-Intelligence
content-media
open
media
36

youtube-to-docs

Comprehensive suite for processing YouTube videos. Use this when the user needs to: (1) Extract transcripts, (2) Generate visual infographics, (3) Create audio summaries (TTS) and videos, or (4) Perform full 'kitchen sink' processing of YouTube content.

DoIT-Artificial-Intelligence
DoIT-Artificial-Intelligence
content-media
open
media
36

media-ops

媒体处理与下载——图片编辑(缩放、裁剪、旋转、格式转换、压缩)、视频格式转换、音频提取、视频下载。当用户提到视频下载、图片处理、调整大小、裁剪、旋转、格式转换、提取音频、视频转码、"下载这个视频"、"把图片转成 PNG"、"压缩图片"、"提取背景音乐"、"这个视频转 MP4"、resize、crop、blur、convert、"图片太大了"、"转成 webp"、"视频怎么这么大" 时激活。注意:AI 图片/视频生成已迁移到画布 v3,本 skill 仅处理已有文件和网络视频下载。

OpenLoaf
OpenLoaf
content-media
open
media
35

video-mute

Remove audio tracks from video files. Use when the user needs to strip audio from videos, create silent versions, or remove unwanted soundtracks from MP4, MOV, AVI, MKV, WebM, and other video formats.

meowgorithm
meowgorithm
content-media
open
media
35

image-convert

Convert images between formats (PNG, JPEG, WebP, GIF, BMP, TIFF, AVIF, HEIC) with quality control and resizing. Use when the user needs to convert images, batch process multiple files, optimize image sizes, or convert to modern formats like WebP or AVIF.

meowgorithm
meowgorithm
content-media
open
media
35

image-processing

Process images for documentation - add borders/shadows to screenshots, create GIFs from videos. Use when preparing visual assets.

testomatio
testomatio
content-media
open
media
35

shelby-media

Build video streaming and media applications with Shelby Protocol media packages. Use when working with @shelby-protocol/player for video playback (React video player component, Shaka Player integration, playback controls) or @shelby-protocol/media-prepare for transcoding video/audio with FFmpeg, CMAF packaging for DASH/HLS adaptive streaming, or Widevine DRM encryption.

shelby
shelby
content-media
open
media
35

tvc-kinetic-typography

TVC级动态字幕设计。生成与音乐节奏同步的动态文字方案,让字幕本身成为视觉元素而非信息附属。

lujiaheng-artpivot
lujiaheng-artpivot
content-media
open
media
34

web-audio-api

Web Audio API for JARVIS audio feedback and voice processing

martinholovsky
martinholovsky
content-media
open
media
34

cloudflare-images

Store and transform images with Cloudflare Images API and transformations. Use when: uploading images, implementing direct creator uploads, creating variants, generating signed URLs, optimizing formats (WebP/AVIF), transforming via Workers, or debugging CORS, multipart, or error codes 9401-9413.

ovachiever
ovachiever
content-media
open
media
34

orklev2-audio

Answer questions about orkid's Singularity synthesizer engine, DSP processing, oscillators, filters, envelopes, modulation, sampling, FM synthesis, audio output, and program/bank structure. Use when the user asks about audio, synth, DSP, or sound.

tweakoz
tweakoz
content-media
open
media
34

whisper-transcribe

Transcribe audio/video to accurate subtitles using Whisper AI, with optional translation and delivery. Supports YouTube URLs and local audio/video files. Use when: (1) a YouTube video has no subtitles, (2) auto-generated captions are inaccurate, (3) the user wants high-quality transcription, (4) the user needs translated subtitles, (5) the user wants transcripts sent to email or cloud storage. Triggers: "轉錄", "語音轉文字", "Whisper", "沒有字幕", "字幕不準", "transcribe", "speech to text", "no subtitles", "bad captions", "翻譯字幕", "translate subtitles", "寄到信箱", "上傳到雲端". Make sure to use this skill whenever the user needs transcription beyond what YouTube auto-captions provide, or when yt-search reports no subtitles available.

azuma520
azuma520
content-media
open
media
34

media-library-organizer

Use when organizing media files (movies, TV, anime) on NAS or local storage - cleaning junk files, merging scattered episodes, normalizing folder names to "Title (Year)" format, and verifying episode completeness against TMDB

Innei
Innei
content-media
open
media
34

orklev2-compositor

Answer questions about orkid's compositor system, CompositingData/Scene/Technique, render nodes (Forward/Unlit/Picking), post-FX nodes (ACES/HSVG/Bloom/User), output nodes (Screen/RtGroup/VR/File), presets (ForwardPBR/Unlit/Picking), RtGroup render targets, and Python compositor bindings. Use when the user asks about compositing, post-processing, render targets, or rendering presets.

tweakoz
tweakoz
content-media
open
media
34

nano-banana-pro

Generate/edit images with Nano Banana Pro (Gemini 3.1 Flash Image). Use for image create/modify requests incl. edits. Supports text-to-image + image-to-image; 1K/2K/4K; use --input-image.

antoniolg
antoniolg
content-media
open
media
34

longform-video-clone-edit

End-to-end longform video cloning pipeline. Downloads YouTube video, transcribes, generates HeyGen avatar chunks, detects PIP vs fullscreen vs noface segments, precisely locates webcam bubbles, composites avatar overlay with lip-synced audio. Handles screen recordings with PIP webcam bubbles and fullscreen talking head.

kevinbadi
kevinbadi
content-media
open
media
34

wan-video-clone

Clone Kevin's videos as Kev's Assistant using AI face swap + voice clone + WAN 2.2 animate. Handles long videos by splitting into 5s chunks, processing each through the pipeline, and stitching back together. Trigger when asked to clone a video, create a Kev's Assistant version, or convert Kevin's content into Kev's Assistant content.

kevinbadi
kevinbadi
content-media
open
media
34

youtube-to-heygen-longform

Converts YouTube videos into long-form (10-14 minute) landscape avatar videos using HeyGen AI clone. Analyzes video with Gemini, generates full Alex Hormozi-style script condensed to max 2000 words, creates 1920x1080 landscape avatar video. Trigger when asked to create long-form avatar videos, clone long YouTube videos, or make HeyGen long-form content.

kevinbadi
kevinbadi
content-media
open
media
34

youtube-to-heygen-video

Converts YouTube videos into short-form (60-second) avatar videos using HeyGen AI clone. Analyzes video with Gemini, generates Alex Hormozi-style script, creates vertical 1080x1920 avatar video with social media caption. Trigger when asked to create clone shorts, convert YouTube to avatar video, or make HeyGen short-form content.

kevinbadi
kevinbadi
content-media
open
media
33

asciinema-converter

Convert .cast recordings to .txt for analysis. TRIGGERS - convert cast, cast to txt, strip ANSI, batch convert.

terrylica
terrylica
content-media
open
media
33

asciinema-cast-format

Asciinema v3 .cast file format reference. TRIGGERS - cast format, asciicast spec, event codes, parse cast file.

terrylica
terrylica
content-media
open
Previous
Page 34 / 62
Next