category focus

Media

Audio, video, and image processing.

1476 مهارةall categories
sorting
stars
current ordering strategy
query
all entries
refine the visible subset
media
261

dotnet-libvlc

Expert knowledge of the libvlc C API (3.x and 4.x), the multimedia framework behind VLC media player. Use when helping with LibVLC or LibVLCSharp for media playback, streaming, or transcoding.

managedcode
managedcode
content-media
open
media
254

vhs-recording

Generate terminal recordings using VHS tape files, produces GIF outputs.

athola
athola
content-media
open
media
254

media-composition

Combine media assets (GIFs, videos) into composite tutorials with vertical/horizontal

athola
athola
content-media
open
media
254

gif-generation

Post-process video files and generate optimized GIFs. Converts webm/mp4

athola
athola
content-media
open
media
248

transcription

Audio/video transcription using OpenAI Whisper. Covers installation, model selection, transcript formats (SRT, VTT, JSON), timing synchronization, and speaker diarization. Use when transcribing media or generating subtitles.

MadAppGang
MadAppGang
content-media
open
media
248

final-cut-pro

Apple Final Cut Pro FCPXML format reference. Covers project structure, timeline creation, clip references, effects, and transitions. Use when generating FCP projects or understanding FCPXML structure.

MadAppGang
MadAppGang
content-media
open
media
248

ffmpeg-core

FFmpeg fundamentals for video/audio manipulation. Covers common operations (trim, concat, convert, extract), codec selection, filter chains, and performance optimization. Use when planning or executing video processing tasks.

MadAppGang
MadAppGang
content-media
open
media
247

ponyflash

Generate images, videos, speech audio, and music using the PonyFlash Python SDK. Also handle local media editing with FFmpeg, including clip, concat, transcode, extract audio, frame capture, subtitle capability checks, and ASS subtitle prep. Use when the user asks to create, generate, produce, edit, trim, merge, concatenate, transcode, subtitle, or render AI-generated media content.

aiskillstore
aiskillstore
content-media
open
media
247

p-video

Generate videos with Pruna P-Video and WAN models via inference.sh CLI. Models: P-Video, WAN-T2V, WAN-I2V. Capabilities: text-to-video, image-to-video, audio support, 720p/1080p, fast inference. Pruna optimizes models for speed without quality loss. Triggers: pruna video, p-video, pruna ai video, fast video generation, optimized video, wan t2v, wan i2v, economic video generation, cheap video generation, pruna text to video, pruna image to video

aiskillstore
aiskillstore
content-media
open
media
238

gpt-image-1-5

Generate and edit images using OpenAI's GPT Image 1.5 model. Use when the user asks to generate, create, edit, modify, change, alter, or update images. Also use when user references an existing image file and asks to modify it in any way (e.g., "modify this image", "change the background", "replace X with Y"). Supports text-to-image generation and image editing with optional mask. DO NOT read the image file first - use this skill directly with the --input-image parameter.

intellectronica
intellectronica
content-media
open
media
238

gpt-image-1-5

Generate and edit images using OpenAI's GPT Image 1.5 model. Use when the user asks to generate, create, edit, modify, change, alter, or update images. Also use when user references an existing image file and asks to modify it in any way (e.g., "modify this image", "change the background", "replace X with Y"). Supports text-to-image generation and image editing with optional mask. DO NOT read the image file first - use this skill directly with the --input-image parameter.

intellectronica
intellectronica
content-media
open
media
238

nano-banana-pro

Generate and edit images using Google's Nano Banana Pro (Gemini 3 Pro Image) API. Use when the user asks to generate, create, edit, modify, change, alter, or update images. Also use when user references an existing image file and asks to modify it in any way (e.g., "modify this image", "change the background", "replace X with Y"). Supports both text-to-image generation and image-to-image editing with configurable resolution (1K default, 2K, or 4K for high resolution). DO NOT read the image file first - use this skill directly with the --input-image parameter.

intellectronica
intellectronica
content-media
open
media
238

nano-banana-2

Generate and edit images using Google's Nano Banana 2 (Gemini 3.1 Flash Image Preview) API. This skill should be used when the user asks to create or modify images, especially when they need fast iteration, explicit aspect-ratio control, or resolution control from 512px to 4K.

intellectronica
intellectronica
content-media
open
media
238

nano-banana-2

Generate and edit images using Google's Nano Banana 2 (Gemini 3.1 Flash Image Preview) API. This skill should be used when the user asks to create or modify images, especially when they need fast iteration, explicit aspect-ratio control, or resolution control from 512px to 4K.

intellectronica
intellectronica
content-media
open
media
235

qqbot-media

QQBot 富媒体收发能力。使用 <qqmedia> 标签,系统根据文件扩展名自动识别类型(图片/语音/视频/文件)。

ryanlee-gemini
ryanlee-gemini
content-media
open
media
225

music-search

搜索和播放音乐 / Search & play music. 当用户想要:(1) 随机点一首歌/随机推荐一首歌/帮我放首歌 (2) 搜索某首歌曲/点播指定歌曲/播放某个歌手的歌 (3) 获取歌曲播放链接或封面 (4) 在网易云/酷狗/酷我/汽水/QQ音乐等平台查找音乐时使用。Use when user wants to: randomly recommend or play a song, search for a song by name/artist, play specific music, get play URL or cover image from NetEase/KuGou/KuWo/QiShui/QQ Music.

Lingyuzhou111
Lingyuzhou111
content-media
open
media
224

transcribe-tool

Audio transcription tool. Converts audio files to text with Whisper and optional LLM post-processing. Use when: transcribing meetings, podcasts, or extracting text from recorded audio files.

xuiltul
xuiltul
content-media
open
media
224

transcribe-tool

음성 문자 변환 도구. Whisper로 오디오를 텍스트로 바꾸고 필요 시 LLM 후처리한다. Use when: 회의 녹음 전사, 팟캐스트 텍스트화, 녹음 파일에서 본문 추출이 필요할 때.

xuiltul
xuiltul
content-media
open
media
216

video-subtitle-cutter

Transcribe video, analyze subtitles with AI, and cut video by removing filler words, pauses, and mistakes

different-ai
different-ai
content-media
open
media
216

youtube-rl-tracker

Track YouTube video performance for "poor man's reinforcement learning" - learn what thumbnails, titles, and hooks work

different-ai
different-ai
content-media
open
media
216

media-accessibility

Video, audio, and streaming media accessibility specialist. Audits captions (WebVTT/SRT), transcripts, audio descriptions, accessible media player controls, and WCAG 1.2.x time-based media criteria.

Community-Access
Community-Access
content-media
open
media
216

media-accessibility

Video, audio, and streaming media accessibility reference. WebVTT/SRT/TTML caption formats, audio description patterns, accessible media player ARIA, WCAG 1.2.x criteria mapping, caption quality guidelines, and live captioning integration.

Community-Access
Community-Access
content-media
open
media
216

media-accessibility

Video, audio, and streaming media accessibility specialist. Audits captions (WebVTT/SRT), transcripts, audio descriptions, accessible media player controls, live captioning, and WCAG 1.2.x time-based media criteria.

Community-Access
Community-Access
content-media
open
Previous
Page 18 / 62
Next