Media

Audio, video, and image processing.

media
46

agency-inclusive-visuals-specialist

Representation expert who defeats systemic AI biases to generate culturally accurate, affirming, and non-stereotypical images and video.

mk-knight23
content-media
media
46

hxaudio-player

HX Audio Player for music and sound playback. Use when implementing music playback, sound effects, HXMusic, HXSound, or audio in HXAudioPlayer. Library is in the hxaudio module.

huhx0015
content-media
media
46

media-toolkit

Process audio and video with clipping, conversion, analysis, captions, thumbnails, GIFs, and batch utilities. Use for practical media manipulation workflows.

dkyazzentwatwa
content-media
media
44

asr

Transcribe audio files to text using local speech recognition. Triggers on: "转录" (transcribe), "transcribe", "语音转文字" (speech to text), "ASR", "识别音频" (recognize audio), "把这段音频转成文字" (convert this audio to text).

marswaveai
content-media
media
43

ai-content-pipeline

Generate AI content (images, videos, audio, avatars) and analyze videos with detailed timelines using YAML pipelines with 51 models across 8 categories. Includes video analysis with Gemini 3 Pro.

donghaozhang
content-media
media
43

video-frames

Extract frames or short clips from videos using ffmpeg.

NJX-njx
content-media
media
43

nano-banana-pro

Generate or edit images via Gemini 3 Pro Image (Nano Banana Pro).

NJX-njx
content-media
media
43

refactor

Refactor large files into smaller sub-components. Do not create more large files. Refactor into modular, focused units.

regression-io
content-media
media
43

camsnap

Capture frames or clips from RTSP/ONVIF cameras.

NJX-njx
content-media
media
42

kirby-performance-and-media

Improves Kirby performance and media delivery (cache tuning, CDN, responsive images, lazy loading). Use when optimizing page speed, caching, or image handling.

bnomei
content-media
media
42

image-enhancer

Improves the quality of images, especially screenshots, by enhancing resolution, sharpness, and clarity. Perfect for preparing images for presentations, documentation, or social media posts.

vuralserhat86
content-media
media
41

video-creator

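Long-form video generation expert. Use when you need to generate video content with visuals whose duration exceeds the single-clip limit (4-12 seconds). Covers a complete workflow of storyboard scripting, image sequence generation, video clip generation, and stitching. Supports maintaining character consistency.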

hirogoing
content-media
media
41

virtual-anchor

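Virtual avatar video generation expert. Use for combining an image and audio into a lip-synced virtual avatar video. Covers a complete workflow of character image generation, face detection, and avatar synthesis.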

hirogoing
content-media
media
41

analyze-video

Analyze, fact-check, and detect bias in videos.

glowingkitty
content-media
media
41

seo-images

Image optimization analysis for SEO and performance. Checks alt text, file sizes, formats, responsive images, lazy loading, and CLS prevention. Use when user says "image optimization", "alt text", "image SEO", "image size", or "image audit".

mits-pl
content-media
media
41

video-generator

AI video production workflow using Remotion. Use when creating videos, short films, commercials, or motion graphics. Triggers on requests to make promotional videos, product demos, social media videos, animated explainers, or any programmatic video content. Produces polished motion graphics, not slideshows.

panaversity
content-media
media
40

source-compare

This skill should be used when the user asks "compare these videos", "which source is better", "compare blu-ray vs web", "which release should I use", "compare video quality", or needs to evaluate multiple versions of the same content to determine which has better quality.

robbyt
content-media
media
40

telecine-detect

This skill should be used when the user asks "is this interlaced", "detect telecine", "video has combing", "should I deinterlace", "3:2 pulldown", "inverse telecine", or sees horizontal lines/combing artifacts in video and wants to understand if the content is truly interlaced or telecined.

robbyt
content-media
media
40

subtitle-audit

This skill should be used when the user asks "check subtitles", "audit subs", "subtitle font issues", "ASS vs SRT", "missing fonts", "subtitle timing", or wants to analyze subtitle tracks for issues like missing fonts, timing problems, or format limitations.

robbyt
content-media
media
40

artifact-detect

This skill should be used when the user asks "what's wrong with this video", "why does this video look bad", "detect video artifacts", "find quality issues", "video has artifacts", "identify compression artifacts", or wants to diagnose specific quality problems in a video file.

robbyt
content-media
media
40

format-explain

This skill should be used when the user asks "what does this mediainfo mean", "explain this video format", "what is BT.709", "what is H.264", "container vs codec", "why is this 10bit", "what does limited range mean", or wants educational explanations of video technical concepts.

robbyt
content-media
media
40

video-audit

This skill should be used when the user asks to "audit this video", "analyze video quality", "check this video file", "is this video good quality", "should I reencode this", "what format is this video", or wants to understand a video file's technical properties and quality before working with it.

robbyt
content-media