category focus

Media

Audio, video, and image processing.

1476 اسکلزall categories
sorting
stars
current ordering strategy
query
all entries
refine the visible subset
media
3

advanced-video-downloader

Download and transcribe videos from YouTube, Bilibili, TikTok and 1000+ platforms. Use when user requests video download, transcription (转录/字幕提取), or converting video to text/markdown. Supports quality selection, audio extraction, playlist downloads, cookie-based authentication, and AI-powered transcription via SiliconFlow API (免费转录).

Jst-Well-Dan
Jst-Well-Dan
content-media
open
media
3

audio-processor

ffmpeg 기반 오디오 변환 및 처리. "오디오 변환", "wav 변환", "샘플레이트 변경", "모노 변환", "세그먼트 분할", "ffmpeg" 요청 시 활성화됩니다.

jiunbae
jiunbae
content-media
open
media
3

pexels-media

Source royalty-free images and videos from Pexels API for design, placeholders, or content. Supports search, curated/popular content, collections, multiple resolutions, and ALWAYS creates detailed sidecar metadata files.

nicepkg
nicepkg
content-media
open
media
3

image-gen

Generate images using Gemini via ZenMux

MarkShawn2020
MarkShawn2020
content-media
open
media
3

daw-compatibility-guide

DAW-specific quirks, known issues, and workarounds for Logic Pro, Ableton Live, Pro Tools, Cubase, Reaper, FL Studio, Bitwig with format-specific requirements (AU/VST3/AAX). Use when troubleshooting DAW compatibility, fixing host-specific bugs, implementing DAW workarounds, passing auval validation, or debugging automation issues.

yebot
yebot
content-media
open
media
3

audio-reactive

Binding audio analysis data to visual parameters including smoothing, beat detection responses, and frequency-to-visual mappings. Use when creating audio visualizers, music-reactive animations, or any visual effect driven by audio input.

Bbeierle12
Bbeierle12
content-media
open
media
2

deepgram-transcription

Transcribe audio and video files using the Deepgram API. This skill should be used when the user requests transcription of audio files (mp3, wav, m4a, aac) or video files (mp4, mov, avi, etc.). Handles large video files by extracting audio first to reduce upload size and processing time.

AgentiveAU
AgentiveAU
content-media
open
media
2

transcribe

Transcribe audio files from meetings into text documents using Whisper. Use when the user types /transcribe, has a new audio recording, or when RA detects new audio files in meetings/audio/. Supports speaker diarization with pyannote.

braselog
braselog
content-media
open
media
2

art-icon-creator

This skill should be used when creating artistic icon variations from images. It generates 10 different greyscale icon styles from a single image source, automatically compressing to under 20KB with high contrast appearance. Supports both URL and local file inputs.

bennoloeffler
bennoloeffler
content-media
open
media
2

managing-fighter-images

Use this skill when working with UFC fighter images including downloading from multiple sources (Wikimedia, Sherdog, Bing), detecting and replacing placeholder images, handling duplicates, normalizing image sizes, validating image quality, syncing filesystem to database, or running the complete image pipeline. Handles missing images, batch downloads, and multi-source orchestration.

wolfiesch
wolfiesch
content-media
open
media
2

resize-image

Resize images using ImageMagick when they are too large to view or process. Use this skill when you encounter an image file that exceeds token limits, is too large to read, or when you need to create a smaller version of an image for viewing.

ponderingBGI
ponderingBGI
content-media
open
media
2

media-processing

Process multimedia files with FFmpeg (video/audio encoding, conversion, streaming, filtering, hardware acceleration) and ImageMagick (image manipulation, format conversion, batch processing, effects, composition). Use when converting media formats, encoding videos with specific codecs (H.264, H.265, VP9), resizing/cropping images, extracting audio from video, applying filters and effects, optimizing file sizes, creating streaming manifests (HLS/DASH), generating thumbnails, batch processing images, creating composite images, or implementing media processing pipelines. Supports 100+ formats, hardware acceleration (NVENC, QSV), and complex filtergraphs.

vibery-studio
vibery-studio
content-media
open
media
2

td-filter

Digital filtering for noise reduction and signal enhancement

teradata-labs
teradata-labs
content-media
open
media
2

openai-image-edit

Edit images via OpenAI gpt-image-1 API. Creates edited or extended images from source images and a prompt. Supports masks for selective editing, multiple input images for compositing, and input fidelity control for preserving facial features. Use when user wants to modify existing images, combine multiple images, remove/replace objects, extend images, or needs AI-powered image editing with reference images.

LarsEckart
LarsEckart
content-media
open
media
2

sound-effect-sourcing

Sound effect sourcing from Adobe Audition, Freesound, ElevenLabs text-to-sound-effects, and audio library management for professional productions. Use when adding sound effects, building audio libraries, or creating immersive soundscapes.

onesmartguy
onesmartguy
content-media
open
media
2

smart-reading

Use when reading files or command output of unknown size to avoid blind truncation and context loss

axiomantic
axiomantic
content-media
open
media
2

td-smoothing

Signal smoothing and noise reduction techniques

teradata-labs
teradata-labs
content-media
open
media
2

manga-reader

Reader screen patterns and image handling

chiraitori
chiraitori
content-media
open
media
2

video-transcript-downloader

This skill should be used when the user asks to "download this video", "get the transcript", "save this clip", "rip audio from", "get subtitles for", "transcribe this video", or mentions YouTube URLs, yt-dlp, or video/audio extraction. Use for any video downloading, transcript extraction, or format troubleshooting.

sevos
sevos
content-media
open
media
2

asset-catalog-optimizer

Analyze and optimize Xcode asset catalogs - find unused assets, missing resolutions, compress images

paleoterra
paleoterra
content-media
open
media
2

mermaid-reverse-attempt

Mermaid URL codec - encodes/decodes #base64: (amp CLI) and #pako: (mermaid.live) formats

plurigrid
plurigrid
content-media
open
media
2

td-resample

Signal resampling and interpolation for rate conversion

teradata-labs
teradata-labs
content-media
open
media
2

audio-editing-automation

FFmpeg audio processing, batch editing, normalization, mixing, and automated audio production workflows. Use when processing audio at scale, automating editing tasks, or building audio pipelines.

onesmartguy
onesmartguy
content-media
open
Previous
Page 53 / 62
Next