skills.homes capability registry

Media

Audio, video, and image processing.

1476 skills

media

3

advanced-video-downloader

Download and transcribe videos from YouTube, Bilibili, TikTok and 1000+ platforms. Use when user requests video download, transcription (转录/字幕提取: transcription/subtitle extraction), or converting video to text/markdown. Supports quality selection, audio extraction, playlist downloads, cookie-based authentication, and AI-powered transcription via SiliconFlow API (免费转录: free transcription).

Jst-Well-Dan

content-media

media

3

audio-processor

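ffmpeg-based audio conversion and processing. Activates on requests such as "audio conversion" (오디오 변환), "wav conversion" (wav 변환), "sample rate change" (샘플레이트 변경), "mono conversion" (모노 변환), "segment splitting" (세그먼트 분할), or "ffmpeg".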

jiunbae

content-media

media

3

pexels-media

Source royalty-free images and videos from Pexels API for design, placeholders, or content. Supports search, curated/popular content, collections, and multiple resolutions, and always creates detailed sidecar metadata files.

nicepkg

content-media

media

3

image-gen

Generate images using Gemini via ZenMux

MarkShawn2020

content-media

media

3

daw-compatibility-guide

DAW-specific quirks, known issues, and workarounds for Logic Pro, Ableton Live, Pro Tools, Cubase, Reaper, FL Studio, and Bitwig, with format-specific requirements (AU/VST3/AAX). Use when troubleshooting DAW compatibility, fixing host-specific bugs, implementing DAW workarounds, passing auval validation, or debugging automation issues.

yebot

content-media

media

3

audio-reactive

Binding audio analysis data to visual parameters including smoothing, beat detection responses, and frequency-to-visual mappings. Use when creating audio visualizers, music-reactive animations, or any visual effect driven by audio input.

Bbeierle12

content-media

media

2

deepgram-transcription

Transcribe audio and video files using the Deepgram API. This skill should be used when the user requests transcription of audio files (mp3, wav, m4a, aac) or video files (mp4, mov, avi, etc.). Handles large video files by extracting audio first to reduce upload size and processing time.

AgentiveAU

content-media

media

2

transcribe

Transcribe audio files from meetings into text documents using Whisper. Use when the user types /transcribe, has a new audio recording, or when RA detects new audio files in meetings/audio/. Supports speaker diarization with pyannote.

braselog

content-media

media

2

art-icon-creator

This skill should be used when creating artistic icon variations from images. It generates 10 different greyscale icon styles from a single image source, automatically compressing to under 20KB with high contrast appearance. Supports both URL and local file inputs.

bennoloeffler

content-media

media

2

managing-fighter-images

Use this skill when working with UFC fighter images including downloading from multiple sources (Wikimedia, Sherdog, Bing), detecting and replacing placeholder images, handling duplicates, normalizing image sizes, validating image quality, syncing filesystem to database, or running the complete image pipeline. Handles missing images, batch downloads, and multi-source orchestration.

wolfiesch

content-media

media

2

resize-image

Resize images using ImageMagick when they are too large to view or process. Use this skill when you encounter an image file that exceeds token limits, is too large to read, or when you need to create a smaller version of an image for viewing.

ponderingBGI

content-media

media

2

remotion-best-practices

Best practices for Remotion: video creation in React

connorads

content-media

media

2

media-processing

Process multimedia files with FFmpeg (video/audio encoding, conversion, streaming, filtering, hardware acceleration) and ImageMagick (image manipulation, format conversion, batch processing, effects, composition). Use when converting media formats, encoding videos with specific codecs (H.264, H.265, VP9), resizing/cropping images, extracting audio from video, applying filters and effects, optimizing file sizes, creating streaming manifests (HLS/DASH), generating thumbnails, batch processing images, creating composite images, or implementing media processing pipelines. Supports 100+ formats, hardware acceleration (NVENC, QSV), and complex filtergraphs.

vibery-studio

content-media

media

2

td-filter

Digital filtering for noise reduction and signal enhancement

teradata-labs

content-media

media

2

openai-image-edit

Edit images via OpenAI gpt-image-1 API. Creates edited or extended images from source images and a prompt. Supports masks for selective editing, multiple input images for compositing, and input fidelity control for preserving facial features. Use when user wants to modify existing images, combine multiple images, remove/replace objects, extend images, or needs AI-powered image editing with reference images.

LarsEckart

content-media

media

2

sound-effect-sourcing

Sound effect sourcing from Adobe Audition, Freesound, ElevenLabs text-to-sound-effects, and audio library management for professional productions. Use when adding sound effects, building audio libraries, or creating immersive soundscapes.

onesmartguy

content-media

media

2

smart-reading

Use when reading files or command output of unknown size to avoid blind truncation and context loss

axiomantic

content-media

media

2

td-smoothing

Signal smoothing and noise reduction techniques

teradata-labs

content-media

media

2

manga-reader

Reader screen patterns and image handling

chiraitori

content-media

media

2

video-transcript-downloader

This skill should be used when the user asks to "download this video", "get the transcript", "save this clip", "rip audio from", "get subtitles for", "transcribe this video", or mentions YouTube URLs, yt-dlp, or video/audio extraction. Use for any video downloading, transcript extraction, or format troubleshooting.

sevos

content-media

media

2

asset-catalog-optimizer

Analyze and optimize Xcode asset catalogs - find unused assets, missing resolutions, compress images

paleoterra

content-media

media

2

mermaid-reverse-attempt

Mermaid URL codec - encodes/decodes #base64: (amp CLI) and #pako: (mermaid.live) formats

plurigrid

content-media

media

2

td-resample

Signal resampling and interpolation for rate conversion

teradata-labs

content-media

media

2

audio-editing-automation

FFmpeg audio processing, batch editing, normalization, mixing, and automated audio production workflows. Use when processing audio at scale, automating editing tasks, or building audio pipelines.

onesmartguy

content-media
