Media

Audio, video, and image processing.

1476 skills
media
5

video-editing-advisor

Video editing expert covering cutting techniques, pacing, color grading, and post-production workflows

sandraschi
content-media
media
5

ffmpeg-helper

Guidelines for generating valid FFmpeg commands for the mobile FFmpeg Kit.

tsiresymila1
content-media
media
5

audio-normalizer

Use when asked to normalize audio volume, match loudness, or apply peak/RMS normalization to audio files.

dkyazzentwatwa
content-media
media
5

audio-trimmer

Cut, trim, and edit audio segments with fade effects, speed control, concatenation, and basic audio manipulations.

dkyazzentwatwa
content-media
media
5

image-metadata-tool

Extract EXIF metadata from images including GPS coordinates, camera settings, and timestamps. Map photo locations and strip metadata for privacy.

dkyazzentwatwa
content-media
media
5

video-metadata-inspector

Use when asked to inspect video file metadata, get video duration, resolution, codec information, frame rate, or bitrate.

dkyazzentwatwa
content-media
media
5

image-files

Image manipulation using ImageMagick command-line tools for resizing, converting, optimizing, and batch processing

lawless-m
content-media
media
5

pexels-media

Source royalty-free images and videos from Pexels API for design, placeholders, or content. Supports search, curated/popular content, collections, multiple resolutions, and ALWAYS creates detailed sidecar metadata files.

troykelly
content-media
media
4

transcription-helper

Guides users through video transcription workflow from input to output. Transcribes local video files and YouTube URLs using gpt-4o-transcribe. Use when users want to transcribe videos, audio files, YouTube content, or need help with media-to-text conversion.

costiash
content-media
media
4

image-optimizer

Optimizes images for web performance by converting to modern formats, compressing, and generating responsive sizes. Use when user asks to "optimize images", "compress images", "convert to webp", or mentions image performance.

Dexploarer
content-media
media
4

video-toolkit

Video analysis and editing with FFmpeg and Whisper. This skill should be used when video files are shared (.mov, .mp4, .avi, etc.) or when you encounter "cannot read binary files" errors for video files, when users request video analysis or summarization, or when users ask to edit videos (clip, merge, split).

emdashcodes
content-media
media
4

ffmpeg

Extract audio and transcode MP4 to WebM using ffmpeg.

Th0rgal
content-media
media
4

video-editing

Convert, edit, and process video and audio files using ffmpeg and ffprobe. Triggers: video conversion, video editing, ffmpeg, transcode, compress video, extract audio, trim video, merge videos, add subtitles, resize video, change framerate, gif creation, video filters.

Th0rgal
content-media
media
4

img-optimize

This skill should be used when optimizing, converting, or resizing images. Trigger when user mentions image optimization, HEIC/HEIF conversion, JPEG/PNG/WebP/AVIF processing, reducing image file size, or batch image processing. Uses sharp-cli as the primary tool.

lttr
content-media
media
4

startup-portrait

Transform team photos into professional startup portraits using fal.ai Nano Banana Pro image-to-image API.

vm0-ai
content-media
media
3

video-presentation-skill

Generate interactive HTML presentations for ANY video type (tutorials, comparisons, fact-checks, explainers, etc.). Creates self-contained, screen-recording-optimized slides with various content types including comparisons, steps, code blocks, calculators, and verdicts. Use when user wants visual aids for their videos.

SGobet
content-media
media
3

transcribe-and-analyze

Transcribe audio and video from URLs (YouTube, direct media links) using WhisperKit locally. Optionally analyze transcripts with AI when explicitly requested. Use when users provide URLs to media content and request transcription or speech-to-text conversion.

nicepkg
content-media
media
3

imagekit-upload

Upload images to ImageKit from file paths or clipboard, returning the CDN URL for easy sharing and embedding

kevinslin
content-media
media
3

postfx-effects

Post-processing visual effects including chromatic aberration, vignette, depth of field, film grain, color grading, and LUT support. Use when adding cinematic polish, retro aesthetics, camera simulation, or atmospheric effects to 3D scenes. Essential for mood, style, and visual storytelling.

Bbeierle12
content-media
media
3

audio-playback

Audio playback using Tone.js including players, transport, scheduling, and loading audio. Use when implementing background music, sound effects, audio synchronization, or timed audio events. Essential for any audio-enabled web application.

Bbeierle12
content-media
media
3

audio-engineer

Activate this skill when users need help with audio configuration, troubleshooting, or optimization in OBS. Triggers include requests like "fix my audio", "adjust microphone levels", "mute desktop audio", "balance my audio sources", "check audio levels", or diagnosing audio issues like echo, distortion, or missing sound. This skill orchestrates audio tools to ensure professional sound quality.

ironystock
content-media
media
3

dsp-cookbook

Production-ready DSP algorithms including filters, compressors, delays, modulation effects, saturation, and distortion with JUCE integration and optimization techniques. Use when implementing audio processing, DSP algorithms, audio effects, dynamics processors, or need code examples for common audio operations.

yebot
content-media
media
3

image-manipulation

Process and manipulate images using ImageMagick. Supports resizing, format conversion, batch processing, and retrieving image metadata. Use when working with images, creating thumbnails, resizing wallpapers, or performing batch image operations.

Visual-Studio-Wallpapers
content-media
media
3

audio-analysis

Audio analysis with Tone.js and Web Audio API including FFT, frequency data extraction, amplitude measurement, and waveform analysis. Use when extracting audio data for visualizations, beat detection, or any audio-reactive features.

Bbeierle12
content-media