home/categories/media

category focus

Media

Audio, video, and image processing.

1476 स्किल्सall categories

sorting

stars

current ordering strategy

query

all entries

refine the visible subset

media

catsharp-sonification

Sonify GF(3) color streams via CatSharp scale. Maps Gay.jl colors to pitch classes and plays through sox. No voice synthesis.

plurigrid

content-media

open

media

ffmpeg

Media processing (10 man pages).

plurigrid

content-media

open

media

Media processing = ffmpeg + imagemagick + sox.

plurigrid

content-media

open

media

image-enhancer

Improves the quality of images, especially screenshots, by enhancing resolution, sharpness, and clarity. Perfect for preparing images for presentations, documentation, or social media posts.

plurigrid

content-media

open

media

image-enhancer

Improves the quality of images, especially screenshots, by enhancing

plurigrid

content-media

open

media

video-downloader

Downloads videos from YouTube and other platforms for offline viewing,

plurigrid

content-media

open

media

performing-steganography-detection

Detect and extract hidden data embedded in images, audio, and other media files using steganalysis tools to uncover covert communication channels.

plurigrid

content-media

open

media

video-processor

Automated video processing: metadata extraction, thumbnails, transcoding, audio extraction with DuckDB tracking

plurigrid

content-media

open

media

yt-playlist-acset

Extract transcripts from YouTube playlists into DuckDB ACSet schema. Uses pytubefix + mlx-whisper on Apple Silicon. Supports auto-captions and local transcription fallback.

plurigrid

content-media

open

media

sense

sense - Diagrammatic Video Extraction with Subtitle Alignment

plurigrid

content-media

open

media

bmorphism-video-interleave

bmorphism Video Archive Interleave

plurigrid

content-media

open

media

recovering-deleted-files-with-photorec

Recover deleted files from disk images and storage media using PhotoRec's file signature-based carving engine regardless of file system damage.

plurigrid

content-media

open

media

performing-file-carving-with-foremost

Recover files from disk images and unallocated space using Foremost's header-footer signature carving to extract evidence regardless of file system state.

plurigrid

content-media

open

media

Media processing = ffmpeg + imagemagick + sox.

plurigrid

content-media

open

media

acquiring-disk-image-with-dd-and-dcfldd

Create forensically sound bit-for-bit disk images using dd and dcfldd while preserving evidence integrity through hash verification.

plurigrid

content-media

open

media

analyzing-disk-image-with-autopsy

Perform comprehensive forensic analysis of disk images using Autopsy to recover files, examine artifacts, and build investigation timelines.

plurigrid

content-media

open

media

ffmpeg-media

FFmpeg media processing. Video/audio transcoding, stream manipulation, and filter graphs.

plurigrid

content-media

open

media

live-recording

Always-on audio capture via whisper-cpp to org file with Emacs live display

plurigrid

content-media

open

media

Domain-specific guidance for Remotion video work in this repository. Use when creating, editing, or reviewing Remotion compositions, animations, captions, audio handling, transitions, or media-processing workflows.

co-r-e

content-media

open

media

nanobanana-image-edit

Edits existing images via Gemini API and updates them in DexCode slide decks. Sends the original image with an edit prompt to apply targeted modifications such as removing objects, changing colors, or adding elements. Use when user says "edit image", "fix image", "modify image", "remove the background", or the Japanese equivalents "画像を編集", "画像を修正", "画像を直して". Key capabilities: in-place overwrite or save-as-new, visual verification before and after edit, aspect ratio and resolution control, English prompt optimization for best Gemini results.

co-r-e

content-media

open

media

qasai

Image compression CLI with lossless/lossy options, multiple engines, batch processing, and format conversion. Use when compressing, optimizing, or converting images.

ahmadawais

content-media

open

media

video-tool

Video processing toolkit. Use when user wants to: - Download videos from YouTube or other sites - Remove silence from videos - Trim, cut, or extract segments from videos - Extract audio from video files - Enhance or denoise audio - Replace audio track in a video - Change video playback speed - Concatenate multiple videos - Generate transcripts/captions (VTT) - Generate video descriptions, timestamps, or context cards - Upload videos to YouTube or Bunny.net CDN - Post social updates to X (Twitter) or LinkedIn - Get video metadata (duration, resolution, codec)

alejandro-ao

content-media

open

media

video-summarizer

Download videos from URLs (YouTube, Bilibili, and any yt-dlp supported platform), transcribe speech to text using Whisper, generate a structured summary, and save both the summary and full transcript as linked Obsidian notes. Use this skill whenever the user wants to summarize a video, transcribe video content, extract key points from a video, or save video notes to Obsidian. Also trigger when the user shares a video URL and asks for analysis, notes, or a recap.

Zerone-Agent

content-media

open

media

video-edit

Edit talking-head videos by removing silences with neural VAD and adding 3D swivel teaser transitions. Use when user asks to edit video, remove silences, add jump cuts, or create video teasers.

nickjwells

content-media

open

Page 47 / 62