category focus

Media

Audio, video, and image processing.

1476টি স্কিলall categories
sorting
stars
current ordering strategy
query
all entries
refine the visible subset
media
16

catsharp-sonification

Sonify GF(3) color streams via CatSharp scale. Maps Gay.jl colors to pitch classes and plays through sox. No voice synthesis.

plurigrid
plurigrid
content-media
open
media
16

ffmpeg

Media processing (10 man pages).

plurigrid
plurigrid
content-media
open
media
16

media

Media processing = ffmpeg + imagemagick + sox.

plurigrid
plurigrid
content-media
open
media
16

image-enhancer

Improves the quality of images, especially screenshots, by enhancing resolution, sharpness, and clarity. Perfect for preparing images for presentations, documentation, or social media posts.

plurigrid
plurigrid
content-media
open
media
16

image-enhancer

Improves the quality of images, especially screenshots, by enhancing

plurigrid
plurigrid
content-media
open
media
16

video-downloader

Downloads videos from YouTube and other platforms for offline viewing,

plurigrid
plurigrid
content-media
open
media
16

performing-steganography-detection

Detect and extract hidden data embedded in images, audio, and other media files using steganalysis tools to uncover covert communication channels.

plurigrid
plurigrid
content-media
open
media
16

video-processor

Automated video processing: metadata extraction, thumbnails, transcoding, audio extraction with DuckDB tracking

plurigrid
plurigrid
content-media
open
media
16

yt-playlist-acset

Extract transcripts from YouTube playlists into DuckDB ACSet schema. Uses pytubefix + mlx-whisper on Apple Silicon. Supports auto-captions and local transcription fallback.

plurigrid
plurigrid
content-media
open
media
16

sense

sense - Diagrammatic Video Extraction with Subtitle Alignment

plurigrid
plurigrid
content-media
open
media
16

recovering-deleted-files-with-photorec

Recover deleted files from disk images and storage media using PhotoRec's file signature-based carving engine regardless of file system damage.

plurigrid
plurigrid
content-media
open
media
16

performing-file-carving-with-foremost

Recover files from disk images and unallocated space using Foremost's header-footer signature carving to extract evidence regardless of file system state.

plurigrid
plurigrid
content-media
open
media
16

media

Media processing = ffmpeg + imagemagick + sox.

plurigrid
plurigrid
content-media
open
media
16

analyzing-disk-image-with-autopsy

Perform comprehensive forensic analysis of disk images using Autopsy to recover files, examine artifacts, and build investigation timelines.

plurigrid
plurigrid
content-media
open
media
16

ffmpeg-media

FFmpeg media processing. Video/audio transcoding, stream manipulation, and filter graphs.

plurigrid
plurigrid
content-media
open
media
16

live-recording

Always-on audio capture via whisper-cpp to org file with Emacs live display

plurigrid
plurigrid
content-media
open
media
16

remotion-best-practices

Domain-specific guidance for Remotion video work in this repository. Use when creating, editing, or reviewing Remotion compositions, animations, captions, audio handling, transitions, or media-processing workflows.

co-r-e
co-r-e
content-media
open
media
16

nanobanana-image-edit

Edits existing images via Gemini API and updates them in DexCode slide decks. Sends the original image with an edit prompt to apply targeted modifications such as removing objects, changing colors, or adding elements. Use when user says "edit image", "fix image", "modify image", "remove the background", or the Japanese equivalents "画像を編集", "画像を修正", "画像を直して". Key capabilities: in-place overwrite or save-as-new, visual verification before and after edit, aspect ratio and resolution control, English prompt optimization for best Gemini results.

co-r-e
co-r-e
content-media
open
media
16

qasai

Image compression CLI with lossless/lossy options, multiple engines, batch processing, and format conversion. Use when compressing, optimizing, or converting images.

ahmadawais
ahmadawais
content-media
open
media
15

video-tool

Video processing toolkit. Use when user wants to: - Download videos from YouTube or other sites - Remove silence from videos - Trim, cut, or extract segments from videos - Extract audio from video files - Enhance or denoise audio - Replace audio track in a video - Change video playback speed - Concatenate multiple videos - Generate transcripts/captions (VTT) - Generate video descriptions, timestamps, or context cards - Upload videos to YouTube or Bunny.net CDN - Post social updates to X (Twitter) or LinkedIn - Get video metadata (duration, resolution, codec)

alejandro-ao
alejandro-ao
content-media
open
media
15

video-summarizer

Download videos from URLs (YouTube, Bilibili, and any yt-dlp supported platform), transcribe speech to text using Whisper, generate a structured summary, and save both the summary and full transcript as linked Obsidian notes. Use this skill whenever the user wants to summarize a video, transcribe video content, extract key points from a video, or save video notes to Obsidian. Also trigger when the user shares a video URL and asks for analysis, notes, or a recap.

Zerone-Agent
Zerone-Agent
content-media
open
media
15

video-edit

Edit talking-head videos by removing silences with neural VAD and adding 3D swivel teaser transitions. Use when user asks to edit video, remove silences, add jump cuts, or create video teasers.

nickjwells
nickjwells
content-media
open
Previous
Page 47 / 62
Next