downloads-organizer
Automatically organize and clean up the downloads folder by categorizing files, removing duplicates, and optimizing storage space
Comprehensive suite for processing YouTube videos. Use this when the user needs to: (1) Extract transcripts, (2) Generate visual infographics, (3) Create audio summaries (TTS) and videos, or (4) Perform full 'kitchen sink' processing of YouTube content.
Remove audio tracks from video files. Use when the user needs to strip audio from videos, create silent versions, or remove unwanted soundtracks from MP4, MOV, AVI, MKV, WebM, and other video formats.
Convert images between formats (PNG, JPEG, WebP, GIF, BMP, TIFF, AVIF, HEIC) with quality control and resizing. Use when the user needs to convert images, batch process multiple files, optimize image sizes, or convert to modern formats like WebP or AVIF.
Process images for documentation - add borders/shadows to screenshots, create GIFs from videos. Use when preparing visual assets.
Build video streaming and media applications with Shelby Protocol media packages. Use when working with @shelby-protocol/player for video playback (React video player component, Shaka Player integration, playback controls) or @shelby-protocol/media-prepare for transcoding video/audio with FFmpeg, CMAF packaging for DASH/HLS adaptive streaming, or Widevine DRM encryption.
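TVC-grade (television-commercial-grade) kinetic subtitle design. Generates animated text treatments synchronized to the rhythm of the music, so that the subtitles themselves become a visual element rather than an informational afterthought.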
Web Audio API for JARVIS audio feedback and voice processing
Store and transform images with Cloudflare Images API and transformations. Use when: uploading images, implementing direct creator uploads, creating variants, generating signed URLs, optimizing formats (WebP/AVIF), transforming via Workers, or debugging CORS, multipart, or error codes 9401-9413.
Answer questions about orkid's Singularity synthesizer engine, DSP processing, oscillators, filters, envelopes, modulation, sampling, FM synthesis, audio output, and program/bank structure. Use when the user asks about audio, synth, DSP, or sound.
Transcribe audio/video to accurate subtitles using Whisper AI, with optional translation and delivery. Supports YouTube URLs and local audio/video files. Use when: (1) a YouTube video has no subtitles, (2) auto-generated captions are inaccurate, (3) the user wants high-quality transcription, (4) the user needs translated subtitles, (5) the user wants transcripts sent to email or cloud storage. Triggers: "轉錄", "語音轉文字", "Whisper", "沒有字幕", "字幕不準", "transcribe", "speech to text", "no subtitles", "bad captions", "翻譯字幕", "translate subtitles", "寄到信箱", "上傳到雲端". Make sure to use this skill whenever the user needs transcription beyond what YouTube auto-captions provide, or when yt-search reports no subtitles available.
Use when organizing media files (movies, TV, anime) on NAS or local storage - cleaning junk files, merging scattered episodes, normalizing folder names to "Title (Year)" format, and verifying episode completeness against TMDB
Answer questions about orkid's compositor system, CompositingData/Scene/Technique, render nodes (Forward/Unlit/Picking), post-FX nodes (ACES/HSVG/Bloom/User), output nodes (Screen/RtGroup/VR/File), presets (ForwardPBR/Unlit/Picking), RtGroup render targets, and Python compositor bindings. Use when the user asks about compositing, post-processing, render targets, or rendering presets.
Generate or edit images with Nano Banana Pro (Gemini 3.1 Flash Image). Use for requests to create or modify images, including edits. Supports text-to-image and image-to-image at 1K, 2K, or 4K; pass a source image with --input-image.
End-to-end long-form video cloning pipeline. Downloads a YouTube video, transcribes it, generates HeyGen avatar chunks, detects PIP vs. fullscreen vs. no-face segments, precisely locates webcam bubbles, and composites the avatar overlay with lip-synced audio. Handles screen recordings with PIP webcam bubbles as well as fullscreen talking-head footage.
Clone Kevin's videos as Kev's Assistant using AI face swap + voice clone + WAN 2.2 animate. Handles long videos by splitting into 5s chunks, processing each through the pipeline, and stitching back together. Trigger when asked to clone a video, create a Kev's Assistant version, or convert Kevin's content into Kev's Assistant content.
Converts YouTube videos into long-form (10-14 minute) landscape avatar videos using HeyGen AI clone. Analyzes video with Gemini, generates full Alex Hormozi-style script condensed to max 2000 words, creates 1920x1080 landscape avatar video. Trigger when asked to create long-form avatar videos, clone long YouTube videos, or make HeyGen long-form content.
Converts YouTube videos into short-form (60-second) avatar videos using HeyGen AI clone. Analyzes video with Gemini, generates Alex Hormozi-style script, creates vertical 1080x1920 avatar video with social media caption. Trigger when asked to create clone shorts, convert YouTube to avatar video, or make HeyGen short-form content.
Convert .cast recordings to .txt for analysis. TRIGGERS - convert cast, cast to txt, strip ANSI, batch convert.
Asciinema v3 .cast file format reference. TRIGGERS - cast format, asciicast spec, event codes, parse cast file.