Content & Media

CMS, document processing, and media generation.

7032 skills
media
32

ponyflash

Generate images, videos, speech audio, and music using the PonyFlash Python SDK. Also handle local media editing with FFmpeg, including clipping, concatenation, transcoding, audio extraction, frame capture, subtitle capability checks, and ASS subtitle preparation. Use when the user asks to create, generate, produce, edit, trim, merge, concatenate, transcode, subtitle, or render AI-generated media content.

ponyflash
content-media
media
32

youtube

Download content from YouTube including transcripts, captions, subtitles, music, MP3s, and playlists. Use when the user provides a YouTube URL or asks to download, transcribe, or get content from YouTube videos or playlists.

steveclarke
content-media
media
32

carocut-media-audio

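Audio asset generation and retrieval. Batch Edge TTS narration generation (speech rate can be driven by the storyboard pacing field), BGM/SFX retrieval (with BPM rules for matching BGM tempo), and audio duration extraction. Includes Edge voice configuration, speed adjustment rules, the durations.json format specification (including notes on audio_visual_relation), and key audio timing rules.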

bilibili
content-media
documents
31

large-scale-text-editing

Provides strategies for efficiently transforming large text files (thousands to millions of lines) using text editors like Vim, sed, or awk. This skill should be used when tasks involve bulk text transformations, CSV manipulation at scale, pattern-based edits across massive files, or when keystroke/operation efficiency is constrained. Applicable to tasks requiring macros, regex substitutions, or batch processing of structured text data.

letta-ai
content-media
documents
31

code-from-image

Guide for extracting code or pseudocode from images using OCR and implementing it correctly. This skill should be used when tasks involve reading code, pseudocode, or algorithms from images (PNG, JPG, screenshots) and executing or implementing the extracted logic.

letta-ai
content-media
documents
31

extract-moves-from-video

This skill provides guidance for extracting text commands, moves, or typed input from video recordings using OCR. It applies when extracting gameplay commands (e.g., Zork), terminal sessions, or any text-based interactions captured in video format. Use this skill when processing videos of text-based games, terminal recordings, or any scenario requiring OCR-based command extraction from screen recordings.

letta-ai
content-media
documents
31

filter-js-from-html

Guidance for removing JavaScript from HTML while preserving structure and formatting. This skill applies when filtering XSS vectors, sanitizing HTML content, removing script tags and event handlers, or building HTML sanitization tools. It covers comprehensive XSS vector identification, format-preserving transformations, and verification strategies.

letta-ai
content-media
documents
31

financial-document-processor

Guidance for processing, classifying, and extracting data from financial documents (invoices, receipts, statements). This skill should be used when tasks involve OCR extraction, document classification, data validation from financial PDFs/images, or batch processing of financial documents. Covers safe file operations, incremental testing, and data extraction verification.

letta-ai
content-media
documents
31

gcode-to-text

Extracts hidden or encoded text from GCODE files by analyzing toolpath geometry and coordinate data. This skill should be used when tasks involve decoding text from 3D printing files, recovering embossed or engraved text from GCODE, or CTF-style challenges involving GCODE analysis. Applies to any task requiring geometric reconstruction of text from CNC or 3D printer movement commands.

letta-ai
content-media
documents
31

filter-js-from-html

Guidance for filtering JavaScript and XSS attack vectors from HTML while preserving original formatting. This skill should be used when tasks involve removing script content, sanitizing HTML, filtering XSS payloads, or creating security filters that must preserve the original document structure unchanged.

letta-ai
content-media
documents
31

extracting-pdf-text

Extract text from PDFs for LLM consumption. Use when processing PDFs for RAG, document analysis, or text extraction. Supports API services (Mistral OCR) and local tools (PyMuPDF, pdfplumber). Handles text-based PDFs, tables, and scanned documents with OCR.

letta-ai
content-media
documents
31

code-from-image

Guidance for extracting code or pseudocode from images using OCR, then interpreting and implementing it. This skill should be used when tasks involve reading code, pseudocode, or algorithms from image files (PNG, JPG, screenshots) and converting them to executable code. Applies to OCR-based code extraction, image-to-code conversion, and implementing algorithms shown in visual formats.

letta-ai
content-media
documents
31

gcode-to-text

Decode and interpret text content from G-code files by analyzing toolpath geometry and coordinate patterns. This skill should be used when extracting text, letters, or symbols that are encoded as movement commands in G-code files (e.g., 3D printing, CNC engraving, laser cutting). Applies to tasks like identifying what text a G-code file will print/engrave, reverse-engineering embossed or engraved text from toolpaths, or visualizing G-code geometry to reveal hidden content.

letta-ai
content-media
documents
31

regex-log

Guidance for extracting structured data from log files using regular expressions. This skill applies when parsing logs to extract dates, IP addresses, timestamps, or other structured patterns, especially when multiple conditions must be combined (e.g., "find the last date on lines containing an IP"). Use this skill for complex regex construction involving lookaheads, anchors, and pattern composition.

letta-ai
content-media
documents
31

overfull-hbox

Guidance for fixing LaTeX overfull hbox warnings by replacing words with shorter synonyms from an allowed list. This skill applies when tasks involve modifying LaTeX documents to eliminate typographic warnings while adhering to strict word replacement constraints. Use when dealing with synonym-constrained text editing in LaTeX or similar markup languages.

letta-ai
content-media
documents
31

large-scale-text-editing

Guidance for transforming large text files (thousands to millions of rows) using text editors like Vim. This skill should be used when the task involves bulk text transformations, CSV manipulation, column reordering, regex-based find-and-replace operations, or when there are keystroke/efficiency constraints. Applies to tasks requiring macro-based editing, batch substitutions, or complex text processing where understanding the transformation pattern from input/output samples is needed.

letta-ai
content-media
documents
31

extract-moves-from-video

Guidance for extracting text-based game commands, moves, or inputs from video recordings using OCR and frame analysis. This skill applies when extracting user inputs from screen recordings of text-based games (Zork, interactive fiction), terminal sessions, or any video where typed commands need to be recovered. It covers OCR preprocessing, region-of-interest extraction, domain-aware validation, and deduplication strategies.

letta-ai
content-media
documents
31

cobol-modernization

This skill provides guidance for translating COBOL programs to modern languages (Python, Java, etc.) while preserving exact behavior. It should be used when tasks involve COBOL-to-modern-language migration, legacy code translation, fixed-width file format handling, or ensuring byte-level compatibility between source and target implementations.

letta-ai
content-media
media
31

video-processing

Guide for video analysis and frame-level event detection tasks using OpenCV and similar libraries. This skill should be used when detecting events in videos (jumps, movements, gestures), extracting frames, analyzing motion patterns, or implementing computer vision algorithms on video data. It provides verification strategies and helps avoid common pitfalls in video processing workflows.

letta-ai
content-media
media
31

video-processing

This skill provides guidance for video analysis and processing tasks using computer vision techniques. It should be used when analyzing video frames, detecting motion or events, tracking objects, extracting temporal data (e.g., identifying specific frames like takeoff/landing moments), or performing frame-by-frame processing with OpenCV or similar libraries.

letta-ai
content-media
media
31

reshard-c4-data

Guide for implementing reversible data resharding systems with hierarchical constraints (max files/folders per directory, max file size). Use when building compress/decompress scripts that reorganize datasets while maintaining full reconstruction capability.

letta-ai
content-media
design
31

algorithmic-art

Create generative and algorithmic art using code: SVG, p5.js, canvas, and procedural techniques.

RedWoodOG
content-media
design
31

jimeng-image

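Generate images with Jimeng / Doubao Seedream on Volcano Engine Ark. Supports text-to-image, image-to-image, multiple reference images, sequential image sets, batch mode, and workflow mode.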

dracohu2025-cloud
content-media
design
31

qwen-wanx-comic-gen

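Generate comic- or anime-style images with Tongyi Qianwen Wanxiang (wan2.6-t2i). Run this skill when the user makes a request related to comic-style image generation, such as "generate a comic", "draw a comic with Wanxiang", "generate a comic-style image", or "draw an anime character with Qianwen".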

agentbay-ai
content-media