home/categories/content-media
domain cluster

Content & Media

CMS, document processing, and media generation.

7032 스킬all categories
sorting
stars
current ordering strategy
query
all entries
refine the visible subset
media
6K

bibi

BibiGPT CLI for summarizing videos, audio, and podcasts directly in the terminal. Use when the user wants to summarize a URL (YouTube, Bilibili, podcast, etc.) or check their BibiGPT authentication status. Requires the BibiGPT desktop app installed with an active login session, or a BIBI_API_TOKEN environment variable set.

JimmyLv
JimmyLv
content-media
open
documents
5.9K

fastexcel-to-fesod

Migrates a Java project from FastExcel 1.3 (cn.idev.excel:fastexcel:1.3.0) to Apache Fesod (org.apache.fesod:fesod-sheet:2.0.1-incubating). Invoke this skill when asked to "migrate FastExcel to Fesod", "upgrade to Apache Fesod", or "replace cn.idev.excel". FastExcel 1.3 is the direct predecessor of Apache Fesod. Supports both legacy namespaces seen in real projects: cn.idev.excel.* and org.apache.fesod.excel.*. The entry classes FastExcel and FastExcelFactory are kept as @Deprecated aliases in Fesod, so call-site renames are strongly recommended but NOT required for compilation. The only breaking change is the Java package prefix.

apache
apache
content-media
open
content-creation
5.9K

humanizer-zh

去除文本中的 AI 生成痕迹。适用于编辑或审阅文本,使其听起来更自然、更像人类书写。 基于维基百科的"AI 写作特征"综合指南。检测并修复以下模式:夸大的象征意义、 宣传性语言、以 -ing 结尾的肤浅分析、模糊的归因、破折号过度使用、三段式法则、 AI 词汇、否定式排比、过多的连接性短语。

op7418
op7418
content-media
open
documents
5.8K

glmocr-formula

Official skill for recognizing and extracting mathematical formulas from images and PDFs into LaTeX format using ZhiPu GLM-OCR API. Supports complex equations, inline formulas, and formula blocks. Use this skill when the user wants to extract formulas, convert formula images to LaTeX, or OCR mathematical expressions.

zai-org
zai-org
content-media
open
documents
5.8K

glmocr-handwriting

Official skill for recognizing handwritten text from images using ZhiPu GLM-OCR API. Supports various handwriting styles, languages, and mixed handwritten/printed content. Use this skill when the user wants to read handwritten notes, convert handwriting to text, or OCR handwritten documents.

zai-org
zai-org
content-media
open
documents
5.8K

glmocr

Extract text from images using GLM-OCR API. Supports images and PDFs with high accuracy OCR, table recognition, formula extraction, and handwriting recognition. Use this skill whenever the user wants to extract text from images, perform OCR on pictures, scan documents, convert images to text, or process any image files to get their textual content.

zai-org
zai-org
content-media
open
documents
5.8K

glmocr-table

Official skill for recognizing and extracting tables from images and PDFs into Markdown format using ZhiPu GLM-OCR API. Supports complex tables, merged cells, and multi-page documents. Use this skill when the user wants to extract tables, recognize spreadsheets, or convert table images to editable format.

zai-org
zai-org
content-media
open
documents
5.8K

glmocr

Trigger when: (1) User wants to extract text, tables, formulas, or structured data from images/PDFs/scanned documents, (2) User mentions "OCR", "文字识别", "文档解析", (3) User has a document (screenshot, scanned page, invoice, paper, whiteboard photo) and needs its content in structured form, (4) User asks to parse, digitize, or extract content from a visual document. Invokes the GLM-OCR SDK (pip install glmocr) to parse documents via Zhipu's cloud API. No GPU required. Returns structured JSON (regions with labels + bounding boxes) and Markdown. Agent can operate entirely via CLI — no YAML files needed. NOT for: real-time camera feeds, audio transcription, or non-document images (photos, illustrations).

zai-org
zai-org
content-media
open
documents
5.6K

model-sample-image-export

Export, validate, and publish model sample-result images into docs/source/images and reference them from README/docs pages. Use when model sample images are missing, outdated, or suspected to be invalid.

open-edge-platform
open-edge-platform
content-media
open
documents
5.5K

loro

Comprehensive guide for using Loro across document modeling, synchronization, versioning, rich text editors, app-state mirroring, performance tradeoffs, and wasm bindings. Use when Codex needs to work with `loro-crdt`, `loro`, `loro-prosemirror`, `loro-mirror`, or `crates/loro-wasm` for: (1) Choosing CRDT container types and document structure, (2) Designing sync, persistence, checkout, or history workflows, (3) Integrating rich-text editors and stable selections, (4) Mirroring app state with schemas and React, (5) Reasoning about versions, events, import status, or Inspector output, or (6) Maintaining the WASM binding layer.

loro-dev
loro-dev
content-media
open
documents
5.3K

summarize

Summarize or extract text/transcripts from URLs, podcasts, and local files (great fallback for “transcribe this YouTube/video”).

clawdbot
clawdbot
content-media
open
documents
5.3K

nano-pdf

Edit PDFs with natural-language instructions using the nano-pdf CLI.

clawdbot
clawdbot
content-media
open
media
5.3K

video-frames

Extract frames or short clips from videos using ffmpeg.

clawdbot
clawdbot
content-media
open
media
5.3K

songsee

Generate spectrograms and feature-panel visualizations from audio with the songsee CLI.

clawdbot
clawdbot
content-media
open
media
5.3K

gifgrep

Search GIF providers with CLI/TUI, download results, and extract stills/sheets.

clawdbot
clawdbot
content-media
open
media
5.3K

camsnap

Capture frames or clips from RTSP/ONVIF cameras.

clawdbot
clawdbot
content-media
open
content-creation
5.2K

prd

Generate a Product Requirements Document (PRD) for a new feature. Use when planning a feature, starting a new project, or when asked to create a PRD. Triggers on: create a prd, write prd for, plan this feature, requirements for, spec out.

snarktank
snarktank
content-media
open
documents
5.2K

dashboard

Use when reading, editing, or creating files in dlt/_workspace/helpers/dashboard/ or tests/workspace/helpers/dashboard/ or tests/e2e/

dlt-hub
dlt-hub
content-media
open
documents
5.2K

docs-workflow

End-to-end workflow for PR documentation — check, write, review. Use at any stage of documenting PR changes.

grafana
grafana
content-media
open
documents
5.2K

general-add-localization

Add localization keys and use them in elements or controllers. Use when adding user-facing text that should be translatable — labels, descriptions, error messages, button text, status text, or any string shown in the backoffice UI.

umbraco
umbraco
content-media
open
documents
5.1K

geo-llmstxt

Analyzes and generates llms.txt files -- the emerging standard for helping AI systems understand website structure and content. Can validate existing llms.txt files or generate new ones from scratch by crawling the site.

zubair-trabzada
zubair-trabzada
content-media
open
documents
5K

translate-doc-zh

Translate an English document under `docs/en/` into the matching Chinese document under `docs/zh/`.

inclusionAI
inclusionAI
content-media
open
documents
5K

translate

Translate new or untranslated i18n strings from English core.json to all other locale files, maintaining consistency with each language's existing translations.

tagspaces
tagspaces
content-media
open
content-creation
5K

article-writer

Multi-style article creation skill. Supports 5 writing styles (deep analysis, practical guide, story-driven, opinion, news brief), including complete workflow: material collection → outline → content → formatting. Activated when users mention "write article", "write post", "create", or "draft".

netease-youdao
netease-youdao
content-media
open
Previous
Page 22 / 293
Next