Content & Media

CMS, document processing, and media generation.

7032 skills

media

6K

bibi

BibiGPT CLI for summarizing videos, audio, and podcasts directly in the terminal. Use when the user wants to summarize a URL (YouTube, Bilibili, podcast, etc.) or check their BibiGPT authentication status. Requires the BibiGPT desktop app installed with an active login session, or a BIBI_API_TOKEN environment variable set.

JimmyLv

content-media

documents

5.9K

fastexcel-to-fesod

Migrates a Java project from FastExcel 1.3 (cn.idev.excel:fastexcel:1.3.0) to Apache Fesod (org.apache.fesod:fesod-sheet:2.0.1-incubating). Invoke this skill when asked to "migrate FastExcel to Fesod", "upgrade to Apache Fesod", or "replace cn.idev.excel". FastExcel 1.3 is the direct predecessor of Apache Fesod. Supports both legacy namespaces seen in real projects: cn.idev.excel.* and org.apache.fesod.excel.*. The entry classes FastExcel and FastExcelFactory are kept as @Deprecated aliases in Fesod, so call-site renames are strongly recommended but NOT required for compilation. The only breaking change is the Java package prefix.

apache

content-media

content-creation

5.9K

humanizer-zh

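Removes traces of AI generation from text. Use it when editing or reviewing text to make it sound more natural and more like human writing. Based on Wikipedia's comprehensive "Signs of AI writing" guide. Detects and fixes the following patterns: inflated symbolism, promotional language, superficial analyses ending in -ing, vague attributions, overuse of em dashes, the rule of three, AI vocabulary, negative parallelisms, and excessive connective phrases.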

op7418

content-media

documents

5.8K

glmocr-formula

Official skill for recognizing and extracting mathematical formulas from images and PDFs into LaTeX format using ZhiPu GLM-OCR API. Supports complex equations, inline formulas, and formula blocks. Use this skill when the user wants to extract formulas, convert formula images to LaTeX, or OCR mathematical expressions.

zai-org

content-media

documents

5.8K

glmocr-handwriting

Official skill for recognizing handwritten text from images using ZhiPu GLM-OCR API. Supports various handwriting styles, languages, and mixed handwritten/printed content. Use this skill when the user wants to read handwritten notes, convert handwriting to text, or OCR handwritten documents.

zai-org

content-media

documents

5.8K

glmocr

Extract text from images using GLM-OCR API. Supports images and PDFs with high accuracy OCR, table recognition, formula extraction, and handwriting recognition. Use this skill whenever the user wants to extract text from images, perform OCR on pictures, scan documents, convert images to text, or process any image files to get their textual content.

zai-org

content-media

documents

5.8K

glmocr-table

Official skill for recognizing and extracting tables from images and PDFs into Markdown format using ZhiPu GLM-OCR API. Supports complex tables, merged cells, and multi-page documents. Use this skill when the user wants to extract tables, recognize spreadsheets, or convert table images to editable format.

zai-org

content-media

documents

5.8K

glmocr

Trigger when: (1) the user wants to extract text, tables, formulas, or structured data from images, PDFs, or scanned documents; (2) the user mentions "OCR", "文字识别" (text recognition), or "文档解析" (document parsing); (3) the user has a document (screenshot, scanned page, invoice, paper, whiteboard photo) and needs its content in structured form; (4) the user asks to parse, digitize, or extract content from a visual document. Invokes the GLM-OCR SDK (pip install glmocr) to parse documents via Zhipu's cloud API. No GPU required. Returns structured JSON (regions with labels and bounding boxes) and Markdown. The agent can operate entirely via the CLI; no YAML files are needed. NOT for: real-time camera feeds, audio transcription, or non-document images (photos, illustrations).

zai-org

content-media

documents

5.6K

model-sample-image-export

Export, validate, and publish model sample-result images into docs/source/images and reference them from README/docs pages. Use when model sample images are missing, outdated, or suspected to be invalid.

open-edge-platform

content-media

documents

5.5K

loro

Comprehensive guide for using Loro across document modeling, synchronization, versioning, rich text editors, app-state mirroring, performance tradeoffs, and wasm bindings. Use when Codex needs to work with `loro-crdt`, `loro`, `loro-prosemirror`, `loro-mirror`, or `crates/loro-wasm` for: (1) Choosing CRDT container types and document structure, (2) Designing sync, persistence, checkout, or history workflows, (3) Integrating rich-text editors and stable selections, (4) Mirroring app state with schemas and React, (5) Reasoning about versions, events, import status, or Inspector output, or (6) Maintaining the WASM binding layer.

loro-dev

content-media

documents

5.3K

summarize

Summarize or extract text/transcripts from URLs, podcasts, and local files (great fallback for “transcribe this YouTube/video”).

clawdbot

content-media

documents

5.3K

nano-pdf

Edit PDFs with natural-language instructions using the nano-pdf CLI.

clawdbot

content-media

media

5.3K

video-frames

Extract frames or short clips from videos using ffmpeg.

clawdbot

content-media

media

5.3K

songsee

Generate spectrograms and feature-panel visualizations from audio with the songsee CLI.

clawdbot

content-media

media

5.3K

gifgrep

Search GIF providers with CLI/TUI, download results, and extract stills/sheets.

clawdbot

content-media

media

5.3K

camsnap

Capture frames or clips from RTSP/ONVIF cameras.

clawdbot

content-media

content-creation

5.2K

prd

Generate a Product Requirements Document (PRD) for a new feature. Use when planning a feature, starting a new project, or when asked to create a PRD. Triggers on: create a prd, write prd for, plan this feature, requirements for, spec out.

snarktank

content-media

documents

5.2K

dashboard

Use when reading, editing, or creating files in dlt/_workspace/helpers/dashboard/ or tests/workspace/helpers/dashboard/ or tests/e2e/

dlt-hub

content-media

documents

5.2K

docs-workflow

End-to-end workflow for PR documentation — check, write, review. Use at any stage of documenting PR changes.

grafana

content-media

documents

5.2K

general-add-localization

Add localization keys and use them in elements or controllers. Use when adding user-facing text that should be translatable — labels, descriptions, error messages, button text, status text, or any string shown in the backoffice UI.

umbraco

content-media

documents

5.1K

geo-llmstxt

Analyzes and generates llms.txt files, the emerging standard for helping AI systems understand website structure and content. Can validate existing llms.txt files or generate new ones from scratch by crawling the site.

zubair-trabzada

content-media

documents

5K

translate-doc-zh

Translate an English document under `docs/en/` into the matching Chinese document under `docs/zh/`.

inclusionAI

content-media

documents

5K

translate

Translate new or untranslated i18n strings from English core.json to all other locale files, maintaining consistency with each language's existing translations.

tagspaces

content-media

content-creation

5K

article-writer

Multi-style article creation skill. Supports 5 writing styles (deep analysis, practical guide, story-driven, opinion, news brief) and a complete workflow: material collection → outline → content → formatting. Activated when users mention "write article", "write post", "create", or "draft".

netease-youdao

content-media
