pdf-text-extractor
Extracts text content from one or more PDF documents.
CMS, document processing, and media generation.
Extracts text content from one or more PDF documents.
Create handoff document for transferring work to another session
Translate English content to Thai with cultural adaptation for idioms, formality levels (กระผม/ผม/ครับ/ค่ะ/นะ), honorifics, and context-appropriate register. Use when translating to Thai, adapting cultural references, localizing English content for Thai audiences, or questions about Thai language formality and cultural nuance.
将文章或内容转换为专业的中文可视化HTML网页。当需要生成HTML页面、制作可视化内容展示页、将Markdown或文本转换为网页时使用。
Converts PDF files to markdown text. Use when the user wants to extract text from PDFs, convert PDFs to readable format, or process PDF documents.
Tra cứu thông tin công dân qua các cổng chính phủ Việt Nam
Extract transcripts, generate summaries, create Q&A highlights, and perform deep research from YouTube videos or local media files. Use when the user provides a YouTube URL or local video/audio file path and asks to summarize, digest, analyze, or transcribe media content. Triggers: "video insight", "summarize video", "transcribe audio" + URL or file path.
Convert documents and files to Markdown using markitdown. Use when converting PDF, Word (.docx), PowerPoint (.pptx), Excel (.xlsx, .xls), HTML, CSV, JSON, XML, images (with EXIF/OCR), audio (with transcription), ZIP archives, YouTube URLs, or EPubs to Markdown format for LLM processing or text analysis.
Python library for working with DICOM (Digital Imaging and Communications in Medicine) files. Use this skill when reading, writing, or modifying medical imaging data in DICOM format, extracting pixel data from medical images (CT, MRI, X-ray, ultrasound), anonymizing DICOM files, working with DICOM metadata and tags, converting DICOM images to other formats, handling compressed DICOM data, or processing medical imaging datasets. Applies to tasks involving medical image analysis, PACS systems, radiology workflows, and healthcare imaging applications.
Guide for adding localized text using l10n in app_lib/locale package (project)
Translates JU-DO-KON! PRD sections into implementation plans, code changes, and tests.
Google Docs ドキュメントにテキストを追加する。「ドキュメント更新」「Docs 更新」「ドキュメントに追加」「ドキュメントを編集」などで起動。
Converts PDF files to Markdown format using PyMuPDF, extracting text content and embedded images. Fast and lightweight. Automatically fixes LaTeX umlauts (¨a → ä, etc.) and converts ß to ss (Swiss German). Use when converting PDFs to Markdown, extracting document content, or processing PDF files for text analysis. Generates one .md file and 0..n .png files for images.
Procedures and logic for mapping Carif-Oref (RCO) XML data to Réfugiés.info (RI) structured metadata. Use when processing new RCO fiches to generate the technical frontmatter and mapping table.
Fast file editing via Morph Apply API (10,500 tokens/sec, 98% accuracy)
Extract structured data from web pages using CSS selectors and XPath
Speech-to-text transcription using multiple engines (Whisper, Google Speech, Azure, AssemblyAI). Record audio, transcribe files, real-time transcription, speaker diarization, timestamps, and multi-language support. Use for meeting transcription, voice notes, audio file processing, or accessibility features.
Generate English lesson feedback from audio transcription