document-conversion
This skill should be used when users need to convert PDFs (especially with tables or figures), mentions 'convert', 'PDF', 'document processing', has complex academic papers to import, or asks about MinerU vs Markdownify.
اپنے ایجنٹ کے لیے موزوں صلاحیت تلاش کریں۔
This skill should be used when users need to convert PDFs (especially with tables or figures), mentions 'convert', 'PDF', 'document processing', has complex academic papers to import, or asks about MinerU vs Markdownify.
Merge a cover image into a PDF book while preserving aspect ratio and matching width. Use when the user wants to "merge cover", "combine pdf", "fix cover size", or "add cover image".
Parse bank statement PDF text into structured transaction data with account information and transactions in consistent JSON format. Works with any bank format. Use when you need to extract or parse transactions from PDF bank statements.
Manipulate PDF files in Python: extract text/tables, create, merge, split, and fill forms. WHEN: Extracting text from PDFs, creating PDF documents, merging/splitting PDFs, filling PDF forms, working with pdfplumber/PyPDF2/reportlab. WHEN NOT: Word documents (use python-docx), Excel files (use python-xlsx), image processing.
Convert flat/scanned PDF documents into interactive fillable PDF forms using OCR-based field detection with AWS Textract and AI vision validation. Use when users need to: (1) Transform static PDFs into editable forms, (2) Detect and convert form fields (text inputs, checkboxes, signatures) automatically, (3) Digitize paper-based forms for electronic completion, (4) Validate form field detection accuracy, or (5) Batch convert multiple similar forms while preserving original styling and layout.
Extract and process content from PDFs and DOCX files. Handles large files, OCR for scanned documents, page splitting, and markdown conversion. Use when: (1) Processing PDF references in notes, (2) Extracting text from large documents for analysis, (3) Converting DOCX to markdown, (4) Handling scanned/image PDFs with OCR, (5) Integrating with Obsidian or note-taking workflows, (6) Splitting large documents into manageable chunks. Invoke with: /process-document, /extract-pdf, /extract-docx, or say "use document-processor skill to..."
Transcribe handwritten multi-source research notes from reMarkable PDFs or paper photos. Trigger phrase "process deep dive notes". Handles research compilations with multiple sources per document. Converts handwriting to markdown, applies formatting conventions, generates metadata, creates AI summaries, and appends quotes to a central quotes file. Notes may be in Czech or English.
Capture PDF books to ai-brain sources. USE WHEN user wants to capture pdf, import pdf, add pdf book, process pdf for embeddings OR /capture-pdf OR /pdf.
Process large volumes of bank statements (50+ PDFs) in batches with checkpointing and progress tracking. Orchestrates the standard ingestion skill across multiple batches for resumable processing.
Re-index all PDF and HTML documents, update index.html, and commit/push changes to the repository
Merge multiple PDF files into single document with customizable options. Supports page selection, bookmarks, and metadata. Use when combining PDFs, creating documents from multiple sources, or organizing PDF collections.
Convert documents and files to Markdown using markitdown. Use when converting PDF, Word (.docx), PowerPoint (.pptx), Excel (.xlsx, .xls), HTML, CSV, JSON, XML, images (with EXIF/OCR), audio (with transcription), ZIP archives, remote URLs (including YouTube), or EPubs to Markdown format for LLM processing or text analysis.
Extract text and tables from PDF files, fill forms, merge documents.
Extract text and images from PDF files, analyze diagrams and architecture charts, and convert visual diagrams to Mermaid format. Use when the user wants to read, analyze, or convert PDF documents, especially those containing diagrams, flowcharts, or architecture visualizations.
Universal document conversion using Pandoc with focus on DOCX format. Use when converting documents to or from DOCX format, including Markdown, HTML, PDF, and other formats. Supports: (1) Converting other formats to DOCX (Markdown, HTML, reStructuredText, etc.), (2) Converting DOCX to other formats, (3) Custom DOCX templates and styling, (4) Batch document processing
Send documents to Kindle via email. Use when user wants to send files to Kindle, read documents on Kindle, transfer ebooks, or send PDFs/EPUBs to their e-reader.
Transform YouTube transcripts into comprehensive lecture notes with PDF and HTML outputs