home/categories/documents/openclaw-skills-skills-ayalili-multimodal-parser-skill-md
documentscontent-media

multimodal-parser

Unified multi-modal content parser for images, PDF, DOCX, audio, auto OCR/transcription, output structured text for LLM processing

openclaw
maintainer
openclaw
Updated 3/13/2026
Stars
4001
Forks
1095
quick start

Installation and usage

Unified multi-modal content parser for images, PDF, DOCX, audio, auto OCR/transcription, output structured text for LLM processing

Installation
$ install --globalskills.sh
Usage

Once installed, you can use this skill by running the following command in your terminal:

skills use multimodal-parser