free-vision

Handle vision/image tasks (read, describe, analyze images) by calling Gemini CLI or Qwen Code CLI from the shell. Use for requests to interpret or describe images, extract visible text, or summarize visual content; prefer Gemini and fall back to Qwen if Gemini fails or is too generic.

Maintainer: thanhtunguet
Updated: 1/16/2026
Stars: 0
Forks: 0
Quick start

Installation and usage

Installation
$ install --globalskills.sh
Usage

Once installed, you can use this skill by running the following command in your terminal:

skills use free-vision
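
The "prefer Gemini, fall back to Qwen" behavior in the description could be sketched roughly as follows. This is a minimal illustration, not the skill's actual implementation. It assumes that both CLIs are on the PATH as `gemini` and `qwen`, that each accepts a one-shot prompt via `-p`, and that an `@path` reference in the prompt attaches a local file; check these against the versions you have installed. The helper names (`build_command`, `run_cli`, `looks_generic`, `describe_image`) and the length-based "too generic" check are hypothetical.

```python
import subprocess
from typing import Callable, Sequence, Tuple


def build_command(tool: str, image_path: str, question: str) -> list:
    # ASSUMPTION: the CLI takes a one-shot prompt via -p and resolves
    # an @path reference to a local file. Verify for your installed version.
    return [tool, "-p", f"{question} @{image_path}"]


def run_cli(cmd: Sequence[str]) -> Tuple[int, str]:
    # Run the CLI and return (exit code, trimmed stdout).
    proc = subprocess.run(cmd, capture_output=True, text=True, timeout=120)
    return proc.returncode, proc.stdout.strip()


def looks_generic(answer: str, min_chars: int = 40) -> bool:
    # Hypothetical heuristic: treat very short answers as "too generic".
    return len(answer) < min_chars


def describe_image(
    image_path: str,
    question: str = "Describe this image in detail.",
    runner: Callable[[Sequence[str]], Tuple[int, str]] = run_cli,
    tools: Sequence[str] = ("gemini", "qwen"),
) -> str:
    fallback = ""  # first answer that succeeded but looked generic
    for tool in tools:
        try:
            code, out = runner(build_command(tool, image_path, question))
        except (OSError, subprocess.TimeoutExpired):
            continue  # tool missing or hung: try the next one
        if code == 0 and out:
            if not looks_generic(out):
                return out
            fallback = fallback or out
    if fallback:
        return fallback
    raise RuntimeError("no vision CLI produced an answer")
```

Passing the runner as a parameter keeps the fallback logic testable without either CLI installed. For real use, a call such as `describe_image("screenshot.png", "Extract all visible text.")` would try Gemini first and move on to Qwen only if Gemini fails or returns a very short answer.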