home/categories/productivity-tools/answerzhao-agent-skills-glm-skills-vlm-skill-md
productivity-toolstools

vlm

Implement vision-based AI chat capabilities using the z-ai-web-dev-sdk. Use this skill when the user needs to analyze images, describe visual content, or create applications that combine image understanding with conversational AI. Supports image URLs and base64 encoded images for multimodal interactions.

AnswerZhao
maintainer
AnswerZhao
Updated 1/15/2026
Stars
22
Forks
13
quick start

Installation and usage

Implement vision-based AI chat capabilities using the z-ai-web-dev-sdk. Use this skill when the user needs to analyze images, describe visual content, or create applications that combine image understanding with conversational AI. Supports image URLs and base64 encoded images for multimodal interactions.

Installation
$ install --globalskills.sh
Usage

Once installed, you can use this skill by running the following command in your terminal:

skills use vlm