home/categories/media/bytedance-agentkit-samples-skills-byted-las-vlm-video-skill-md
mediacontent-media

byted-las-vlm-video

Video content understanding operator (las_vlm_video) via Doubao models. Use this skill when user needs to: - Analyze/describe video content with natural language prompts - Ask questions about what happens in a video (objects, actions, scenes) - Summarize video, extract key events, or generate captions Supports public/intranet-accessible video URLs and returns model responses + compression metadata. Requires LAS_API_KEY for authentication.

bytedance
maintainer
bytedance
Updated 3/24/2026
Stars
301
Forks
51
quick start

Installation and usage

Video content understanding operator (las_vlm_video) via Doubao models. Use this skill when user needs to: - Analyze/describe video content with natural language prompts - Ask questions about what happens in a video (objects, actions, scenes) - Summarize video, extract key events, or generate captions Supports public/intranet-accessible video URLs and returns model responses + compression metadata. Requires LAS_API_KEY for authentication.

Installation
$ install --globalskills.sh
Usage

Once installed, you can use this skill by running the following command in your terminal:

skills use byted-las-vlm-video