home/categories/containers/vllm-project-vllm-skills-plugins-vllm-skills-skills-vllm-deploy-k8s-skill-md

containersdevops

vllm-deploy-k8s

Name: vllm-deploy-k8s
Author: vllm-project

Deploy vLLM to Kubernetes (K8s) with GPU support, health probes, and OpenAI-compatible API endpoint. Use this skill whenever the user wants to deploy, run, or serve vLLM on a Kubernetes cluster, including creating deployments, services, checking existing deployments, or managing vLLM on K8s.

View Source containers

maintainer

vllm-project

Updated 4/3/2026

Stars

Forks

quick start

Installation and usage

Installation

$ install --globalskills.sh

Usage

Once installed, you can use this skill by running the following command in your terminal:

skills use vllm-deploy-k8s