containers, devops

kaito-inference

Name: kaito-inference
Author: kaito-project

Help users deploy LLM models to Kubernetes using the KAITO kubectl plugin. Use this skill whenever the user mentions deploying an LLM, AI model, or language model to Kubernetes, or asks about KAITO, kaito workspaces, GPU inference on k8s, or wants to run models like Llama, Phi, Mistral, DeepSeek, Falcon, Qwen, or Gemma on a Kubernetes cluster. Also trigger when the user mentions "kubectl kaito", model serving on k8s, or wants to set up an inference endpoint in Kubernetes — even if they don't say "KAITO" explicitly.

Maintainer: kaito-project

Updated: 3/19/2026

Stars: 919

Forks: 168

Quick start

Installation and usage

Installation

$ install --globalskills.sh

Usage

After installation, run the following command in your terminal to use this skill:

skills use kaito-inference