home/categories/containers/project-hami-hami-skill-k8s-debug-gpu-pod-skill-md
containersdevops

k8s-gpu-pod-troubleshooter

A comprehensive diagnostic skill for troubleshooting GPU pod scheduling and allocation issues in Kubernetes clusters using HAMi (Heterogeneous AI Computing Virtualization Middleware). It identifies GPU resource constraints, webhook configuration problems, device plugin issues, and scheduler policy misconfigurations to provide actionable remediation guidance.

Project-HAMi
maintainer
Project-HAMi
اپ ڈیٹ ہوا 3/2/2026
اسٹارز
3256
فورکس
506
quick start

Installation and usage

A comprehensive diagnostic skill for troubleshooting GPU pod scheduling and allocation issues in Kubernetes clusters using HAMi (Heterogeneous AI Computing Virtualization Middleware). It identifies GPU resource constraints, webhook configuration problems, device plugin issues, and scheduler policy misconfigurations to provide actionable remediation guidance.

انسٹالیشن
$ install --globalskills.sh
استعمال

انسٹال کرنے کے بعد، آپ یہ اسکل ٹرمینل میں درج ذیل کمانڈ چلا کر استعمال کر سکتے ہیں:

skills use k8s-gpu-pod-troubleshooter