home/categories/framework-internals/a5c-ai-babysitter-library-specializations-gpu-programming-skills-cuda-graphs-skill-md
framework-internalsdevelopment
cuda-graphs
Expert skill for CUDA Graph capture and optimization for reduced launch overhead. Capture CUDA operations into graphs, instantiate and execute graph instances, update graph node parameters, profile graph vs stream execution, design graph-friendly kernel patterns, and optimize launch latency for inference.
maintainer
a5c-ai
Обновлено 3/25/2026
Звёзды
538
Форки
33
quick start
Installation and usage
Expert skill for CUDA Graph capture and optimization for reduced launch overhead. Capture CUDA operations into graphs, instantiate and execute graph instances, update graph node parameters, profile graph vs stream execution, design graph-friendly kernel patterns, and optimize launch latency for inference.
Установка
$ install --globalskills.sh
Использование
После установки вы можете использовать этот skill, выполнив следующую команду в терминале:
skills use cuda-graphs