home/categories/framework-internals/a5c-ai-babysitter-library-specializations-gpu-programming-skills-cuda-graphs-skill-md
framework-internalsdevelopment

cuda-graphs

Expert skill for CUDA Graph capture and optimization for reduced launch overhead. Capture CUDA operations into graphs, instantiate and execute graph instances, update graph node parameters, profile graph vs stream execution, design graph-friendly kernel patterns, and optimize launch latency for inference.

a5c-ai
maintainer
a5c-ai
更新於 3/25/2026
星標
538
分支
33
quick start

Installation and usage

Expert skill for CUDA Graph capture and optimization for reduced launch overhead. Capture CUDA operations into graphs, instantiate and execute graph instances, update graph node parameters, profile graph vs stream execution, design graph-friendly kernel patterns, and optimize launch latency for inference.

安裝
$ install --globalskills.sh
使用

安裝後,您可以通過在終端運行以下命令來使用此技能:

skills use cuda-graphs