home/categories/framework-internals/a5c-ai-babysitter-library-specializations-gpu-programming-skills-cutlass-triton-skill-md
framework-internalsdevelopment

cutlass-triton

High-performance kernel template libraries and DSLs. Generate CUTLASS GEMM configurations, implement Triton kernel definitions, configure epilogue operations, tune tile sizes and warp arrangements, and benchmark against cuBLAS.

a5c-ai
maintainer
a5c-ai
更新於 3/25/2026
星標
538
分支
33
quick start

Installation and usage

High-performance kernel template libraries and DSLs. Generate CUTLASS GEMM configurations, implement Triton kernel definitions, configure epilogue operations, tune tile sizes and warp arrangements, and benchmark against cuBLAS.

安裝
$ install --globalskills.sh
使用

安裝後,您可以通過在終端運行以下命令來使用此技能:

skills use cutlass-triton