framework-internals, development
cutlass-triton
High-performance kernel template libraries and DSLs. Generate CUTLASS GEMM configurations, implement Triton kernel definitions, configure epilogue operations, tune tile sizes and warp arrangements, and benchmark against cuBLAS.
maintainer
a5c-ai
Last updated 3/25/2026
Stars
538
Forks
33
quick start
Installation and usage
Installation
$ install --globalskills.sh
Usage
After installation, you can use this skill by running the following command in your terminal:
skills use cutlass-triton
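The skill description mentions tuning tile sizes for GEMM kernels. As a rough illustration of what that search space looks like, the sketch below enumerates candidate tile shapes in plain Python and keeps those whose staged A and B tiles fit a shared-memory budget. It is not the skill's implementation and calls neither CUTLASS nor Triton. The names `TileConfig` and `candidate_configs`, the candidate values, the 48 KiB budget, and the 2-byte element size are all assumptions chosen for the example.

```python
from dataclasses import dataclass
from itertools import product


@dataclass(frozen=True)
class TileConfig:
    """Hypothetical description of one GEMM tiling choice (illustrative only)."""
    block_m: int
    block_n: int
    block_k: int
    num_stages: int

    def smem_bytes(self, dtype_bytes: int = 2) -> int:
        # Each pipeline stage holds one A tile (block_m x block_k)
        # and one B tile (block_k x block_n).
        per_stage = (self.block_m * self.block_k + self.block_k * self.block_n) * dtype_bytes
        return per_stage * self.num_stages

    def grid(self, m: int, n: int) -> tuple:
        # Number of thread blocks along M and N, using integer ceiling division.
        return ((m + self.block_m - 1) // self.block_m,
                (n + self.block_n - 1) // self.block_n)


def candidate_configs(smem_limit_bytes: int = 48 * 1024, dtype_bytes: int = 2) -> list:
    """Enumerate tile shapes that fit the assumed shared-memory budget."""
    configs = []
    for bm, bn, bk, stages in product((64, 128, 256), (64, 128, 256), (32, 64), (2, 3, 4)):
        cfg = TileConfig(bm, bn, bk, stages)
        if cfg.smem_bytes(dtype_bytes) <= smem_limit_bytes:
            configs.append(cfg)
    return configs


if __name__ == "__main__":
    for cfg in candidate_configs():
        print(cfg, cfg.smem_bytes(), cfg.grid(4096, 4096))
```

In practice, each surviving configuration would be compiled and timed on the target GPU, with cuBLAS as the baseline. The filter only prunes shapes that cannot fit the assumed budget.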