home/categories/debugging/nvidia-tensorrt-llm-claude-skills-perf-analysis-skill-md
debuggingtools

perf-analysis

Performance analysis coordination workflow. Guides profiling delegation, bottleneck classification (compute/memory/launch/communication/sync), and structured report generation. Use when the user asks to analyze performance, profile a workload, check MFU/SOL, or diagnose bottlenecks.

NVIDIA
maintainer
NVIDIA
Updated 4/8/2026
Stars
13335
Forks
2271
quick start

Installation and usage

Performance analysis coordination workflow. Guides profiling delegation, bottleneck classification (compute/memory/launch/communication/sync), and structured report generation. Use when the user asks to analyze performance, profile a workload, check MFU/SOL, or diagnose bottlenecks.

Installation
$ install --globalskills.sh
Usage

Once installed, you can use this skill by running the following command in your terminal:

skills use perf-analysis