ptq

Name: ptq
Author: NVIDIA

This skill should be used when the user asks to "quantize a model", "run PTQ", "post-training quantization", "NVFP4 quantization", "FP8 quantization", "INT8 quantization", "INT4 AWQ", "quantize LLM", "quantize MoE", "quantize VLM", or needs to produce a quantized HuggingFace or TensorRT-LLM checkpoint from a pretrained model using ModelOpt.

عرض المصدر machine-learning

maintainer

NVIDIA

آخر تحديث 4/11/2026

النجوم

2429

التفرعات

344

quick start

Installation and usage

التثبيت

$ install --globalskills.sh

الاستخدام

بعد التثبيت، يمكنك استخدام هذه المهارة بتشغيل الأمر التالي في الطرفية:

skills use ptq