model-pruning

Name: model-pruning
Author: Orchestra-Research

Reduce LLM size and accelerate inference using pruning techniques like Wanda and SparseGPT. Use when compressing models without retraining, achieving 50% sparsity with minimal accuracy loss, or enabling faster inference on hardware accelerators. Covers unstructured pruning, structured pruning, N:M sparsity, magnitude pruning, and one-shot methods.

Voir le code source machine-learning

maintainer

Orchestra-Research

Mis à jour 11/20/2025

Étoiles

6563

Forks

515

quick start

Installation and usage

Installation

$ install --globalskills.sh

Utilisation

Après l'installation, vous pouvez utiliser ce skill en exécutant la commande suivante dans votre terminal :

skills use model-pruning