gguf-quantization

Name: gguf-quantization
Author: NousResearch

GGUF format and llama.cpp quantization for efficient CPU/GPU inference. Use when deploying models on consumer hardware or Apple Silicon, or when you need flexible 2- to 8-bit quantization that does not require a GPU.
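
To give a feel for why the 2-8 bit range matters on consumer hardware, here is a rough back-of-the-envelope sketch. It assumes weight storage is simply parameter count times bits per weight; real GGUF files also carry per-block scales and metadata, so actual sizes run somewhat higher. The function name and the 7B parameter count are hypothetical, chosen only for illustration.

```python
def approx_weight_size_gib(n_params: float, bits_per_weight: float) -> float:
    """Rough weight-storage estimate in GiB: parameters x bits / 8 bytes.

    Assumption: ignores per-block scale factors and file metadata,
    so real quantized files will be somewhat larger.
    """
    return n_params * bits_per_weight / 8 / 2**30


if __name__ == "__main__":
    n_params = 7e9  # hypothetical 7B-parameter model
    for bits in (2, 4, 8, 16):
        size = approx_weight_size_gib(n_params, bits)
        print(f"{bits:>2}-bit: ~{size:.1f} GiB")
```

Halving the bits per weight roughly halves the memory needed for weights, which is what makes lower-bit variants attractive when RAM or unified memory is the limiting factor.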


Maintainer: NousResearch
Updated: 3/9/2026
Stars: 54282
Forks: 7115

Quick start

Installation and usage

Installation

$ install --globalskills.sh

Usage

Once installed, you can use this skill by running the following command in your terminal:

skills use gguf-quantization