gguf-quantization

Name: gguf-quantization
Author: Orchestra-Research

GGUF format and llama.cpp quantization for efficient CPU/GPU inference. Use when deploying models on consumer hardware or Apple Silicon, or when you need flexible 2-bit to 8-bit quantization without requiring a GPU.


Maintainer: Orchestra-Research
Updated: 11/25/2025
Stars: 6563
Forks: 515

Quick start

Installation and usage

Installation

$ install --globalskills.sh

Usage

Once installed, you can use this skill by running the following command in your terminal:

skills use gguf-quantization