gguf-quantization

Name: gguf-quantization
Author: NousResearch

GGUF format and llama.cpp quantization for efficient CPU/GPU inference. Use when deploying models on consumer hardware, Apple Silicon, or when needing flexible quantization from 2-8 bit without GPU requirements.

查看源码 computational-chemistry

maintainer

NousResearch

更新于 3/9/2026

星标

54282

分支

7115

quick start

Installation and usage

安装

$ install --globalskills.sh

使用

安装后，您可以通过在终端运行以下命令来使用此技能：

skills use gguf-quantization