gguf-quantization

Stars: 4

GGUF format and llama.cpp quantization for efficient CPU and GPU inference. Use it when deploying models on consumer hardware or Apple Silicon, or when you need flexible 2-bit to 8-bit quantization that does not require a GPU.
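As a rough illustration of why the 2-bit to 8-bit range matters on consumer hardware, the sketch below estimates the nominal size of a model's weights at a given bit width. It is a back-of-the-envelope calculation, not part of this skill or of llama.cpp: the function name `estimate_weight_bytes` and the 7-billion-parameter figure are hypothetical, and real GGUF files are somewhat larger because quantized blocks also store scale data and the file carries metadata.

```python
def estimate_weight_bytes(n_params: int, bits_per_weight: float) -> float:
    """Nominal size of the weights in bytes: parameters * bits / 8.

    Hypothetical helper for illustration only. It ignores per-block
    scale overhead and file metadata, so treat the result as a lower bound.
    """
    if n_params < 0 or bits_per_weight <= 0:
        raise ValueError("n_params must be >= 0 and bits_per_weight must be > 0")
    return n_params * bits_per_weight / 8


if __name__ == "__main__":
    n_params = 7_000_000_000  # assumed example: a 7B-parameter model
    for bits in (2, 4, 8, 16):
        gib = estimate_weight_bytes(n_params, bits) / 2**30
        print(f"{bits:>2}-bit: about {gib:.1f} GiB of weights")
```

Comparing the 16-bit row with the lower bit widths gives a first-order sense of whether a model is likely to fit in the RAM or unified memory of a given machine.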

By tools-only. Updated 2/4/2026

Install via CLI

npx skills add https://github.com/tools-only/X-Skills --skill gguf-quantization

Repository Details

Stars: 4

Forks: 0

Branch: main

Path: SKILL.md