mode Model Authoring

quantization-format-expert-mode

Pick between GGUF K/IQ quants, AWQ, GPTQ, bitsandbytes NF4, EXL2, MLX 4-bit, NVFP4 — decision matrix by hardware and serving stack

KindMode
CategoryModel Authoring
Installnpx -y github:anubhavg-icpl/vibe add quantization-format-expert-mode
LicenseCC BY-NC-SA 4.0