Production-ready LLM compression/quantization toolkit with hardware-accelerated inference support for both CPU and GPU via HF, vLLM, and SGLang.
gptqmodel: low health (65/100), consider alternatives
Get this data programmatically — free, no authentication.
curl https://depscope.dev/api/check/pypi/gptqmodel
Last updated · 2026-06-08T08:07:58.601760Z
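The curl request above can also be issued from Python using only the standard library. This is a minimal sketch, not an official client. It assumes the endpoint follows the pattern /api/check/<ecosystem>/<package>, inferred from the single URL shown, and that the response body is JSON, which the page does not state. The names check_url and fetch_check are hypothetical helpers introduced for illustration.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; the general
# /api/check/<ecosystem>/<package> pattern is an assumption.
BASE_URL = "https://depscope.dev/api/check"


def check_url(ecosystem: str, package: str) -> str:
    """Build the check URL for a package (hypothetical helper)."""
    # Percent-encode each path segment so unusual names stay valid.
    return f"{BASE_URL}/{quote(ecosystem, safe='')}/{quote(package, safe='')}"


def fetch_check(ecosystem: str, package: str, timeout: float = 10.0):
    """Fetch the check data and parse it.

    Assumes the endpoint returns a JSON body; the response schema
    is not documented in the source, so no fields are accessed here.
    """
    url = check_url(ecosystem, package)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling fetch_check("pypi", "gptqmodel") would request the same URL as the curl command. Because the response fields are unknown, inspect the returned object before relying on any particular key.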