SKILL·ED1C2B

quantizing-models-bitsandbytes

Name: quantizing-models-bitsandbytes
Author: davila7

davila7

更新于 2 months ago

927 次查看

18,478

1,685

18,478

在 GitHub 上查看

其他OptimizationBitsandbytesQuantization8-Bit4-BitMemory OptimizationQLoRANF4INT8HuggingFaceEfficient Inference

关于

This skill quantizes LLMs to 8-bit or 4-bit precision using bitsandbytes, achieving 50-75% memory reduction with minimal accuracy loss. It's ideal for running larger models on limited GPU memory or accelerating inference, supporting formats like INT8, NF4, and FP4. The skill integrates with HuggingFace Transformers and enables QLoRA training and 8-bit optimizers.

快速安装

Claude Code

GitHub 仓库

davila7/claude-code-templates

路径: cli-tool/components/skills/ai-research/optimization-bitsandbytes

anthropicanthropic-claudeclaudeclaude-code

FAQ

Frequently asked questions

What is the quantizing-models-bitsandbytes skill?

quantizing-models-bitsandbytes is a Claude Skill by davila7. Skills package instructions and resources that Claude loads on demand, so Claude can perform quantizing-models-bitsandbytes-related tasks without extra prompting.

How do I install quantizing-models-bitsandbytes?

Use the install commands on this page: add quantizing-models-bitsandbytes to Claude Code as a plugin, or clone its repository into your skills directory, then restart Claude so it picks up the skill.

What category does quantizing-models-bitsandbytes belong to?

quantizing-models-bitsandbytes is in the Other category, tagged Optimization, Bitsandbytes, Quantization, 8-Bit, 4-Bit and Memory Optimization.

Is quantizing-models-bitsandbytes free to use?

Yes. quantizing-models-bitsandbytes is listed on AIMCP and free to install. It runs inside Claude, so no separate service account is required to use the skill itself.