SKILL·ED1C2B

quantizing-models-bitsandbytes

Name: quantizing-models-bitsandbytes
Author: davila7

davila7

Aktualisiert 2 months ago

927 Ansichten

18,478

1,685

18,478

Auf GitHub ansehen

AndereOptimizationBitsandbytesQuantization8-Bit4-BitMemory OptimizationQLoRANF4INT8HuggingFaceEfficient Inference

Über

Diese Fähigkeit quantisiert LLMs auf 8-Bit- oder 4-Bit-Präzision mittels bitsandbytes und erreicht dabei eine Speicherreduzierung von 50–75 % bei minimalem Genauigkeitsverlust. Sie ist ideal für den Betrieb größerer Modelle mit begrenztem GPU-Speicher oder zur Beschleunigung von Inferenzvorgängen und unterstützt Formate wie INT8, NF4 und FP4. Die Fähigkeit integriert sich in HuggingFace Transformers und ermöglicht QLoRA-Training sowie 8-Bit-Optimierer.

Schnellinstallation

Claude Code

GitHub Repository

davila7/claude-code-templates

Pfad: cli-tool/components/skills/ai-research/optimization-bitsandbytes

anthropicanthropic-claudeclaudeclaude-code

FAQ

Frequently asked questions

What is the quantizing-models-bitsandbytes skill?

quantizing-models-bitsandbytes is a Claude Skill by davila7. Skills package instructions and resources that Claude loads on demand, so Claude can perform quantizing-models-bitsandbytes-related tasks without extra prompting.

How do I install quantizing-models-bitsandbytes?

Use the install commands on this page: add quantizing-models-bitsandbytes to Claude Code as a plugin, or clone its repository into your skills directory, then restart Claude so it picks up the skill.

What category does quantizing-models-bitsandbytes belong to?

quantizing-models-bitsandbytes is in the Other category, tagged Optimization, Bitsandbytes, Quantization, 8-Bit, 4-Bit and Memory Optimization.

Is quantizing-models-bitsandbytes free to use?

Yes. quantizing-models-bitsandbytes is listed on AIMCP and free to install. It runs inside Claude, so no separate service account is required to use the skill itself.