SKILL·AF405B

model-pruning

Name: model-pruning
Author: davila7

davila7

Aktualisiert 2 months ago

158 Ansichten

18,478

1,685

18,478

Auf GitHub ansehen

AndereEmerging TechniquesModel PruningWandaSparseGPTSparsityModel CompressionN:M SparsityOne-Shot PruningStructured PruningUnstructured PruningFast Inference

Über

Diese Fähigkeit bietet One-Shot-Pruning-Techniken wie Wanda und SparseGPT, um LLMs ohne erneutes Training zu komprimieren und die Modellgröße um 40–60 % bei minimalem Genauigkeitsverlust zu reduzieren. Sie ermöglicht schnellere Inferenz auf Hardwarebeschleunigern durch die Implementierung verschiedener Sparsity-Muster, einschließlich unstrukturiertem, strukturiertem und N:M-Pruning. Nutzen Sie sie, um Modelle auf ressourcenbeschränkter Hardware bereitzustellen oder 2–4× schnellere Inferenz zu erreichen.

Schnellinstallation

Claude Code

GitHub Repository

davila7/claude-code-templates

Pfad: cli-tool/components/skills/ai-research/emerging-techniques-model-pruning

anthropicanthropic-claudeclaudeclaude-code

FAQ

Frequently asked questions

What is the model-pruning skill?

model-pruning is a Claude Skill by davila7. Skills package instructions and resources that Claude loads on demand, so Claude can perform model-pruning-related tasks without extra prompting.

How do I install model-pruning?

Use the install commands on this page: add model-pruning to Claude Code as a plugin, or clone its repository into your skills directory, then restart Claude so it picks up the skill.

What category does model-pruning belong to?

model-pruning is in the Other category, tagged Emerging Techniques, Model Pruning, Wanda, SparseGPT, Sparsity and Model Compression.

Is model-pruning free to use?

Yes. model-pruning is listed on AIMCP and free to install. It runs inside Claude, so no separate service account is required to use the skill itself.