
huggingface-tokenizers

davila7
Updated 28 days ago
412 views
18,478
1,685
Document Processing, Tokenization, HuggingFace, BPE, WordPiece, Unigram, Fast Tokenization, Rust, Custom Tokenizer, Alignment Tracking, Production

About

This skill provides high-performance tokenization using HuggingFace's Rust-based tokenizers library, which can process 1GB of text in under 20 seconds. It supports the BPE, WordPiece, and Unigram algorithms, and it enables custom tokenizer training and alignment tracking (mapping each token back to its character span in the original text). Use it when you need production-grade tokenization speed or want to build custom tokenizers that integrate with the transformers ecosystem.
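As a rough illustration of the two capabilities named above, custom training and alignment tracking, the following is a minimal sketch using the `tokenizers` Python package. The toy corpus, the vocabulary size of 200, and the `[UNK]` special token are assumptions chosen for the example and do not come from the skill itself.

```python
from tokenizers import Tokenizer
from tokenizers.models import BPE
from tokenizers.pre_tokenizers import Whitespace
from tokenizers.trainers import BpeTrainer

# Toy in-memory corpus (hypothetical data, for illustration only).
corpus = [
    "Fast tokenization matters in production.",
    "Custom tokenizers can be trained on your own text.",
    "Alignment tracking maps tokens back to the original text.",
]

# Build an untrained BPE tokenizer that splits on whitespace and punctuation.
tokenizer = Tokenizer(BPE(unk_token="[UNK]"))
tokenizer.pre_tokenizer = Whitespace()

# Train a small vocabulary directly from the Python iterator.
trainer = BpeTrainer(vocab_size=200, special_tokens=["[UNK]"], show_progress=False)
tokenizer.train_from_iterator(corpus, trainer=trainer)

# Encode a sentence and inspect the character offsets of each token.
text = "Custom tokenization in production."
encoding = tokenizer.encode(text)
for token, (start, end) in zip(encoding.tokens, encoding.offsets):
    # Each offset pair is a (start, end) span into the original string.
    print(f"{token!r:>12} -> text[{start}:{end}] = {text[start:end]!r}")
```

The same pattern should apply to the other algorithms by swapping in the `WordPiece` or `Unigram` model with its matching trainer, although the token strings may then carry prefixes that differ from the raw text span.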

Quick Install

Claude Code

Recommended
Primary method
npx skills add davila7/claude-code-templates -a claude-code
Plugin command (alternative)
/plugin add https://github.com/davila7/claude-code-templates
Git clone (alternative)
git clone https://github.com/davila7/claude-code-templates.git ~/.claude/skills/huggingface-tokenizers

Copy and paste this command into Claude Code to install the skill

GitHub Repository

davila7/claude-code-templates
Path: cli-tool/components/skills/ai-research/tokenization-huggingface-tokenizers
anthropic, anthropic-claude, claude, claude-code

Related Skills