SKILL·268382

qdrant-scaling-query-volume

Name: qdrant-scaling-query-volume
Author: qdrant

qdrant

更新日 1 month ago

2 閲覧

209

GitHubで表示

デザインdesign

について

このClaudeスキルは、大量のクエリボリュームとページネーションを処理するためのQdrant最適化戦略を提供します。特に、ポアソン分布に基づくサブサンプリングを実装することで、複数のシャードにまたがる高リミットクエリのパフォーマンス問題に対処します。シャード化されたQdrantデプロイメントにおいて、スクロールパフォーマンス、大規模な結果セット、または高カーディナリティクエリを扱う際に、このスキルをご利用ください。

クイックインストール

Claude Code

推奨

メイン

npx skills add qdrant/skills -a claude-code

プラグインコマンド代替

/plugin add https://github.com/qdrant/skills

Git クローン代替

git clone https://github.com/qdrant/skills.git ~/.claude/skills/qdrant-scaling-query-volume

このコマンドをClaude Codeにコピー＆ペーストしてスキルをインストールします

ドキュメント

Scaling for Query Volume

Problem: When a query has a large limit (e.g. 1000) and there are multiple shards (e.g. 10), naively each shard must return the full 1000 results — totaling 10,000 scored points transferred and merged. This is wasteful since data is randomly distributed across auto-shards.

Core idea

Instead of asking every shard for the full limit, ask each shard for a smaller limit computed via Poisson distribution statistics, then merge. This is safe because auto-sharding guarantees random, independent data distribution.

When it activates

More than 1 shard
Auto-sharding is in use (all queried shards share the same shard key)
The request's limit + offset >= SHARD_QUERY_SUBSAMPLING_LIMIT (128)
The query is not exact

Key tradeoff

The strategy trades a small probability of slightly incomplete results for a large reduction in inter-shard data transfer, especially for high-limit queries across many shards. The 1.2x safety factor and the 99.9% Poisson threshold keep the error rate very low — comparable to inaccuracies already introduced by approximate vector indices like HNSW.

GitHub リポジトリ

qdrant/skills

パス: skills/qdrant-scaling/scaling-query-volume

agent-skillsai-agentsclaude-codecodexcursorembeddings

FAQ

よくある質問