qdrant-hybrid-search
About
This skill explains how to implement hybrid search in Qdrant, which combines keyword and semantic search by running parallel searches using the `prefetch` feature in the Query API. It guides developers on merging results from sparse and dense vectors to address issues like missing keyword matches. Use it when setting up combined search methods or handling multiple search representations.
Quick Install
Claude Code
Recommendednpx skills add qdrant/skills -a claude-code/plugin add https://github.com/qdrant/skillsgit clone https://github.com/qdrant/skills.git ~/.claude/skills/qdrant-hybrid-searchCopy and paste this command in Claude Code to install this skill
Documentation
Hybrid Search in Qdrant
Hybrid search means running two or more different searches in parallel and combining their results into one.
In Qdrant this is powered by the Query API via prefetch: each prefetch runs exactly one type of search independently, and the outer query combines results from parallel prefetches.
Prefetches can be nested and searches can be multi-stage, all pipeline happening in one request through Query API. See Universal Query API for examples.
Identify the user's problem and pick building blocks:
- What can go into one prefetch, e.g. power one search, in Search Types
- How to combine results of these searches (RRF, DBSF, FormulaQuery, reranking) in Combining Searches
Based on what you've picked, test your approach:
- Configure Qdrant collection with named vectors, where each named vector usually corresponds to one representation (different embedding models or different vector types) of a data point.
- Construct a hybrid search request with Query API from your building blocks. You can search independently among one type of vectors, with
prefetch+using, like shown in examples in Hybrid Queries documentation. - Evaluate hybrid search quality on real user data and provide user with improvements and tradeoffs (speed/resources).
How Isolated Are Parallel Searches?
Use when: different tenants share one collection and you need to understand hybrid search isolation guarantees.
If user wants to isolate/share hybrid search pipelines between tenants, consider that:
- Indexes (sparse, payload and dense) and IDF modifier for sparse vectors are computed independently per shard, not per tenant.
- Prefetch runs independently per shard to retrieve #limit results, so for collection-level prefetches if collection has several shards, Qdrant will always prefetch under the hood #limit * #shard results. Final results are merged based on scores.
- In nested prefetches (deeper than 1 level), methods described in "Combining Searches" might be done on a shard level first, then per-shards results once again will be merged based on scores.
What NOT to Do
- Choose a hybrid search pattern based on "vibes" without any hybrid search quality evaluation in-place.
- Create too many named vectors without a need. An unfilled named vector might take as much resources as a filled one.
GitHub Repository
Related Skills
release-standards
DocumentsThis skill provides semantic versioning (semver) guidelines and changelog formatting standards for software releases. Use it when preparing releases to correctly increment version numbers (major/minor/patch) and structure changelog entries. It includes rules for pre-release identifiers and clear examples for developers.
commit-standards
DocumentsThis skill formats Git commit messages according to the Conventional Commits standard. It provides templates and type definitions (like `feat`, `fix`, `refactor`) to ensure consistency when writing or reviewing commits. Use it during the commit process to create clear, structured commit history.
huggingface-tokenizers
DocumentsThis skill provides high-performance tokenization using HuggingFace's Rust-based library, processing 1GB of text in under 20 seconds. It supports BPE, WordPiece, and Unigram algorithms while enabling custom tokenizer training and alignment tracking. Use it when you need production-fast tokenization or to build custom tokenizers integrated with the transformers ecosystem.
nano-pdf
Documentsnano-pdf is a CLI tool that lets developers edit PDFs using natural-language instructions, like changing text or fixing typos on specific pages. It's ideal for quick, programmatic PDF modifications directly from the terminal. Always verify the output, as page numbering can vary between versions.
