bigquery-patent-search
About
This skill enables developers to perform fast, cloud-based searches across over 76 million worldwide patents using Google BigQuery. It supports keyword searches across titles and abstracts, filtering by CPC classification codes, and retrieving detailed patent information. Use it for prior art searches, patent landscape research, or looking up specific patent details by publication number.
Documentation
BigQuery Patent Search Skill
Fast, cloud-based patent searching across 76 million+ worldwide patents using Google BigQuery.
When to Use
Invoke this skill when users ask to:
- Search for prior art patents
- Find patents in a specific technology area
- Search by CPC classification code
- Look up patent details by publication number
- Conduct freedom-to-operate searches
- Research patent landscapes
What This Skill Does
Provides access to Google's public patent dataset:
-
Keyword Search across 76M+ patents:
- Full-text search of titles, abstracts, claims
- Filter by country (US, EP, JP, CN, etc.)
- Filter by filing/grant date ranges
- Fast cloud-based queries (< 5 seconds)
-
CPC Classification Search:
- Search by CPC code (e.g., "G06F16/", "H04L29/06")
- Browse patent classifications
- Find patents in specific technical domains
-
Patent Details Retrieval:
- Get full patent text by publication number
- Access title, abstract, claims, description
- View CPC codes, inventors, assignees
- See filing and grant dates
Required Setup
This skill requires Google Cloud authentication:
Prerequisites:
- Google Cloud Project (free to create)
- BigQuery API enabled (free for reasonable usage)
- Application Default Credentials configured
Setup Commands:
# Install Google Cloud SDK (if not installed)
# Visit: https://cloud.google.com/sdk/docs/install
# Authenticate
gcloud auth application-default login
# Set project (get ID from console.cloud.google.com)
export GOOGLE_CLOUD_PROJECT=your-project-id
Environment Variable:
Set in .env file: GOOGLE_CLOUD_PROJECT=your-project-id
How to Use
When this skill is invoked:
-
Initialize BigQuery searcher:
import sys sys.path.insert(0, os.path.join(os.environ.get('CLAUDE_PLUGIN_ROOT', '.'), 'python')) from python.bigquery_search import BigQueryPatentSearch searcher = BigQueryPatentSearch() -
Search by keywords:
results = searcher.search_patents( query="blockchain authentication", limit=20, country="US", # Optional: filter by country start_year=2020, # Optional: filter by year end_year=2024 ) -
Search by CPC code:
results = searcher.search_by_cpc( cpc_code="G06F16/", # CPC prefix limit=20, country="US" ) -
Get patent details:
patent = searcher.get_patent( patent_number="US10123456B2" # Publication number )
BigQuery Dataset
Uses patents-public-data.patents on Google BigQuery:
- 76M+ worldwide patents
- 12M+ US patents with full text
- Updated weekly
- Free to query (no billing for reasonable usage)
Search Result Format
Each result includes:
{
"publication_number": "US10123456B2",
"title": "Method and system for...",
"abstract": "A system for...",
"filing_date": "2019-01-15",
"grant_date": "2020-06-30",
"country": "US",
"cpc_codes": ["G06F16/245", "H04L29/06"],
"inventors": ["John Doe", "Jane Smith"],
"assignee": "Example Corp"
}
Full patent details also include:
claims: Full text of all claimsdescription: Complete description sectionpriority_date: Earliest priority datefamily_id: Patent family ID
Presentation Format
Present search results as:
PATENT SEARCH RESULTS
====================
Query: "blockchain authentication"
Found: 247 patents (showing top 20)
Date Range: 2020-2024
Country: US
[1] US10123456B2 - System for blockchain-based authentication
Assignee: Example Corp
Filed: 2019-01-15 | Granted: 2020-06-30
CPC: G06F16/245, H04L29/06
Abstract: A system for authenticating users using blockchain
technology with distributed ledger verification...
[2] US10234567B1 - Method of secure authentication using blockchain
...
---
Top 5 Most Relevant:
1. US10123456B2 (95% relevance)
2. US10234567B1 (92% relevance)
...
Advanced Search Techniques
-
Boolean Operators in queries:
- "blockchain AND authentication"
- "encryption OR cryptography"
- "(mobile OR wireless) AND security"
-
Phrase Search:
- "distributed ledger technology"
- "public key infrastructure"
-
CPC Code Hierarchies:
- "G06F" = Computing
- "G06F16/" = Information retrieval
- "G06F16/245" = Structured query language
Common CPC Codes
- G06F: Computing, calculating, counting
- H04L: Digital communication
- G06Q: Business methods
- H04W: Wireless communication
- G06N: Computer systems based on specific models
- G06T: Image processing
Error Handling
If BigQuery is not configured:
- Check if
google-cloud-bigqueryis installed - Verify authentication:
gcloud auth application-default login - Confirm project ID in environment:
GOOGLE_CLOUD_PROJECT - Test with:
python scripts/test_bigquery.py
Cost Considerations
BigQuery pricing:
- First 1TB/month: FREE
- After 1TB: $5 per TB queried
- Typical query: 10-50 MB per search
- ~20,000 searches free per month
Tools Available
- Bash: To run Python BigQuery searches
- Read: To load saved search results
- Write: To save patent search results
- Grep: To search through saved results
Quick Install
/plugin add https://github.com/RobThePCGuy/Claude-Patent-Creator/tree/main/bigquery-patent-searchCopy and paste this command in Claude Code to install this skill
GitHub 仓库
Related Skills
sglang
MetaSGLang is a high-performance LLM serving framework that specializes in fast, structured generation for JSON, regex, and agentic workflows using its RadixAttention prefix caching. It delivers significantly faster inference, especially for tasks with repeated prefixes, making it ideal for complex, structured outputs and multi-turn conversations. Choose SGLang over alternatives like vLLM when you need constrained decoding or are building applications with extensive prefix sharing.
evaluating-llms-harness
TestingThis Claude Skill runs the lm-evaluation-harness to benchmark LLMs across 60+ standardized academic tasks like MMLU and GSM8K. It's designed for developers to compare model quality, track training progress, or report academic results. The tool supports various backends including HuggingFace and vLLM models.
llamaguard
OtherLlamaGuard is Meta's 7-8B parameter model for moderating LLM inputs and outputs across six safety categories like violence and hate speech. It offers 94-95% accuracy and can be deployed using vLLM, Hugging Face, or Amazon SageMaker. Use this skill to easily integrate content filtering and safety guardrails into your AI applications.
langchain
MetaLangChain is a framework for building LLM applications using agents, chains, and RAG pipelines. It supports multiple LLM providers, offers 500+ integrations, and includes features like tool calling and memory management. Use it for rapid prototyping and deploying production systems like chatbots, autonomous agents, and question-answering services.
