Back to Skills

bigquery-patent-search

RobThePCGuy
Updated Today
39 views
2
2
View on GitHub
Documentswordai

About

This skill enables developers to perform fast, cloud-based searches across over 76 million worldwide patents using Google BigQuery. It supports keyword searches across titles and abstracts, filtering by CPC classification codes, and retrieving detailed patent information. Use it for prior art searches, patent landscape research, or looking up specific patent details by publication number.

Documentation

BigQuery Patent Search Skill

Fast, cloud-based patent searching across 76 million+ worldwide patents using Google BigQuery.

When to Use

Invoke this skill when users ask to:

  • Search for prior art patents
  • Find patents in a specific technology area
  • Search by CPC classification code
  • Look up patent details by publication number
  • Conduct freedom-to-operate searches
  • Research patent landscapes

What This Skill Does

Provides access to Google's public patent dataset:

  1. Keyword Search across 76M+ patents:

    • Full-text search of titles, abstracts, claims
    • Filter by country (US, EP, JP, CN, etc.)
    • Filter by filing/grant date ranges
    • Fast cloud-based queries (< 5 seconds)
  2. CPC Classification Search:

    • Search by CPC code (e.g., "G06F16/", "H04L29/06")
    • Browse patent classifications
    • Find patents in specific technical domains
  3. Patent Details Retrieval:

    • Get full patent text by publication number
    • Access title, abstract, claims, description
    • View CPC codes, inventors, assignees
    • See filing and grant dates

Required Setup

This skill requires Google Cloud authentication:

Prerequisites:

  1. Google Cloud Project (free to create)
  2. BigQuery API enabled (free for reasonable usage)
  3. Application Default Credentials configured

Setup Commands:

# Install Google Cloud SDK (if not installed)
# Visit: https://cloud.google.com/sdk/docs/install

# Authenticate
gcloud auth application-default login

# Set project (get ID from console.cloud.google.com)
export GOOGLE_CLOUD_PROJECT=your-project-id

Environment Variable: Set in .env file: GOOGLE_CLOUD_PROJECT=your-project-id

How to Use

When this skill is invoked:

  1. Initialize BigQuery searcher:

    import sys
    sys.path.insert(0, os.path.join(os.environ.get('CLAUDE_PLUGIN_ROOT', '.'), 'python'))
    from python.bigquery_search import BigQueryPatentSearch
    
    searcher = BigQueryPatentSearch()
    
  2. Search by keywords:

    results = searcher.search_patents(
        query="blockchain authentication",
        limit=20,
        country="US",  # Optional: filter by country
        start_year=2020,  # Optional: filter by year
        end_year=2024
    )
    
  3. Search by CPC code:

    results = searcher.search_by_cpc(
        cpc_code="G06F16/",  # CPC prefix
        limit=20,
        country="US"
    )
    
  4. Get patent details:

    patent = searcher.get_patent(
        patent_number="US10123456B2"  # Publication number
    )
    

BigQuery Dataset

Uses patents-public-data.patents on Google BigQuery:

  • 76M+ worldwide patents
  • 12M+ US patents with full text
  • Updated weekly
  • Free to query (no billing for reasonable usage)

Search Result Format

Each result includes:

{
    "publication_number": "US10123456B2",
    "title": "Method and system for...",
    "abstract": "A system for...",
    "filing_date": "2019-01-15",
    "grant_date": "2020-06-30",
    "country": "US",
    "cpc_codes": ["G06F16/245", "H04L29/06"],
    "inventors": ["John Doe", "Jane Smith"],
    "assignee": "Example Corp"
}

Full patent details also include:

  • claims: Full text of all claims
  • description: Complete description section
  • priority_date: Earliest priority date
  • family_id: Patent family ID

Presentation Format

Present search results as:

PATENT SEARCH RESULTS
====================

Query: "blockchain authentication"
Found: 247 patents (showing top 20)
Date Range: 2020-2024
Country: US

[1] US10123456B2 - System for blockchain-based authentication
    Assignee: Example Corp
    Filed: 2019-01-15 | Granted: 2020-06-30
    CPC: G06F16/245, H04L29/06

    Abstract: A system for authenticating users using blockchain
    technology with distributed ledger verification...

[2] US10234567B1 - Method of secure authentication using blockchain
    ...

---

Top 5 Most Relevant:
1. US10123456B2 (95% relevance)
2. US10234567B1 (92% relevance)
...

Advanced Search Techniques

  1. Boolean Operators in queries:

    • "blockchain AND authentication"
    • "encryption OR cryptography"
    • "(mobile OR wireless) AND security"
  2. Phrase Search:

    • "distributed ledger technology"
    • "public key infrastructure"
  3. CPC Code Hierarchies:

    • "G06F" = Computing
    • "G06F16/" = Information retrieval
    • "G06F16/245" = Structured query language

Common CPC Codes

  • G06F: Computing, calculating, counting
  • H04L: Digital communication
  • G06Q: Business methods
  • H04W: Wireless communication
  • G06N: Computer systems based on specific models
  • G06T: Image processing

Error Handling

If BigQuery is not configured:

  1. Check if google-cloud-bigquery is installed
  2. Verify authentication: gcloud auth application-default login
  3. Confirm project ID in environment: GOOGLE_CLOUD_PROJECT
  4. Test with: python scripts/test_bigquery.py

Cost Considerations

BigQuery pricing:

  • First 1TB/month: FREE
  • After 1TB: $5 per TB queried
  • Typical query: 10-50 MB per search
  • ~20,000 searches free per month

Tools Available

  • Bash: To run Python BigQuery searches
  • Read: To load saved search results
  • Write: To save patent search results
  • Grep: To search through saved results

Quick Install

/plugin add https://github.com/RobThePCGuy/Claude-Patent-Creator/tree/main/bigquery-patent-search

Copy and paste this command in Claude Code to install this skill

GitHub 仓库

RobThePCGuy/Claude-Patent-Creator
Path: skills/bigquery-patent-search
bigqueryclaude-codeclaude-code-pluginfaissmcp-servermpep

Related Skills

sglang

Meta

SGLang is a high-performance LLM serving framework that specializes in fast, structured generation for JSON, regex, and agentic workflows using its RadixAttention prefix caching. It delivers significantly faster inference, especially for tasks with repeated prefixes, making it ideal for complex, structured outputs and multi-turn conversations. Choose SGLang over alternatives like vLLM when you need constrained decoding or are building applications with extensive prefix sharing.

View skill

evaluating-llms-harness

Testing

This Claude Skill runs the lm-evaluation-harness to benchmark LLMs across 60+ standardized academic tasks like MMLU and GSM8K. It's designed for developers to compare model quality, track training progress, or report academic results. The tool supports various backends including HuggingFace and vLLM models.

View skill

llamaguard

Other

LlamaGuard is Meta's 7-8B parameter model for moderating LLM inputs and outputs across six safety categories like violence and hate speech. It offers 94-95% accuracy and can be deployed using vLLM, Hugging Face, or Amazon SageMaker. Use this skill to easily integrate content filtering and safety guardrails into your AI applications.

View skill

langchain

Meta

LangChain is a framework for building LLM applications using agents, chains, and RAG pipelines. It supports multiple LLM providers, offers 500+ integrations, and includes features like tool calling and memory management. Use it for rapid prototyping and deploying production systems like chatbots, autonomous agents, and question-answering services.

View skill