Back to Skills

context-hierarchy-design

majiayu000
Updated Today
1 views
58
9
58
View on GitHub
Designaidesign

About

This skill helps developers design progressive context loading systems to optimize Claude's memory usage. It provides a three-tier hierarchy strategy for organizing CLAUDE.md imports, implementing just-in-time loading, and creating priming hierarchies. Use it when setting up context infrastructure, refactoring bloated files, or optimizing agent startup time.

Quick Install

Claude Code

Recommended
Plugin CommandRecommended
/plugin add https://github.com/majiayu000/claude-skill-registry
Git CloneAlternative
git clone https://github.com/majiayu000/claude-skill-registry.git ~/.claude/skills/context-hierarchy-design

Copy and paste this command in Claude Code to install this skill

Documentation

Context Hierarchy Design Skill

Design a memory hierarchy that loads context progressively based on task needs.

Purpose

Not all context is needed all the time. A well-designed hierarchy ensures agents get exactly the context they need without bloat.

When to Use

  • Setting up a new project's context infrastructure
  • Refactoring bloated CLAUDE.md files
  • Creating task-specific context loading
  • Optimizing agent startup time
  • Scaling context for multiple task types

The Three-Tier Memory Strategy

Tier 1: CLAUDE.md (minimal, always loaded)
        |
        v
Tier 2: Priming Commands (task-specific, on-demand)
        |
        v
Tier 3: File Reads (just-in-time, as needed)

Design Process

Step 1: Audit Current State

Analyze existing context infrastructure:

  1. Measure CLAUDE.md size
  2. Count imports and their sizes
  3. Review command coverage
  4. Identify task types
Checklist:
- [ ] CLAUDE.md token count
- [ ] Import count and total
- [ ] Task types identified
- [ ] Command coverage mapped

Step 2: Categorize Context

For each piece of existing context, ask:

QuestionIf Yes ->
Needed for EVERY task?Tier 1 (CLAUDE.md)
Needed for task TYPE?Tier 2 (Priming)
Needed for specific work?Tier 3 (On-demand)

Step 3: Design Tier 1 (CLAUDE.md)

Minimal essentials only:

# Project Name

## Context
One-sentence project description.

## Tooling
- Language: X
- Runtime: Y
- Package Manager: Z

## Key Commands
- `cmd test` - Run tests
- `cmd build` - Build project

## Critical Rules
1. Most important rule
2. Second most important
3. Third most important (max 5)

Target size: less than 2KB.

Step 4: Design Tier 2 (Priming Commands)

Create commands for each task type:

Task TypeCommandContext Loaded
General/primeREADME, git status
Bug Fix/prime-bugRecent commits, test files
Feature/prime-featureArchitecture, API patterns
Review/prime-reviewStyle guide, PR diff
Docs/prime-docsExisting docs, API surface

Priming Command Template:

# Prime: [Task Type]

## Run
[Discovery commands for current state]

## Read
[Essential files for this task type]

## Report
[Summary prompt for understanding]

Step 5: Design Tier 3 (On-Demand)

Identify files loaded during work:

CategoryLoading Strategy
Source filesRead specific file when editing
Test filesRead when running tests
ConfigsRead when configuring
DependenciesRead when debugging deps

Principle: Never pre-load what can be loaded on-demand.

Output Format

Design document structure:

# Context Hierarchy Design: [Project]

## Tier 1: CLAUDE.md (Always Loaded)

**Target Size:** <2KB
**Content:**
- Project context (1 sentence)
- Tooling (language, runtime, PM)
- Key commands (3-5)
- Critical rules (3-5)

## Tier 2: Priming Commands (Task-Specific)

### /prime
**Purpose:** General codebase understanding
**Loads:** README, git status, project structure

### /prime-bug
**Purpose:** Bug fixing context
**Loads:** Recent commits, test patterns, error logs

### /prime-feature
**Purpose:** Feature development context
**Loads:** Architecture, API patterns, similar features

## Tier 3: On-Demand (Just-in-Time)

**Strategy:** Load during execution, not pre-emptively
- Source files: Read when editing
- Tests: Read when testing
- Configs: Read when configuring

## Migration Plan

1. [ ] Backup current CLAUDE.md
2. [ ] Create minimal CLAUDE.md
3. [ ] Create priming commands
4. [ ] Test with common task types
5. [ ] Iterate based on usage

Design Patterns

Pattern: Import Hierarchy

CLAUDE.md (always)
  -> imports/core.md (always, if large)
  -> imports/tooling.md (conditional)
  -> imports/workflows.md (conditional)

Pattern: Directory-Scoped Memory

/project
  CLAUDE.md (project-wide)
  /frontend
    CLAUDE.md (frontend-specific)
  /backend
    CLAUDE.md (backend-specific)

Pattern: Task-Type Priming

/prime -> Base context
/prime-{type} -> Type-specific context

Validation Criteria

After design, verify:

  • CLAUDE.md under 2KB
  • Every task type has priming command
  • No "just in case" loading
  • Progressive disclosure works
  • Common workflows smooth

Anti-Patterns to Avoid

Anti-PatternProblemSolution
Everything in CLAUDE.mdBloatMove to priming
No priming commandsStatic contextAdd task-type priming
Import everythingWasted tokensConditional imports
Directory CLAUDE.md bloatCompoundingKeep scoped minimal

Key Quote

"Use context priming over CLAUDE.md or similar auto-loading memory files. Engineering work constantly changes, but memory files only grow."

Cross-References

  • @context-priming-patterns.md - Priming command patterns
  • @rd-framework.md - Reduce unnecessary context
  • @context-layers.md - What loads into context
  • @minimum-context-principle.md - Include only what's necessary

Version History

  • v1.0.0 (2025-12-26): Initial release

Last Updated

Date: 2025-12-26 Model: claude-opus-4-5-20251101

GitHub Repository

majiayu000/claude-skill-registry
Path: skills/context-hierarchy-design

Related Skills

content-collections

Meta

This skill provides a production-tested setup for Content Collections, a TypeScript-first tool that transforms Markdown/MDX files into type-safe data collections with Zod validation. Use it when building blogs, documentation sites, or content-heavy Vite + React applications to ensure type safety and automatic content validation. It covers everything from Vite plugin configuration and MDX compilation to deployment optimization and schema validation.

View skill

creating-opencode-plugins

Meta

This skill provides the structure and API specifications for creating OpenCode plugins that hook into 25+ event types like commands, files, and LSP operations. It offers implementation patterns for JavaScript/TypeScript modules that intercept and extend the AI assistant's lifecycle. Use it when you need to build event-driven plugins for monitoring, custom handling, or extending OpenCode's capabilities.

View skill

sglang

Meta

SGLang is a high-performance LLM serving framework that specializes in fast, structured generation for JSON, regex, and agentic workflows using its RadixAttention prefix caching. It delivers significantly faster inference, especially for tasks with repeated prefixes, making it ideal for complex, structured outputs and multi-turn conversations. Choose SGLang over alternatives like vLLM when you need constrained decoding or are building applications with extensive prefix sharing.

View skill

evaluating-llms-harness

Testing

This Claude Skill runs the lm-evaluation-harness to benchmark LLMs across 60+ standardized academic tasks like MMLU and GSM8K. It's designed for developers to compare model quality, track training progress, or report academic results. The tool supports various backends including HuggingFace and vLLM models.

View skill