SKILL·70C11F

review-codebase

Name: review-codebase
Author: pjt222

pjt222

Updated 1 month ago

9 views

Metadesign

About

This skill performs a comprehensive, multi-phase review of an entire codebase, analyzing architecture, security, code quality, and UX/accessibility in a single coordinated pass. It outputs a structured table of prioritized findings with severity ratings, which is formatted for direct conversion into GitHub issues. Use it for a deep, holistic audit rather than for reviewing isolated changes or single domains.

Quick Install

Claude Code

Recommended

Primary

npx skills add pjt222/agent-almanac -a claude-code

Plugin CommandAlternative

/plugin add https://github.com/pjt222/agent-almanac

Git CloneAlternative

git clone https://github.com/pjt222/agent-almanac.git ~/.claude/skills/review-codebase

Copy and paste this command in Claude Code to install this skill

Documentation

Review Codebase

Multi-phase deep codebase review producing severity-rated findings with fix-order recommendations. Unlike review-pull-request (scoped to a diff) or single-domain reviews (security-audit-codebase, review-software-architecture), this skill covers entire project or subproject across all quality dimensions in one pass.

When Use

Whole-project or subproject review (not PR-scoped)
New codebase onboarding — building mental model of what exists and what needs attention
Periodic health checks after sustained development
Pre-release quality gate across architecture, security, code quality, UX
When output should feed direct into issue creation or sprint planning

Inputs

Required: target_path — root directory of codebase or subproject to review
Optional:
- scope — which phases to run: full (default), security, architecture, quality, ux
- output_format — findings (table only), report (narrative), both (default)
- severity_threshold — minimum severity to include: LOW (default), MEDIUM, HIGH, CRITICAL

Steps

Step 1: Census

Inventory codebase to establish scope and identify review targets.

Count files by language/type: find target_path -type f | sort by extension
Measure total line counts per language
ID test directories and estimate test coverage (files with tests vs files without)
Check dependency state: lockfiles present, outdated dependencies, known vulnerabilities
Note build system, CI/CD configuration, documentation state
Record census as opening section of report

Got: Factual inventory — file counts, languages, test presence, dependency health. No judgments yet.

If fail: Target path empty or inaccessible? Stop and report. Specific subdirectories inaccessible? Note them and continue with what is available.

Step 2: Architecture Review

Assess structural health: coupling, duplication, data flow, separation of concerns.

Map module/directory structure. ID primary architectural pattern
Check for code duplication — repeated logic across files, copy-paste patterns
Assess coupling — how many files must change for single feature modification
Evaluate data flow — clear boundaries between layers (UI, logic, data)?
ID dead code, unused exports, orphaned files
Check for consistent patterns — does codebase follow its own conventions?
Rate each finding: CRITICAL, HIGH, MEDIUM, or LOW

Got: List of architectural findings with severity ratings and file references. Common findings: mode dispatch duplication, missing abstraction layers, circular dependencies.

If fail: Codebase too small for meaningful architecture review (< 5 files)? Note this and skip to Step 3. Architecture review needs enough code to have structure.

Step 3: Security Audit

Identify security vulnerabilities and defensive coding gaps.

Scan for injection vectors: HTML injection (innerHTML), SQL injection, command injection
Check authentication and authorization patterns (if applicable)
Review error handling — errors silently swallowed? Error messages leak internals?
Audit dependency versions against known CVEs
Check for hardcoded secrets, API keys, credentials
Review Docker/container security: root user, exposed ports, build secrets
Check localStorage/sessionStorage for sensitive data storage
Rate each finding: CRITICAL, HIGH, MEDIUM, or LOW

Got: List of security findings with severity, affected files, remediation guidance. CRITICAL findings include injection vulnerabilities and exposed secrets.

If fail: No security-relevant code exists (pure documentation project)? Note this and skip to Step 4.

Step 4: Code Quality

Evaluate maintainability, readability, defensive coding.

ID magic numbers and hardcoded values that should be named constants
Check for consistent naming conventions across codebase
Find missing input validation at system boundaries
Assess error handling patterns — consistent? Provide useful messages?
Check for commented-out code, TODO/FIXME markers, incomplete implementations
Review test quality — tests testing behavior or implementation details?
Rate each finding: CRITICAL, HIGH, MEDIUM, or LOW

Got: List of quality findings focused on maintainability. Common findings: magic numbers, inconsistent patterns, missing guards.

If fail: Codebase generated or minified? Note this and adjust expectations. Generated code has different quality criteria than hand-written code.

Step 5: UX and Accessibility (if frontend exists)

Evaluate user experience and accessibility compliance.

Check ARIA roles, labels, landmarks on interactive elements
Verify keyboard navigation — can all interactive elements be reached via Tab?
Test focus management — does focus move logical when panels open/close?
Check responsive design — test at common breakpoints (320px, 768px, 1024px)
Verify color contrast ratios meet WCAG 2.1 AA standards
Check screen reader compatibility — dynamic content changes announced?
Rate each finding: CRITICAL, HIGH, MEDIUM, or LOW

Got: List of UX/a11y findings with WCAG references where applicable. No frontend exists? This step produces "N/A — no frontend code detected."

If fail: Frontend code exists but cannot be rendered (missing build step)? Audit source code statically and note that runtime testing was not possible.

Step 6: Findings Synthesis

Compile all findings into prioritized summary.

Merge findings from all phases into single table
Sort by severity (CRITICAL first, then HIGH, MEDIUM, LOW)
Within each severity level, group by theme (security, architecture, quality, UX)
For each finding, include: severity, phase, file(s), one-line description, suggested fix
Produce recommended fix order that considers dependencies between fixes
Summarize: total findings by severity, top 3 priorities, estimated effort level

Got: Findings table with columns: #, Severity, Phase, File(s), Finding, Fix. Fix-order recommendation that accounts for dependencies (e.g., "refactor architecture before adding tests").

If fail: No findings produced? This is itself a finding — either codebase exceptionally clean or review too shallow. Re-examine at least one phase with deeper inspection.

Checks

All requested phases completed (or explicit skipped with justification)
Every finding has severity rating (CRITICAL/HIGH/MEDIUM/LOW)
Every finding references at least one file or directory
Findings table sorted by severity
Fix-order recommendations account for dependencies between findings
Summary includes total counts by severity
If output_format includes report, narrative sections accompany table

Scaling with Rest

Between review phases, use /rest as checkpoint — especially between phases 2-5, which need different analytical perspectives. Checkpoint rest (brief, transitional) prevents momentum of one phase from biasing next. See rest skill "Scaling Rest" section for guidance on checkpoint vs full rest.

Pitfalls

Boil the ocean: Review every line of large codebase produces noise. Focus on high-impact areas: entry points, security boundaries, architectural seams
Severity inflation: Not every finding is CRITICAL. Reserve CRITICAL for exploitable vulnerabilities and data-loss risks. Most architectural issues are MEDIUM
Miss the forest for the trees: Individual code quality issues matter less than systemic patterns. Magic numbers appear in 20 files? That is one architectural finding, not 20 quality findings
Skip the census: Census (Step 1) seems bureaucratic but prevents reviewing code that does not exist or missing entire directories
Phase bleed: Security findings during architecture review, or quality findings during security audit. Note them for correct phase rather than mix concerns — produces cleaner findings table

GitHub Repository

pjt222/agent-almanac

Path: i18n/caveman/skills/review-codebase

agentsagentskillsai-assisted-developmentclaude-codeskillsteams

FAQ

Frequently asked questions

What is the review-codebase skill?

review-codebase is a Claude Skill by pjt222. Skills package instructions and resources that Claude loads on demand, so Claude can perform review-codebase-related tasks without extra prompting.

How do I install review-codebase?

Use the install commands on this page: add review-codebase to Claude Code as a plugin, or clone its repository into your skills directory, then restart Claude so it picks up the skill.

What category does review-codebase belong to?

review-codebase is in the Meta category, tagged design.

Is review-codebase free to use?

Yes. review-codebase is listed on AIMCP and free to install. It runs inside Claude, so no separate service account is required to use the skill itself.

Related Skills

content-collections

Meta

This skill provides a production-tested setup for Content Collections, a TypeScript-first tool that transforms Markdown/MDX files into type-safe data collections with Zod validation. Use it when building blogs, documentation sites, or content-heavy Vite + React applications to ensure type safety and automatic content validation. It covers everything from Vite plugin configuration and MDX compilation to deployment optimization and schema validation.

View skill

polymarket

Meta

This skill enables developers to build applications with the Polymarket prediction markets platform, including API integration for trading and market data. It also provides real-time data streaming via WebSocket to monitor live trades and market activity. Use it for implementing trading strategies or creating tools that process live market updates.

View skill

creating-opencode-plugins

Meta

This skill helps developers create OpenCode plugins that hook into 25+ event types like commands, files, and LSP operations. It provides the plugin structure, event API specifications, and implementation patterns for JavaScript/TypeScript modules. Use it when you need to intercept, monitor, or extend the OpenCode AI assistant's lifecycle with custom event-driven logic.

View skill

sglang

Meta

SGLang is a high-performance LLM serving framework that specializes in fast, structured generation for JSON, regex, and agentic workflows using its RadixAttention prefix caching. It delivers significantly faster inference, especially for tasks with repeated prefixes, making it ideal for complex, structured outputs and multi-turn conversations. Choose SGLang over alternatives like vLLM when you need constrained decoding or are building applications with extensive prefix sharing.

View skill

review-codebase

About

Quick Install

Claude Code

Documentation

Review Codebase

When Use

Inputs

Steps

Step 1: Census

Step 2: Architecture Review

Step 3: Security Audit

Step 4: Code Quality

Step 5: UX and Accessibility (if frontend exists)

Step 6: Findings Synthesis

Checks

Scaling with Rest

Pitfalls

See Also

GitHub Repository

Frequently asked questions

What is the review-codebase skill?

How do I install review-codebase?

What category does review-codebase belong to?

Is review-codebase free to use?

Related Skills