SKILL·9E55E6

review-codebase

Name: review-codebase
Author: pjt222

pjt222

Mis à jour 1 month ago

9 vues

Métadesign

À propos

Cette compétence effectue un examen complet et multiphase d'une base de code entière, analysant l'architecture, la sécurité, la qualité du code et l'UX/accessibilité en une seule passe. Elle produit un tableau priorisé des résultats avec des niveaux de gravité, formaté pour une conversion directe en issues GitHub. Utilisez-la pour un audit approfondi et coordonné, plutôt que pour examiner des modifications isolées comme des pull requests.

Installation rapide

Claude Code

Recommandé

Principal

npx skills add pjt222/agent-almanac -a claude-code

Commande PluginAlternatif

/plugin add https://github.com/pjt222/agent-almanac

Git CloneAlternatif

git clone https://github.com/pjt222/agent-almanac.git ~/.claude/skills/review-codebase

Copiez et collez cette commande dans Claude Code pour installer cette compétence

Documentation

Review Codebase

Multi-phase deep codebase review producing severity-rated findings with fix-order recommendations. Unlike review-pull-request (scoped to a diff) or single-domain reviews (security-audit-codebase, review-software-architecture), this skill covers an entire project or subproject across all quality dimensions in one pass.

When to Use

Whole-project or subproject review (not PR-scoped)
New codebase onboarding — building a mental model of what exists and what needs attention
Periodic health checks after sustained development
Pre-release quality gate across architecture, security, code quality, and UX
When the output should feed directly into issue creation or sprint planning

Inputs

Required: target_path — root directory of the codebase or subproject to review
Optional:
- scope — which phases to run: full (default), security, architecture, quality, ux
- output_format — findings (table only), report (narrative), both (default)
- severity_threshold — minimum severity to include: LOW (default), MEDIUM, HIGH, CRITICAL

Procedure

Step 1: Census

Inventory the codebase to establish scope and identify review targets.

Count files by language/type: find target_path -type f | sort by extension
Measure total line counts per language
Identify test directories and estimate test coverage (files with tests vs files without)
Check dependency state: lockfiles present, outdated dependencies, known vulnerabilities
Note build system, CI/CD configuration, and documentation state
Record the census as the opening section of the report

Got: A factual inventory — file counts, languages, test presence, dependency health. No judgments yet.

If fail: If the target path is empty or inaccessible, stop and report. If specific subdirectories are inaccessible, note them and continue with what is available.

Step 2: Architecture Review

Assess structural health: coupling, duplication, data flow, and separation of concerns.

Map the module/directory structure and identify the primary architectural pattern
Check for code duplication — repeated logic across files, copy-paste patterns
Assess coupling — how many files must change for a single feature modification
Evaluate data flow — are there clear boundaries between layers (UI, logic, data)?
Identify dead code, unused exports, and orphaned files
Check for consistent patterns — does the codebase follow its own conventions?
Rate each finding: CRITICAL, HIGH, MEDIUM, or LOW

Got: A list of architectural findings with severity ratings and file references. Common findings: mode dispatch duplication, missing abstraction layers, circular dependencies.

If fail: If the codebase is too small for meaningful architecture review (< 5 files), note this and skip to Step 3. Architecture review requires enough code to have structure.

Step 3: Security Audit

Identify security vulnerabilities and defensive coding gaps.

Scan for injection vectors: HTML injection (innerHTML), SQL injection, command injection
Check authentication and authorization patterns (if applicable)
Review error handling — are errors silently swallowed? Do error messages leak internals?
Audit dependency versions against known CVEs
Check for hardcoded secrets, API keys, or credentials
Review Docker/container security: root user, exposed ports, build secrets
Check localStorage/sessionStorage for sensitive data storage
Rate each finding: CRITICAL, HIGH, MEDIUM, or LOW

Got: A list of security findings with severity, affected files, and remediation guidance. CRITICAL findings include injection vulnerabilities and exposed secrets.

If fail: If no security-relevant code exists (pure documentation project), note this and skip to Step 4.

Step 4: Code Quality

Evaluate maintainability, readability, and defensive coding.

Identify magic numbers and hardcoded values that should be named constants
Check for consistent naming conventions across the codebase
Find missing input validation at system boundaries
Assess error handling patterns — are they consistent? Do they provide useful messages?
Check for commented-out code, TODO/FIXME markers, and incomplete implementations
Review test quality — are tests testing behavior or implementation details?
Rate each finding: CRITICAL, HIGH, MEDIUM, or LOW

Got: A list of quality findings focused on maintainability. Common findings: magic numbers, inconsistent patterns, missing guards.

If fail: If the codebase is generated or minified, note this and adjust expectations. Generated code has different quality criteria than hand-written code.

Step 5: UX and Accessibility (if frontend exists)

Evaluate user experience and accessibility compliance.

Check ARIA roles, labels, and landmarks on interactive elements
Verify keyboard navigation — can all interactive elements be reached via Tab?
Test focus management — does focus move logically when panels open/close?
Check responsive design — test at common breakpoints (320px, 768px, 1024px)
Verify color contrast ratios meet WCAG 2.1 AA standards
Check screen reader compatibility — are dynamic content changes announced?
Rate each finding: CRITICAL, HIGH, MEDIUM, or LOW

Got: A list of UX/a11y findings with WCAG references where applicable. If no frontend exists, this step produces "N/A — no frontend code detected."

If fail: If frontend code exists but cannot be rendered (missing build step), audit the source code statically and note that runtime testing was not possible.

Step 6: Findings Synthesis

Compile all findings into a prioritized summary.

Merge findings from all phases into a single table
Sort by severity (CRITICAL first, then HIGH, MEDIUM, LOW)
Within each severity level, group by theme (security, architecture, quality, UX)
For each finding, include: severity, phase, file(s), one-line description, suggested fix
Produce a recommended fix order that considers dependencies between fixes
Summarize: total findings by severity, top 3 priorities, estimated effort level

Got: A findings table with columns: #, Severity, Phase, File(s), Finding, Fix. A fix-order recommendation that accounts for dependencies (e.g., "refactor architecture before adding tests").

If fail: If no findings were produced, this is itself a finding — either the codebase is exceptionally clean or the review was too shallow. Re-examine at least one phase with deeper inspection.

Validation

All requested phases were completed (or explicitly skipped with justification)
Every finding has a severity rating (CRITICAL/HIGH/MEDIUM/LOW)
Every finding references at least one file or directory
The findings table is sorted by severity
Fix-order recommendations account for dependencies between findings
The summary includes total counts by severity
If output_format includes report, narrative sections accompany the table

Scaling with Rest

Between review phases, use /rest as a checkpoint — especially between phases 2-5, which require different analytical perspectives. A checkpoint rest (brief, transitional) prevents the momentum of one phase from biasing the next. See the rest skill's "Scaling Rest" section for guidance on checkpoint vs full rest.

Pitfalls

Boiling the ocean: Reviewing every line of a large codebase produces noise. Focus on high-impact areas: entry points, security boundaries, and architectural seams
Severity inflation: Not every finding is CRITICAL. Reserve CRITICAL for exploitable vulnerabilities and data-loss risks. Most architectural issues are MEDIUM
Missing the forest for the trees: Individual code quality issues matter less than systemic patterns. If magic numbers appear in 20 files, that is one architectural finding, not 20 quality findings
Skipping the census: The census (Step 1) seems bureaucratic but prevents reviewing code that does not exist or missing entire directories
Phase bleed: Security findings during architecture review, or quality findings during security audit. Note them for the correct phase rather than mixing concerns — it produces a cleaner findings table

Related Skills

security-audit-codebase — deep-dive security audit when the review-codebase security phase reveals complex vulnerabilities
review-software-architecture — detailed architecture review for specific subsystems
review-ux-ui — comprehensive UX/accessibility audit beyond what phase 5 covers
review-pull-request — diff-scoped review for individual changes
clean-codebase — implements the code quality fixes identified by this review
create-github-issues — converts findings table into tracked GitHub issues

Dépôt GitHub

pjt222/agent-almanac

Chemin: i18n/caveman-lite/skills/review-codebase

agentsagentskillsai-assisted-developmentclaude-codeskillsteams

FAQ

Frequently asked questions

What is the review-codebase skill?

review-codebase is a Claude Skill by pjt222. Skills package instructions and resources that Claude loads on demand, so Claude can perform review-codebase-related tasks without extra prompting.

How do I install review-codebase?

Use the install commands on this page: add review-codebase to Claude Code as a plugin, or clone its repository into your skills directory, then restart Claude so it picks up the skill.

What category does review-codebase belong to?

review-codebase is in the Meta category, tagged design.

Is review-codebase free to use?

Yes. review-codebase is listed on AIMCP and free to install. It runs inside Claude, so no separate service account is required to use the skill itself.

Compétences associées

content-collections

Méta

Cette compétence propose une configuration éprouvée en production pour Content Collections, un outil axé sur TypeScript qui transforme des fichiers Markdown/MDX en collections de données typées de manière sûre avec une validation Zod. Utilisez-la lors de la création de blogs, de sites de documentation ou d'applications Vite + React riches en contenu pour garantir la sécurité de typage et la validation automatique du contenu. Elle couvre tout, de la configuration du plugin Vite et de la compilation MDX à l'optimisation des déploiements et la validation des schémas.

Voir la compétence

polymarket

Méta

Cette compétence permet aux développeurs de créer des applications avec la plateforme de marchés prédictifs Polymarket, incluant l'intégration d'API pour le trading et les données de marché. Elle fournit également une diffusion de données en temps réel via WebSocket pour surveiller les transactions en direct et l'activité du marché. Utilisez-la pour mettre en œuvre des stratégies de trading ou pour créer des outils traitant les mises à jour de marché en direct.

Voir la compétence

creating-opencode-plugins

Méta

Cette compétence aide les développeurs à créer des plugins OpenCode qui s'interconnectent avec plus de 25 types d'événements tels que les commandes, les fichiers et les opérations LSP. Elle fournit la structure du plugin, les spécifications de l'API événementielle et les modèles d'implémentation pour les modules JavaScript/TypeScript. Utilisez-la lorsque vous avez besoin d'intercepter, de surveiller ou d'étendre le cycle de vie de l'assistant IA OpenCode avec une logique personnalisée pilotée par les événements.

Voir la compétence

sglang

Méta

SGLang est un framework de service LLM haute performance spécialisé dans la génération rapide et structurée pour les workflows JSON, regex et agentiques grâce à son cache de préfixe RadixAttention. Il offre une inférence nettement plus rapide, particulièrement pour les tâches avec des préfixes répétés, ce qui le rend idéal pour les sorties complexes et structurées ainsi que les conversations multi-tours. Choisissez SGLang plutôt que des alternatives comme vLLM lorsque vous avez besoin d'un décodage contraint ou que vous construisez des applications avec un partage étendu de préfixes.

Voir la compétence