review-codebase
About
This skill performs a comprehensive, multi-phase review of an entire codebase, analyzing architecture, security, code quality, and UX/accessibility in a single pass. It outputs a prioritized table of findings with severity ratings, formatted for direct conversion into GitHub issues. Use it for a deep, coordinated review, not for reviewing isolated changes such as pull requests.
Quick Install
Claude Code
Recommended:
npx skills add pjt222/agent-almanac -a claude-code
Alternatives:
/plugin add https://github.com/pjt222/agent-almanac
git clone https://github.com/pjt222/agent-almanac.git ~/.claude/skills/review-codebase
Copy the command and paste it into Claude Code to install this skill.
Documentation
Review Codebase
Multi-phase deep codebase review producing severity-rated findings with fix-order recommendations. Unlike review-pull-request (scoped to a diff) or single-domain reviews (security-audit-codebase, review-software-architecture), this skill covers an entire project or subproject across all quality dimensions in one pass.
When to Use
- Whole-project or subproject review (not PR-scoped)
- New codebase onboarding — building a mental model of what exists and what needs attention
- Periodic health checks after sustained development
- Pre-release quality gate across architecture, security, code quality, and UX
- When the output should feed directly into issue creation or sprint planning
Inputs
- Required:
  - target_path: root directory of the codebase or subproject to review
- Optional:
  - scope: which phases to run: full (default), security, architecture, quality, ux
  - output_format: findings (table only), report (narrative), both (default)
  - severity_threshold: minimum severity to include: LOW (default), MEDIUM, HIGH, CRITICAL
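The inputs above could be modeled as a small validated record. This is only a sketch: the `ReviewInputs` class and its `includes` helper are hypothetical names introduced for illustration, not part of the skill itself. The allowed values and defaults are taken from the list above.

```python
from dataclasses import dataclass

SCOPES = ("full", "security", "architecture", "quality", "ux")
OUTPUT_FORMATS = ("findings", "report", "both")
SEVERITIES = ("LOW", "MEDIUM", "HIGH", "CRITICAL")  # ascending order


@dataclass(frozen=True)
class ReviewInputs:
    """Hypothetical container for the skill's inputs (illustration only)."""
    target_path: str
    scope: str = "full"
    output_format: str = "both"
    severity_threshold: str = "LOW"

    def __post_init__(self) -> None:
        # Reject anything outside the documented option lists.
        if not self.target_path:
            raise ValueError("target_path is required")
        if self.scope not in SCOPES:
            raise ValueError(f"unknown scope: {self.scope!r}")
        if self.output_format not in OUTPUT_FORMATS:
            raise ValueError(f"unknown output_format: {self.output_format!r}")
        if self.severity_threshold not in SEVERITIES:
            raise ValueError(f"unknown severity_threshold: {self.severity_threshold!r}")

    def includes(self, severity: str) -> bool:
        """True if a finding of this severity passes the threshold."""
        return SEVERITIES.index(severity) >= SEVERITIES.index(self.severity_threshold)
```

With `severity_threshold="HIGH"`, only HIGH and CRITICAL findings would pass `includes`.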
Procedure
Step 1: Census
Inventory the codebase to establish scope and identify review targets.
- Count files by language/type: find target_path -type f -name '*.*' | sed 's/.*\.//' | sort | uniq -c | sort -rn
- Measure total line counts per language
- Identify test directories and estimate test coverage (files with tests vs files without)
- Check dependency state: lockfiles present, outdated dependencies, known vulnerabilities
- Note build system, CI/CD configuration, and documentation state
- Record the census as the opening section of the report
Got: A factual inventory — file counts, languages, test presence, dependency health. No judgments yet.
If fail: If the target path is empty or inaccessible, stop and report. If specific subdirectories are inaccessible, note them and continue with what is available.
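A rough sketch of the census step follows. The `census` function is a hypothetical helper, and two of its rules are assumptions made for illustration: a file counts as test-related when "test" appears in its relative path, and the `.git` directory is skipped. Dependency and CI checks are left out.

```python
from collections import Counter
from pathlib import Path


def census(target_path: str) -> dict:
    """Count files and lines per extension and flag likely test files."""
    root = Path(target_path)
    if not root.is_dir():
        # Empty or inaccessible target: stop and report.
        raise FileNotFoundError(f"target path is missing or not a directory: {target_path}")

    files_by_ext: Counter = Counter()
    lines_by_ext: Counter = Counter()
    test_files = 0
    unreadable: list[str] = []

    for path in sorted(root.rglob("*")):
        rel = path.relative_to(root)
        if not path.is_file() or ".git" in rel.parts:
            continue
        ext = path.suffix.lower() or "(none)"
        files_by_ext[ext] += 1
        # Assumption: "test" anywhere in the relative path marks a test file.
        if "test" in rel.as_posix().lower():
            test_files += 1
        try:
            with path.open("r", encoding="utf-8", errors="ignore") as handle:
                lines_by_ext[ext] += sum(1 for _ in handle)
        except OSError:
            unreadable.append(str(rel))  # note it and continue with the rest

    if not files_by_ext:
        raise ValueError(f"target path is empty: {target_path}")
    return {
        "files_by_ext": dict(files_by_ext),
        "lines_by_ext": dict(lines_by_ext),
        "test_files": test_files,
        "unreadable": unreadable,
    }
```

The result is purely factual, in line with the "no judgments yet" rule for this step.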
Step 2: Architecture Review
Assess structural health: coupling, duplication, data flow, and separation of concerns.
- Map the module/directory structure and identify the primary architectural pattern
- Check for code duplication — repeated logic across files, copy-paste patterns
- Assess coupling — how many files must change for a single feature modification
- Evaluate data flow — are there clear boundaries between layers (UI, logic, data)?
- Identify dead code, unused exports, and orphaned files
- Check for consistent patterns — does the codebase follow its own conventions?
- Rate each finding: CRITICAL, HIGH, MEDIUM, or LOW
Got: A list of architectural findings with severity ratings and file references. Common findings: mode dispatch duplication, missing abstraction layers, circular dependencies.
If fail: If the codebase is too small for meaningful architecture review (< 5 files), note this and skip to Step 3. Architecture review requires enough code to have structure.
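One way to surface copy-paste duplication is to compare sliding windows of normalized lines across files. The sketch below is a deliberately naive illustration: `find_duplicate_blocks` and the window size of 4 are assumptions, and it only catches blocks that match exactly after whitespace stripping.

```python
from collections import defaultdict


def find_duplicate_blocks(
    sources: dict[str, str], window: int = 4
) -> list[list[tuple[str, int]]]:
    """Return groups of (file, 1-based start line) that share `window`
    consecutive non-blank lines, compared after stripping whitespace."""
    seen: dict[str, list[tuple[str, int]]] = defaultdict(list)
    for name, text in sources.items():
        # Drop blank lines but remember the original line numbers.
        lines = [
            (number, line.strip())
            for number, line in enumerate(text.splitlines(), start=1)
            if line.strip()
        ]
        for start in range(len(lines) - window + 1):
            chunk = "\n".join(content for _, content in lines[start:start + window])
            seen[chunk].append((name, lines[start][0]))
    # Only blocks that appear in more than one file are reported.
    return [
        locations
        for locations in seen.values()
        if len({name for name, _ in locations}) > 1
    ]
```

Each returned group is a candidate for a single architectural finding, with the listed locations as its file references.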
Step 3: Security Audit
Identify security vulnerabilities and defensive coding gaps.
- Scan for injection vectors: HTML injection (innerHTML), SQL injection, command injection
- Check authentication and authorization patterns (if applicable)
- Review error handling — are errors silently swallowed? Do error messages leak internals?
- Audit dependency versions against known CVEs
- Check for hardcoded secrets, API keys, or credentials
- Review Docker/container security: root user, exposed ports, build secrets
- Check localStorage/sessionStorage for sensitive data storage
- Rate each finding: CRITICAL, HIGH, MEDIUM, or LOW
Got: A list of security findings with severity, affected files, and remediation guidance. CRITICAL findings include injection vulnerabilities and exposed secrets.
If fail: If no security-relevant code exists (pure documentation project), note this and skip to Step 4.
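A first pass over some of these checks can be approximated with line-based patterns. The rules in `PATTERNS` below are hypothetical starter heuristics written for this illustration, not a complete rule set. Matches are only candidates: a reviewer still has to confirm each one and assign its severity.

```python
import re

# Hypothetical starter patterns; a real audit needs a much broader rule set.
PATTERNS = {
    "innerHTML assignment (possible HTML injection)": re.compile(
        r"\.innerHTML\s*\+?=(?!=)"
    ),
    "hardcoded credential": re.compile(
        r"(?i)\b(api[_-]?key|secret|password|token)\b\s*[:=]\s*['\"][^'\"]{8,}['\"]"
    ),
    "sensitive value in web storage": re.compile(
        r"(?i)(local|session)Storage\.setItem\(\s*['\"][^'\"]*(token|password|secret)"
    ),
}


def scan_text(filename: str, text: str) -> list[tuple[str, str]]:
    """Return (location, label) pairs for every line matching a pattern."""
    candidates = []
    for lineno, line in enumerate(text.splitlines(), start=1):
        for label, pattern in PATTERNS.items():
            if pattern.search(line):
                candidates.append((f"{filename}:{lineno}", label))
    return candidates
```

Comparisons such as `el.innerHTML === ""` are excluded by the lookahead, so only assignments are flagged.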
Step 4: Code Quality
Evaluate maintainability, readability, and defensive coding.
- Identify magic numbers and hardcoded values that should be named constants
- Check for consistent naming conventions across the codebase
- Find missing input validation at system boundaries
- Assess error handling patterns — are they consistent? Do they provide useful messages?
- Check for commented-out code, TODO/FIXME markers, and incomplete implementations
- Review test quality — are tests testing behavior or implementation details?
- Rate each finding: CRITICAL, HIGH, MEDIUM, or LOW
Got: A list of quality findings focused on maintainability. Common findings: magic numbers, inconsistent patterns, missing guards.
If fail: If the codebase is generated or minified, note this and adjust expectations. Generated code has different quality criteria than hand-written code.
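For Python sources, magic numbers and TODO/FIXME markers can be located with the standard `ast` and `re` modules. This sketch assumes two conventions that the source does not state: literals assigned to UPPER_CASE names count as named constants, and 0, 1, and 2 are not treated as magic. Both function names are hypothetical.

```python
import ast
import re

ALLOWED_NUMBERS = {0, 1, 2}  # assumption: small literals are rarely "magic"
MARKER = re.compile(r"#.*\b(TODO|FIXME)\b")


def magic_numbers(source: str) -> list[tuple[int, float]]:
    """Return (line, value) for numeric literals not bound to an UPPER_CASE name."""
    tree = ast.parse(source)
    named = set()
    for node in ast.walk(tree):
        # Assignments such as MAX_RETRIES = 5 count as named constants.
        if isinstance(node, ast.Assign) and all(
            isinstance(target, ast.Name) and target.id.isupper()
            for target in node.targets
        ):
            named.update(id(child) for child in ast.walk(node.value))
    hits = []
    for node in ast.walk(tree):
        if (
            isinstance(node, ast.Constant)
            and isinstance(node.value, (int, float))
            and not isinstance(node.value, bool)
            and id(node) not in named
            and node.value not in ALLOWED_NUMBERS
        ):
            hits.append((node.lineno, node.value))
    return sorted(hits)


def todo_markers(source: str) -> list[tuple[int, str]]:
    """Return (line, marker) for TODO/FIXME comments."""
    found = []
    for lineno, line in enumerate(source.splitlines(), start=1):
        match = MARKER.search(line)
        if match:
            found.append((lineno, match.group(1)))
    return found
```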
Step 5: UX and Accessibility (if frontend exists)
Evaluate user experience and accessibility compliance.
- Check ARIA roles, labels, and landmarks on interactive elements
- Verify keyboard navigation — can all interactive elements be reached via Tab?
- Test focus management — does focus move logically when panels open/close?
- Check responsive design — test at common breakpoints (320px, 768px, 1024px)
- Verify color contrast ratios meet WCAG 2.1 AA standards
- Check screen reader compatibility — are dynamic content changes announced?
- Rate each finding: CRITICAL, HIGH, MEDIUM, or LOW
Got: A list of UX/a11y findings with WCAG references where applicable. If no frontend exists, this step produces "N/A — no frontend code detected."
If fail: If frontend code exists but cannot be rendered (missing build step), audit the source code statically and note that runtime testing was not possible.
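The color contrast check can be computed directly from the WCAG 2.1 definitions of relative luminance and contrast ratio. The sketch below assumes colors are given as six-digit hex strings. The function names are illustrative, and the AA thresholds used are 4.5:1 for normal text and 3:1 for large text.

```python
def _linear(channel: int) -> float:
    """Convert an 8-bit sRGB channel to its linear value (WCAG 2.1 formula)."""
    c = channel / 255
    return c / 12.92 if c <= 0.03928 else ((c + 0.055) / 1.055) ** 2.4


def relative_luminance(hex_color: str) -> float:
    """Relative luminance of a color given as '#rrggbb' or 'rrggbb'."""
    h = hex_color.lstrip("#")
    r, g, b = (int(h[i:i + 2], 16) for i in (0, 2, 4))
    return 0.2126 * _linear(r) + 0.7152 * _linear(g) + 0.0722 * _linear(b)


def contrast_ratio(foreground: str, background: str) -> float:
    """Contrast ratio (lighter + 0.05) / (darker + 0.05), between 1 and 21."""
    lighter, darker = sorted(
        (relative_luminance(foreground), relative_luminance(background)),
        reverse=True,
    )
    return (lighter + 0.05) / (darker + 0.05)


def meets_aa(foreground: str, background: str, large_text: bool = False) -> bool:
    """True if the pair meets the WCAG 2.1 AA contrast minimum."""
    return contrast_ratio(foreground, background) >= (3.0 if large_text else 4.5)
```

For example, `#767676` on white passes AA for normal text, while the slightly lighter `#777777` passes only for large text.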
Step 6: Findings Synthesis
Compile all findings into a prioritized summary.
- Merge findings from all phases into a single table
- Sort by severity (CRITICAL first, then HIGH, MEDIUM, LOW)
- Within each severity level, group by theme (security, architecture, quality, UX)
- For each finding, include: severity, phase, file(s), one-line description, suggested fix
- Produce a recommended fix order that considers dependencies between fixes
- Summarize: total findings by severity, top 3 priorities, estimated effort level
Got: A findings table with columns: #, Severity, Phase, File(s), Finding, Fix. A fix-order recommendation that accounts for dependencies (e.g., "refactor architecture before adding tests").
If fail: If no findings were produced, this is itself a finding — either the codebase is exceptionally clean or the review was too shallow. Re-examine at least one phase with deeper inspection.
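The synthesis step might be sketched as follows: sort by severity, group by phase within each level, render the table, and derive a fix order from declared dependencies. The `Finding` class, the phase ordering, and the `depends_on` mapping are assumptions for illustration. `graphlib.TopologicalSorter` is part of the Python standard library (3.9 and later).

```python
from collections import Counter
from dataclasses import dataclass
from graphlib import TopologicalSorter

SEVERITY_ORDER = {"CRITICAL": 0, "HIGH": 1, "MEDIUM": 2, "LOW": 3}
PHASE_ORDER = {"security": 0, "architecture": 1, "quality": 2, "ux": 3}


@dataclass
class Finding:
    severity: str
    phase: str
    files: str
    finding: str
    fix: str


def synthesize(findings: list[Finding]) -> tuple[str, str]:
    """Return (markdown table, summary of counts by severity)."""
    ordered = sorted(
        findings,
        key=lambda f: (SEVERITY_ORDER[f.severity], PHASE_ORDER[f.phase]),
    )
    rows = [
        "| # | Severity | Phase | File(s) | Finding | Fix |",
        "|---|---|---|---|---|---|",
    ]
    for number, f in enumerate(ordered, start=1):
        rows.append(
            f"| {number} | {f.severity} | {f.phase} | {f.files} | {f.finding} | {f.fix} |"
        )
    counts = Counter(f.severity for f in ordered)
    summary = ", ".join(f"{counts.get(level, 0)} {level}" for level in SEVERITY_ORDER)
    return "\n".join(rows), summary


def fix_order(depends_on: dict[int, set[int]]) -> list[int]:
    """Order finding numbers so that prerequisites come first.

    depends_on maps a finding number to the findings that must be fixed before it.
    """
    return list(TopologicalSorter(depends_on).static_order())
```

A cycle in `depends_on` makes `static_order` raise `graphlib.CycleError`, which is a useful signal that the recorded dependencies need another look.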
Validation
- All requested phases were completed (or explicitly skipped with justification)
- Every finding has a severity rating (CRITICAL/HIGH/MEDIUM/LOW)
- Every finding references at least one file or directory
- The findings table is sorted by severity
- Fix-order recommendations account for dependencies between findings
- The summary includes total counts by severity
- If output_format includes report, narrative sections accompany the table
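Several of these checks are mechanical and could be automated. The sketch below assumes findings are plain dictionaries with `severity` and `files` keys, a representation chosen only for this example. It covers the severity, file-reference, and sort-order checks and leaves the remaining items to the reviewer.

```python
SEVERITY_ORDER = {"CRITICAL": 0, "HIGH": 1, "MEDIUM": 2, "LOW": 3}


def validate(findings: list[dict]) -> list[str]:
    """Return a list of validation problems; an empty list means all checks passed."""
    problems = []
    for number, finding in enumerate(findings, start=1):
        if finding.get("severity") not in SEVERITY_ORDER:
            problems.append(f"finding {number}: missing or invalid severity")
        if not finding.get("files"):
            problems.append(f"finding {number}: no file or directory referenced")
    # Only findings with a valid severity take part in the sort-order check.
    ranks = [
        SEVERITY_ORDER[f["severity"]]
        for f in findings
        if f.get("severity") in SEVERITY_ORDER
    ]
    if ranks != sorted(ranks):
        problems.append("findings table is not sorted by severity")
    return problems
```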
Scaling with Rest
Between review phases, use /rest as a checkpoint — especially between phases 2-5, which require different analytical perspectives. A checkpoint rest (brief, transitional) prevents the momentum of one phase from biasing the next. See the rest skill's "Scaling Rest" section for guidance on checkpoint vs full rest.
Pitfalls
- Boiling the ocean: Reviewing every line of a large codebase produces noise. Focus on high-impact areas: entry points, security boundaries, and architectural seams
- Severity inflation: Not every finding is CRITICAL. Reserve CRITICAL for exploitable vulnerabilities and data-loss risks. Most architectural issues are MEDIUM
- Missing the forest for the trees: Individual code quality issues matter less than systemic patterns. If magic numbers appear in 20 files, that is one architectural finding, not 20 quality findings
- Skipping the census: The census (Step 1) seems bureaucratic, but it prevents reviewing code that does not exist and overlooking entire directories
- Phase bleed: Security findings surface during the architecture review, or quality findings during the security audit. Record them under the correct phase rather than mixing concerns; this produces a cleaner findings table
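The "forest for the trees" pitfall can be handled mechanically by collapsing an issue that recurs across many files into one systemic finding. In this sketch the `collapse_systemic` helper and its threshold of 5 files are assumptions chosen for illustration; the source gives only the 20-file example.

```python
from collections import defaultdict


def collapse_systemic(
    raw: list[tuple[str, str]], threshold: int = 5
) -> list[dict]:
    """Merge (issue, file) pairs: an issue seen in `threshold` or more files
    becomes one architecture finding; rarer issues stay per-file quality findings."""
    files_by_issue: dict[str, list[str]] = defaultdict(list)
    for issue, path in raw:
        files_by_issue[issue].append(path)

    findings = []
    for issue, files in files_by_issue.items():
        if len(files) >= threshold:
            findings.append({
                "phase": "architecture",
                "finding": f"Systemic pattern: {issue} in {len(files)} files",
                "files": files,
            })
        else:
            findings.extend(
                {"phase": "quality", "finding": issue, "files": [path]}
                for path in files
            )
    return findings
```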
Related Skills
- security-audit-codebase: deep-dive security audit when the review-codebase security phase reveals complex vulnerabilities
- review-software-architecture: detailed architecture review for specific subsystems
- review-ux-ui: comprehensive UX/accessibility audit beyond what phase 5 covers
- review-pull-request: diff-scoped review for individual changes
- clean-codebase: implements the code quality fixes identified by this review
- create-github-issues: converts findings table into tracked GitHub issues
