SKILL·D0A93B

analyze-codebase-workflow

Name: analyze-codebase-workflow
Author: pjt222

pjt222

Actualizado 1 month ago

9 vistas

Diseñowordautomationdata

Acerca de

Esta habilidad analiza automáticamente bases de código para detectar flujos de trabajo, canalizaciones de datos y dependencias de archivos utilizando el motor `put_auto()` de putior. Genera un plan de anotación que mapea patrones de E/S en más de 30 lenguajes, ideal para incorporarse a proyectos desconocidos o iniciar la integración con putior. Úsela para comprender el flujo de datos, auditar canalizaciones o preparar la anotación de archivos fuente.

Instalación rápida

Claude Code

Recomendado

Principal

npx skills add pjt222/agent-almanac -a claude-code

Comando PluginAlternativo

/plugin add https://github.com/pjt222/agent-almanac

Git CloneAlternativo

git clone https://github.com/pjt222/agent-almanac.git ~/.claude/skills/analyze-codebase-workflow

Copia y pega este comando en Claude Code para instalar esta habilidad

Documentación

Analyze Codebase Workflow

Survey arbitrary repository. Auto-detect data flows, file I/O, script dependencies. Produce structured annotation plan for manual refinement.

When Use

Onboarding onto unfamiliar codebase, need to understand data flow
Starting putior integration in project with no PUT annotations yet
Auditing existing project's data pipeline before documentation
Preparing annotation plan before running annotate-source-files

Inputs

Required: Path to repository or source directory to analyze
Optional: Specific subdirectories to focus on (default: entire repo)
Optional: Languages to include or exclude (default: all detected)
Optional: Detection scope: inputs only, outputs only, or both (default: both + dependencies)

Steps

Step 1: Survey Repository Structure

Identify source files and their languages. Understand what putior can analyze.

library(putior)

# List all supported languages and their extensions
list_supported_languages()
list_supported_languages(detection_only = TRUE)  # Only languages with auto-detection

# Get supported extensions
exts <- get_supported_extensions()

Use file listing to understand repo composition:

# Count files by extension in the target directory
find /path/to/repo -type f | sed 's/.*\.//' | sort | uniq -c | sort -rn | head -20

Got: List of file extensions present in repo, with counts. Map against get_supported_extensions() to know coverage.

If fail: Repo has no files matching supported extensions? Putior cannot auto-detect workflows. Consider whether language is supported but files use non-standard extensions.

Step 2: Check Language Detection Coverage

For each detected language, verify auto-detection pattern availability.

# Check which languages have auto-detection patterns (18 languages, 902 patterns)
detection_langs <- list_supported_languages(detection_only = TRUE)
cat("Languages with auto-detection:\n")
print(detection_langs)

# Get pattern counts for specific languages found in the repo
for (lang in c("r", "python", "javascript", "sql", "dockerfile", "makefile")) {
  patterns <- get_detection_patterns(lang)
  cat(sprintf("%s: %d input, %d output, %d dependency patterns\n",
    lang,
    length(patterns$input),
    length(patterns$output),
    length(patterns$dependency)
  ))
}

Got: Pattern counts printed for each language. R has 124 patterns, Python 159, JavaScript 71, etc.

If fail: Language returns no patterns? Supports manual annotations but not auto-detection. Plan to annotate those files manually.

Step 3: Run Auto-Detection

Execute put_auto() on target directory to discover workflow elements.

# Full auto-detection
workflow <- put_auto("./src/",
  detect_inputs = TRUE,
  detect_outputs = TRUE,
  detect_dependencies = TRUE
)

# Exclude build scripts and test helpers from scanning
workflow <- put_auto("./src/",
  detect_inputs = TRUE,
  detect_outputs = TRUE,
  detect_dependencies = TRUE,
  exclude = c("build-", "test_helper")
)

# View detected workflow nodes
print(workflow)

# Check node count
cat(sprintf("Detected %d workflow nodes\n", nrow(workflow)))

For large repos, analyze subdirectories incrementally:

# Analyze specific subdirectories
etl_workflow <- put_auto("./src/etl/")
api_workflow <- put_auto("./src/api/")

Got: Data frame with columns including id, label, input, output, source_file. Each row represents detected workflow step.

If fail: Result empty? Source files may not contain recognizable I/O patterns. Try enabling debug logging: workflow <- put_auto("./src/", log_level = "DEBUG") to see which files scanned and which patterns match.

Step 4: Generate Initial Diagram

Visualize auto-detected workflow. Assess coverage and identify gaps.

# Generate diagram from auto-detected workflow
cat(put_diagram(workflow, theme = "github"))

# With source file info for traceability
cat(put_diagram(workflow, show_source_info = TRUE))

# Save to file for review
writeLines(put_diagram(workflow, theme = "github"), "workflow-auto.md")

Got: Mermaid flowchart showing detected nodes connected by data flow edges. Nodes labeled with meaningful function/file names.

If fail: Diagram shows disconnected nodes? Auto-detection found I/O patterns but couldn't infer connections. Normal — connections derived from matching output filenames to input filenames. Annotation plan (next step) addresses gaps.

Step 5: Produce Annotation Plan

Generate structured plan documenting what found and what needs manual annotation.

# Generate annotation suggestions
put_generate("./src/", style = "single")

# For multiline style (more readable for complex workflows)
put_generate("./src/", style = "multiline")

# Copy suggestions to clipboard for easy pasting
put_generate("./src/", output = "clipboard")

Document plan with coverage assessment:

## Annotation Plan

### Auto-Detected (no manual work needed)
- `src/etl/extract.R` — 3 inputs, 2 outputs detected
- `src/etl/transform.py` — 1 input, 1 output detected

### Needs Manual Annotation
- `src/api/handler.js` — Language supported but no I/O patterns matched
- `src/config/setup.sh` — Only 12 shell patterns; complex logic missed

### Not Supported
- `src/legacy/process.f90` — Fortran not in detection languages

### Recommended Connections
- extract.R output `data.csv` → transform.py input `data.csv` (auto-linked)
- transform.py output `clean.parquet` → load.R input (needs annotation)

Got: Clear plan separating auto-detected files from those needing manual annotation. Specific recommendations for each file.

If fail: put_generate() produces no output? Ensure directory path correct and contains source files in supported languages.

Checks

put_auto() executes without errors on target directory
Detected workflow has at least one node (unless repo has no recognizable I/O)
put_diagram() produces valid Mermaid code from auto-detected workflow
put_generate() produces annotation suggestions for files with detected patterns
Annotation plan document created with coverage assessment

Pitfalls

Scanning too broadly: Running put_auto(".") on repo root may include node_modules/, .git/, venv/, etc. Target specific source directories.
Expecting full coverage: Auto-detection finds file I/O and library calls, not business logic. 40-60% coverage rate typical; rest needs manual annotation.
Ignoring dependencies: detect_dependencies = TRUE flag catches source(), import, require() calls that link scripts together. Disabling it loses cross-file connections.
Language mismatch: Files with non-standard extensions (e.g., .R vs .r, .jsx vs .js) may not be detected. Use get_comment_prefix() to check if extension recognized. Note extensionless files like Dockerfile and Makefile supported via exact filename matching.
Large repos: For repos with 100+ source files, analyze by module/directory to keep diagrams readable.

Repositorio GitHub

pjt222/agent-almanac

Ruta: i18n/caveman/skills/analyze-codebase-workflow

agentsagentskillsai-assisted-developmentclaude-codeskillsteams

FAQ

Frequently asked questions

What is the analyze-codebase-workflow skill?

analyze-codebase-workflow is a Claude Skill by pjt222. Skills package instructions and resources that Claude loads on demand, so Claude can perform analyze-codebase-workflow-related tasks without extra prompting.

How do I install analyze-codebase-workflow?

Use the install commands on this page: add analyze-codebase-workflow to Claude Code as a plugin, or clone its repository into your skills directory, then restart Claude so it picks up the skill.

What category does analyze-codebase-workflow belong to?

analyze-codebase-workflow is in the Design category, tagged word, automation and data.

Is analyze-codebase-workflow free to use?

Yes. analyze-codebase-workflow is listed on AIMCP and free to install. It runs inside Claude, so no separate service account is required to use the skill itself.

Habilidades relacionadas

executing-plans

Diseño

Utilice la habilidad executing-plans cuando tenga un plan de implementación completo para ejecutar en lotes controlados con puntos de revisión. Esta habilidad carga y revisa críticamente el plan, luego ejecuta tareas en pequeños lotes (por defecto 3 tareas) mientras reporta el progreso entre cada lote para la revisión del arquitecto. Esto asegura una implementación sistemática con puntos de control de calidad integrados.

Ver habilidad

requesting-code-review

Diseño

Esta habilidad despacha un subagente revisor de código para analizar los cambios en el código frente a los requisitos antes de proceder. Debe usarse después de completar tareas, implementar funciones principales o antes de fusionar con la rama principal. La revisión ayuda a detectar problemas de forma temprana al comparar la implementación actual con el plan original.

Ver habilidad

connect-mcp-server

Diseño

Esta habilidad proporciona una guía integral para que los desarrolladores conecten servidores MCP a Claude Code mediante transportes HTTP, stdio o SSE. Cubre la instalación, configuración, autenticación y seguridad para integrar servicios externos como GitHub, Notion y APIs personalizadas. Úsala al configurar integraciones MCP, al configurar herramientas externas o al trabajar con el Protocolo de Contexto del Modelo de Claude.

Ver habilidad

web-cli-teleport

Diseño

Esta habilidad ayuda a los desarrolladores a elegir entre las interfaces web y CLI de Claude Code mediante el análisis de tareas, y luego permite la teletransportación fluida de sesiones entre estos entornos. Optimiza el flujo de trabajo gestionando el estado y el contexto de la sesión al cambiar entre web, CLI o móvil. Úsala para proyectos complejos que requieren diferentes herramientas en varias etapas.

Ver habilidad

analyze-codebase-workflow

Acerca de

Instalación rápida

Claude Code

Documentación

Analyze Codebase Workflow

When Use

Inputs

Steps

Step 1: Survey Repository Structure

Step 2: Check Language Detection Coverage

Step 3: Run Auto-Detection

Step 4: Generate Initial Diagram

Step 5: Produce Annotation Plan

Checks

Pitfalls

See Also

Repositorio GitHub

Frequently asked questions

What is the analyze-codebase-workflow skill?

How do I install analyze-codebase-workflow?

What category does analyze-codebase-workflow belong to?

Is analyze-codebase-workflow free to use?

Habilidades relacionadas