backprop

JuliusBrussee

Actualizado 5 days ago

1,004

Pruebasaitesting

Acerca de

La habilidad `backprop` convierte errores en mejoras de especificaciones al analizar fallos y agregar nuevos invariantes (§V) para prevenir su recurrencia. Se activa ante fallos en pruebas, reportes de errores o incidentes, asegurando que la especificación se actualice junto con la corrección del código. Esto constituye un diferenciador fundamental respecto a simplemente corregir código, ya que fortalece sistemáticamente la especificación para detectar categorías completas de errores futuros.

Instalación rápida

Claude Code

Recomendado

Principal

npx skills add JuliusBrussee/blueprint -a claude-code

Comando PluginAlternativo

/plugin add https://github.com/JuliusBrussee/blueprint

Git CloneAlternativo

git clone https://github.com/JuliusBrussee/blueprint.git ~/.claude/skills/backprop

Copia y pega este comando en Claude Code para instalar esta habilidad

Documentación

backprop — bug → spec

Plan-then-execute fixes the code & forgets. SDD fixes the code AND edits spec so recurrence is impossible. That edit is backprop.

WHEN TO BACKPROP

Test failed at /build verification.
User reports bug.
Post-mortem after production incident.
/check flags VIOLATE with root cause found.

SIX STEPS

1. TRACE

Read failure output / bug report. Find exact file:line of wrong behavior. Name root cause in one caveman sentence.

2. ANALYZE

Ask three questions:

Would a new §V invariant catch this class of bug? (most common: yes)
Is §I wrong — did spec claim shape the code cannot deliver? (sometimes)
Is §T wrong — did we build the wrong thing? (rare but real)

3. PROPOSE

Draft the spec change. Never skip §B; §V/§I/§T are case-by-case.

Template:

§B row: B<next>|<date>|<root cause>|V<N>
§V line: V<next>: <testable rule that would have caught it>

Example:

§B row: B3|2026-04-20|refund job ran twice on retry|V7
§V line: V7: ∀ refund → idempotency key check before charge reversal

4. GENERATE TEST

New invariant without test = lie. Add failing test first. Name test so it cites the invariant: TestV7_RefundIdempotent.

5. VERIFY

Fix code. Run test. Must pass. Run full suite. Must not regress.

6. LOG

Commit spec edit + test + code fix together. Commit msg: backprop §B.<n> + §V.<N>: <one-line cause>.

WHAT MAKES A GOOD INVARIANT

Testable in code (grep-able or assert-able).
Scoped to a behavior, not a file.
Stated positively when possible (! hold over ⊥ forbid).
References §I surface where it applies.

Bad: V8: code should be correct. Good: V8: ∀ pg_query ! params interpolated via driver, ⊥ string concat.

WHEN NOT TO ADD §V

Bug was purely mechanical typo with no class (i++ vs i-- in throwaway).
Fix is a one-time migration.
Root cause is external dep (upgrade deps instead, note in §C).

Still append §B entry — record that this failure mode was considered. Future bug with same smell → §B search shows precedent.

OUTPUT SHAPE

Every backprop run produces:

§B entry (always).
§V entry (usually).
Test file (when §V added).
Code fix.
One commit.

No dashboards. No log files. SPEC.md + git is the full history.

Repositorio GitHub

JuliusBrussee/blueprint

Ruta: skills/backprop

claude-codeparallel-agentsskillsspec-driven-developmenttest-driven-development

Habilidades relacionadas

evaluating-llms-harness

Pruebas

Esta Skill de Claude ejecuta el benchmark lm-evaluation-harness para evaluar modelos de lenguaje en más de 60 tareas académicas estandarizadas como MMLU y GSM8K. Está diseñada para que los desarrolladores comparen la calidad de los modelos, realicen seguimiento del progreso del entrenamiento o reporten resultados académicos. La herramienta admite varios backends, incluidos modelos de HuggingFace y vLLM.

Ver habilidad

cloudflare-cron-triggers

Pruebas

Esta habilidad proporciona conocimiento integral para implementar Cron Triggers de Cloudflare y programar Workers mediante expresiones cron. Cubre la configuración de tareas periódicas, trabajos de mantenimiento y flujos de trabajo automatizados, manejando problemas comunes como expresiones cron inválidas y inconvenientes de zonas horarias. Los desarrolladores pueden utilizarla para configurar manejadores programados, probar activadores cron e integrar con Workflows y Green Compute.

Ver habilidad

webapp-testing

Pruebas

Esta habilidad de Claude proporciona un kit de herramientas basado en Playwright para probar aplicaciones web locales mediante scripts de Python. Permite verificación de frontend, depuración de interfaz de usuario, captura de pantallas y visualización de registros, mientras gestiona los ciclos de vida del servidor. Úsela para tareas de automatización de navegadores, pero ejecute los scripts directamente en lugar de leer su código fuente para evitar contaminación del contexto.

Ver habilidad

finishing-a-development-branch

Pruebas

Esta habilidad ayuda a los desarrolladores a completar el trabajo terminado verificando que las pruebas pasen y luego presentando opciones estructuradas de integración. Guía el flujo de trabajo para fusionar, crear PRs o limpiar ramas después de que se completa la implementación. Úsala cuando tu código esté listo y probado para finalizar sistemáticamente el proceso de desarrollo.

Ver habilidad