SKILL·D31069

pre-mortem

Name: pre-mortem
Author: avelikiy

avelikiy

Updated 1 month ago

9 views

Metaaidesign

About

The pre-mortem skill helps developers identify concrete risks before implementation by imagining a project has already failed catastrophically and working backwards to find likely causes. It's used at planning stages—like architecture review or threat modeling—for irreversible or high-impact features. This method forces specific risk identification beyond generic lists, using a structured 5-step process.

Quick Install

Claude Code

Recommended

Primary

npx skills add avelikiy/great_cto -a claude-code

Plugin CommandAlternative

/plugin add https://github.com/avelikiy/great_cto

Git CloneAlternative

git clone https://github.com/avelikiy/great_cto.git ~/.claude/skills/pre-mortem

Copy and paste this command in Claude Code to install this skill

Documentation

Pre-mortem — fail-it-before-you-build-it

A retrospective for a project that hasn't happened yet. Surfaces real risks that "list every risk" prompts miss.

Originated in Gary Klein's research at MIT Sloan, now standard at AWS and other ops-mature orgs.

The 5-step pre-mortem

Step 1. Imagine you're 6 months in the future

The project shipped. It is a clear, public failure. There's a Reddit thread about it. The CEO is asking what went wrong.

Step 2. Write the post-mortem newspaper headline

One sentence. Concrete. Specific. Examples:

❌ Bad: "We had some quality issues."
✅ Good: "On 2026-09-12, the Stripe webhook handler deduplicated by raw body hash, so 30K customers were double-charged after Stripe retried delivery during a network blip."

The headline forces you to name the failure mode SPECIFICALLY.

Step 3. List every individual reason this exact failure happened

Brainstorm 10-15 reasons. Be specific. Each item should reference:

A real component / file
A real failure mode (race condition, schema mismatch, expired credential)
A real human factor (oncall didn't see alert, runbook was outdated)

Reject hand-waves like "testing was insufficient." Replace with "we didn't write a property-based test for the dedup-key collision case."

Step 4. Rank by likelihood × severity

For each cause, score:

Likelihood: 1-5 (1=once-in-a-decade, 5=monthly)
Severity: 1-5 (1=cosmetic, 5=data loss / regulatory breach)
Risk score: likelihood × severity

Top 3 by risk score → these are your highest-priority mitigations.

Step 5. For each top-3 cause, write a guardrail in the plan

Each guardrail is a concrete change to the plan:

A test that would have caught it
A circuit breaker / feature flag
A runbook entry
A monitoring alert with specific SLO

If a top-3 cause CANNOT be mitigated within the time/budget, escalate to the user: "This plan accepts the risk of X with no mitigation."

Template — add to PLAN-*.md

## Pre-mortem

Six months from now, this project failed. Headline:

> <one-sentence failure headline>

### Top reasons (likelihood × severity)

| Cause | L | S | Risk | Mitigation in plan |
|---|---|---|---|---|
| <specific cause> | 4 | 5 | 20 | <Task #N: write idempotency test> |
| ... | | | | |

### Accepted risks (no mitigation)

- <risk> — accepted because <budget/scope reason>. Owner: <name>.

Common failure modes by archetype

Quick start — most-common pre-mortem causes per archetype:

Archetype	Common failure
fintech / commerce	Idempotency-key collision; double-charge during retry storm
healthcare	PHI leak via debug log; BAA not signed with vendor
web3	Oracle staleness; flash-loan exploit on bonding curve
mlops	Training/serving skew; model drift undetected
iot-embedded	OTA bricks devices in a region with no recovery path
data-platform	Late-arriving data overwrites correct values
ai-system / agent-product	Prompt injection exfiltrates other users' data
enterprise-saas	Cross-tenant data leak via RLS gap
cli-tool	Destructive flag with no confirmation (rm -rf equivalent)
library	Breaking change in minor version bump

Anti-patterns in pre-mortems

❌ Vague risks. "Performance might be a problem." Be specific: which operation, at what load, what's the SLO.

❌ Cosmic risks. "AWS could go down." Yes, but that's not actionable. Focus on what you can mitigate.

❌ Defensive list. Listing risks you've already mitigated to look thorough. Only list risks the current plan does NOT yet address.

❌ Skip the headline. Without the headline, the team won't believe the failure scenario is real.

When to skip

nano project_size — pre-mortem is overhead.
Pure refactor with full test coverage — guardrails already exist.
Bug-fix with one-line repro — risk is well-bounded.

GitHub Repository

avelikiy/great_cto

Path: skills/pre-mortem

agentic-codingclaude-code-pluginclaude-code-skillsclaude-code-subagentscode-reviewcto

FAQ

Frequently asked questions

What is the pre-mortem skill?

pre-mortem is a Claude Skill by avelikiy. Skills package instructions and resources that Claude loads on demand, so Claude can perform pre-mortem-related tasks without extra prompting.

How do I install pre-mortem?

Use the install commands on this page: add pre-mortem to Claude Code as a plugin, or clone its repository into your skills directory, then restart Claude so it picks up the skill.

What category does pre-mortem belong to?

pre-mortem is in the Meta category, tagged ai and design.

Is pre-mortem free to use?

Yes. pre-mortem is listed on AIMCP and free to install. It runs inside Claude, so no separate service account is required to use the skill itself.

Related Skills

content-collections

Meta

This skill provides a production-tested setup for Content Collections, a TypeScript-first tool that transforms Markdown/MDX files into type-safe data collections with Zod validation. Use it when building blogs, documentation sites, or content-heavy Vite + React applications to ensure type safety and automatic content validation. It covers everything from Vite plugin configuration and MDX compilation to deployment optimization and schema validation.

View skill

polymarket

Meta

This skill enables developers to build applications with the Polymarket prediction markets platform, including API integration for trading and market data. It also provides real-time data streaming via WebSocket to monitor live trades and market activity. Use it for implementing trading strategies or creating tools that process live market updates.

View skill

creating-opencode-plugins

Meta

This skill helps developers create OpenCode plugins that hook into 25+ event types like commands, files, and LSP operations. It provides the plugin structure, event API specifications, and implementation patterns for JavaScript/TypeScript modules. Use it when you need to intercept, monitor, or extend the OpenCode AI assistant's lifecycle with custom event-driven logic.

View skill

sglang

Meta

SGLang is a high-performance LLM serving framework that specializes in fast, structured generation for JSON, regex, and agentic workflows using its RadixAttention prefix caching. It delivers significantly faster inference, especially for tasks with repeated prefixes, making it ideal for complex, structured outputs and multi-turn conversations. Choose SGLang over alternatives like vLLM when you need constrained decoding or are building applications with extensive prefix sharing.

View skill