返回技能列表

launch-runbook

rampstackco
更新于 2 days ago
4 次查看
239
27
239
在 GitHub 上查看
其他design

关于

This skill helps developers create and execute a structured launch plan for websites or products. It covers the entire go-live sequence, including pre-launch checks, DNS cutover, monitoring, and rollback procedures. Use it to coordinate launch day activities and build deployment checklists.

快速安装

Claude Code

推荐
主要方式
npx skills add rampstackco/claude-skills -a claude-code
插件命令备选方式
/plugin add https://github.com/rampstackco/claude-skills
Git 克隆备选方式
git clone https://github.com/rampstackco/claude-skills.git ~/.claude/skills/launch-runbook

在 Claude Code 中复制并粘贴此命令以安装该技能

技能文档

Launch Runbook

Plan and execute the launch of a website, product, or major release. The runbook is the document everyone uses on launch day. Stack-agnostic.

This skill is for the launch event. For pre-launch QA, use qa-testing. For post-launch incident handling, use incident-response.


When to use

  • Launching a new website or major redesign
  • Migrating from one platform to another
  • Releasing a major product or feature
  • Coordinating cross-team launches
  • Building a runbook for a recurring deploy

When NOT to use

  • Pre-launch testing (use qa-testing)
  • Post-launch incident response (use incident-response)
  • After-launch retrospective (use after-action-report)

Required inputs

  • The launch scope (what's being launched)
  • The launch window (date, time, duration)
  • The team (roles, on-call rotation)
  • The rollback criteria (when to abort)
  • The communication plan (who tells whom what, when)

The framework: 4 phases

A launch has four phases. The runbook covers all four.

Phase 1: Pre-launch (T-30 days to T-1 hour)

Verify everything is ready before the launch window.

T-30 days:

  • Final scope locked
  • Cross-team commitments confirmed
  • Pre-launch QA scheduled
  • Comms plan drafted

T-7 days:

  • Pre-launch QA complete
  • All critical and major issues resolved
  • Performance baseline measured
  • Rollback procedures documented and tested
  • DNS TTL lowered (if DNS change is part of launch)

T-1 day:

  • Final go/no-go meeting
  • Roles confirmed
  • Communication channels set up
  • Backup of current production state

T-1 hour:

  • Team assembled in shared communication channel
  • Tools and access verified
  • Final smoke test on staging

Phase 2: Cutover (T-0)

The actual launch. Sequenced steps with owners and verifications.

Standard cutover steps:

  1. Announce start to internal team
  2. Enable maintenance mode (if applicable)
  3. Run final database migrations (if applicable)
  4. Deploy code to production
  5. Verify deploy completed without errors
  6. Run smoke tests on production
  7. DNS cutover (if applicable)
  8. Verify DNS propagation
  9. Disable maintenance mode
  10. Run full smoke tests on production
  11. Announce launch to internal team
  12. Begin monitoring window

Each step has:

  • Owner
  • Pre-conditions
  • Action
  • Verification
  • Time estimate
  • Rollback procedure

Phase 3: Verification (T+0 to T+24 hours)

Confirm the launch is healthy.

Within first hour:

  • Critical user flows working (checkout, signup, login)
  • No spike in error rates
  • Performance within expected ranges
  • Analytics tracking firing
  • Email and notifications working

Within first 24 hours:

  • No regression in key business metrics
  • No accumulating error patterns
  • Core Web Vitals stable
  • Search Console showing no critical issues (if SEO-relevant)

Phase 4: Stabilization (T+24 hours to T+7 days)

Monitor the long tail.

  • Track error rates day over day
  • Track performance day over day
  • Track key business metrics vs baseline
  • Address any non-blocking issues identified
  • Plan the AAR (after-action report)

Roles and responsibilities

A launch has clear role assignments. Ambiguity here is the most common cause of launch chaos.

RoleResponsibility
Launch leadOwns the runbook. Calls go/no-go. Calls rollback.
Deploy operatorExecutes the technical deploy steps.
QA leadRuns verification tests and confirms each milestone.
Comms leadPosts internal updates, manages external messaging.
On-call engineerAvailable for issues during and after launch.
Stakeholder repApproves on behalf of business stakeholders.

For small teams, one person may fill multiple roles. Each role's responsibilities should still be explicit.


Rollback criteria

Define before the launch. Decisions are easier to make pre-emptively than under pressure.

Automatic rollback triggers:

  • Error rate exceeds X percent of normal
  • Critical user flow (defined) is broken
  • Database integrity issue
  • Security vulnerability discovered post-deploy

Discretionary rollback triggers:

  • Performance degradation beyond Y percent
  • Significant degradation in key business metric
  • Customer-facing error patterns

Decision authority: The launch lead calls rollback. Pre-define who acts as deputy if launch lead is unavailable.


Communication plan

Internal channels

  • Primary launch channel: Real-time chat for the launch team only
  • Status channel: Broader internal updates
  • War room: Optional video call for high-stakes launches

Update cadence during launch

  • Every 15 minutes during cutover
  • Every hour during verification phase
  • Daily during stabilization phase

External communication

  • Customer-facing announcement: Pre-drafted, scheduled to publish at confirmed-success milestone
  • Status page: Updated proactively if any user impact
  • Support team: Briefed in advance on what's launching, common questions, escalation path

Workflow

  1. Build the runbook 30 days out. Scope, sequence, roles, rollback criteria, comms plan.
  2. Test the rollback procedure. Untested rollback is hope, not procedure.
  3. Run a tabletop exercise. Walk through the runbook with the full team. Find gaps.
  4. Lower DNS TTL 48 to 72 hours before launch (if DNS change is part of launch).
  5. Day-of: Run the runbook step by step. Verify each step before moving to next.
  6. Monitor. First hour, first day, first week. Document anything noteworthy.
  7. Schedule the AAR within 1 to 2 weeks of launch.

Failure patterns

  • Runbook written by one person, not reviewed. Single perspective misses scenarios.
  • No tested rollback. Discovering rollback is broken at the moment you need it.
  • Vague step descriptions. "Deploy to production" without specifying which tool, which command, which environment.
  • No verification step after each action. Errors propagate.
  • Communication gaps. Team doesn't know launch is happening, or doesn't know it succeeded.
  • Launching at end of day Friday. Or before a holiday. Reduce the time available to respond.
  • Skipping pre-launch QA to hit a date. The bugs appear on launch day instead.
  • Launch fatigue. Long launches without breaks lead to errors. Plan rest cycles for multi-day launches.
  • No on-call for first 24 hours. Someone must be reachable.

Output format

Default output: a markdown runbook at launch-runbook-[project].md plus supporting checklists.

Structure:

  1. Launch metadata (what, when, who)
  2. Roles and responsibilities
  3. Pre-launch checklist (T-30, T-7, T-1, T-1hr)
  4. Cutover sequence (numbered steps, owners, verifications)
  5. Rollback procedure
  6. Rollback criteria (automatic and discretionary)
  7. Communication plan
  8. Verification checklist (first hour, first day)
  9. Stabilization plan (first week)
  10. Contacts (escalation paths, on-call)

Reference files

GitHub 仓库

rampstackco/claude-skills
路径: skills/launch-runbook
0
agent-skillsai-agentsanthropicclaudeclaude-aiclaude-code

相关推荐技能

media-asset-management

其他

该Skill用于规划和实施图像、视频及可下载资产的媒体处理管线。它帮助开发者设计存储与交付方案,选择现代格式(如WebP/AVIF)、设置响应式图像、选择视频托管服务,并优化缓慢或分散的资产工作流。适用于从构建品牌资产库到审计图像管道性能等多种媒体管理场景。

查看技能

monitoring-and-alerting

其他

该Skill帮助开发者设计和运行网站或应用的监控告警系统。它适用于设置可用性检查、定义SLO、配置错误跟踪、设计告警策略和值班轮换等场景。关键能力包括指导如何选择监控指标、避免告警疲劳,并在发生事件时识别监控缺口。

查看技能

after-action-report

其他

该Skill用于对已完成的发布、事件或项目进行结构化复盘,生成包含时间线、根因分析和可执行经验教训的总结报告。它能在用户提及复盘、回顾或经验总结等关键词时自动触发,特别适合在项目上线或事件解决后快速沉淀知识。其核心价值在于通过系统化分析产出可行动的改进建议,而非简单记录。

查看技能

security-baseline

其他

该Skill为网站和Web应用建立安全基线,适用于上线前安全审查、定期审计或新环境配置。它能指导HTTPS/TLS配置、安全头设置、密钥管理、CSP策略评估等基础安全加固工作。通过触发关键词(如OWASP、漏洞扫描)提供栈无关的标准化安全实践,帮助开发者满足合规要求并筑牢安全防线。

查看技能