scale-colony
关于
This skill provides strategies for scaling distributed systems and teams by applying colony-based patterns like budding and role differentiation. It helps recognize growth phases, implement fission protocols, and detect scaling limits when communication overhead or coordination failures increase. Use it proactively when systems that worked at a small scale begin to break down as they grow.
快速安装
Claude Code
推荐npx skills add pjt222/agent-almanac -a claude-code/plugin add https://github.com/pjt222/agent-almanacgit clone https://github.com/pjt222/agent-almanac.git ~/.claude/skills/scale-colony在 Claude Code 中复制并粘贴此命令以安装该技能
技能文档
Kolonie skalieren
Scale distributed systems, teams, or organizations durch colony budding (splitting), role differentiation (age polyethism), and growth-triggered architectural transitions — maintaining coordination quality as the colony grows beyond its initial design capacity.
Wann verwenden
- A team or system that worked at 10 agents is breaking down at 50
- Communication overhead is growing faster than productive output
- Coordination patterns that were implicit need to become explicit
- Planning a growth phase and wanting to scale proactively anstatt reactively
- Observing coordination failures that correlate with size (lost messages, duplicated work, unclear ownership)
- The existing system needs to split into semi-autonomous sub-colonies
Eingaben
- Erforderlich: Current colony size and target growth (or growth rate)
- Erforderlich: Current coordination mechanisms and their stress points
- Optional: Colony structure (flat, hierarchical, clustered)
- Optional: Role differentiation already in place
- Optional: Growth timeline and constraints
- Optional: Inter-colony coordination needs (if splitting)
Vorgehensweise
Schritt 1: Recognize the Growth Phase
Identifizieren which scaling phase the colony is in to apply appropriate strategies.
- Classify the current growth phase:
Colony Growth Phases:
┌───────────┬──────────────┬───────────────────────────────────────────┐
│ Phase │ Size Range │ Characteristics │
├───────────┼──────────────┼───────────────────────────────────────────┤
│ Founding │ 1-7 agents │ Everyone does everything, direct comms, │
│ │ │ implicit coordination, high agility │
├───────────┼──────────────┼───────────────────────────────────────────┤
│ Growth │ 8-30 agents │ Roles emerge, some specialization, comms │
│ │ │ overhead increases, need for structure │
├───────────┼──────────────┼───────────────────────────────────────────┤
│ Maturity │ 30-100 agents│ Formal roles, layered coordination, │
│ │ │ sub-groups form, inter-group coordination │
├───────────┼──────────────┼───────────────────────────────────────────┤
│ Fission │ 100+ agents │ Colony too large for single coordination │
│ │ │ framework, must bud into sub-colonies │
└───────────┴──────────────┴───────────────────────────────────────────┘
- Identifizieren growth stress signals:
- Communication overload: messages per agent per day increasing faster than colony size
- Decision latency: time from proposal to decision increasing
- Coordination failures: duplicated work, dropped tasks, conflicting actions increasing
- Knowledge dilution: new agents take longer to become productive
- Identity loss: agents can't describe the colony's purpose consistently
- Bestimmen if the colony is about to cross a phase boundary or has already crossed it
Erwartet: Clear identification of the current growth phase and the specific stress signals indicating the colony is approaching or has crossed a phase boundary.
Bei Fehler: If the phase isn't clear, measure three concrete metrics: communication volume per agent, decision latency, and coordination failure rate. Plot them over time. The inflection points reveal phase transitions. If metrics aren't available, the colony is likely in the Founding phase (where metrics aren't yet needed).
Schritt 2: Implementieren Role Differentiation (Age Polyethism)
Introduce progressive specialization where agents take on different roles basierend auf experience and colony needs.
- Definieren the role progression path:
- Newcomers: observation, learning, simple tasks (low autonomy, high guidance)
- Workers: standard task execution, signal following (moderate autonomy)
- Specialists: domain expertise, complex tasks, mentoring newcomers (high autonomy)
- Foragers/Scouts: exploration, innovation, external interface (see
forage-resources) - Coordinators: inter-group communication, conflict resolution, quorum management
- Implementieren role transitions:
- Transitions are triggered by experience thresholds, not appointment
- An agent that has completed a threshold number of tasks erfolgreich transitions to the next role (calibrate the threshold basierend auf task complexity and colony growth rate — e.g., 5-10 tasks for simple roles, 20-30 for specialist roles)
- Reverse transitions are possible (specialist returns to worker role in a new domain)
- The colony's role distribution adapts to current needs:
- Growing colony → more newcomer slots, active mentoring
- Stable colony → balanced distribution across all roles
- Threatened colony → more defenders, fewer scouts (see
defend-colony)
- Preserve role flexibility:
- No agent is permanently locked into a role
- Emergency protocols can temporarily reassign any agent to any role
- Cross-training ensures agents can cover adjacent roles
Erwartet: A role structure where agents naturally progress from simple to complex responsibilities, with the colony's role distribution reflecting its current needs and phase.
Bei Fehler: If role differentiation creates rigid silos, increase cross-training requirements and rotation frequency. If newcomers struggle to progress, the mentoring system is insufficient — pair each newcomer with a specialist for their first N tasks. If too many agents cluster in one role, the transition triggers are miscalibrated — adjust thresholds basierend auf colony-wide role demand.
Schritt 3: Restructure Coordination for Scale
Adapt the coordination mechanisms from coordinate-swarm to handle increased colony size.
- Replace direct communication with layered signaling:
- Founding phase: everyone talks to everyone (N×N communication)
- Growth phase: cluster into squads of 5-8; direct communication innerhalb squads, signal-based zwischen squads
- Maturity phase: squads form departments; intra-squad direct, inter-squad signal, inter-department broadcast
- Implementieren coordination layers:
- Local coordination: innerhalb a squad, direct signal exchange (stigmergy)
- Regional coordination: zwischen squads in the same department, aggregated signals
- Colony coordination: zwischen departments, broadcast signals only for colony-wide decisions
- Entwerfen inter-layer interfaces:
- Each squad has one designated communicator who aggregates and relays signals
- Communicators filter noise: not every local signal gets relayed upward
- Colony-wide broadcasts are rare and reserved for quorum decisions, alarm escalation, or major state changes
- Communication overhead budget:
- Target: each agent spends <20% of capacity on coordination
- Messen actual overhead; if it exceeds the budget, add another coordination layer or split the oversized squad
Erwartet: A layered coordination structure where communication overhead grows logarithmically (not linearly) with colony size. Local coordination is fast and direct; colony-wide coordination is slower but still functional.
Bei Fehler: If coordination layers create information bottlenecks (communicators become overloaded), add redundant communicators or reduce the relay frequency. If layers create isolation (squads don't know what other squads are doing), increase the inter-layer signal frequency or create cross-squad liaison roles.
Schritt 4: Ausfuehren Colony Budding (Fission)
Aufteilen the colony into semi-autonomous sub-colonies when it exceeds single-coordination capacity.
- Recognize fission triggers:
- Colony exceeds 100 agents (or the coordination layer count exceeds 3)
- Communication overhead exceeds 30% of agent capacity despite layering
- Decision latency exceeds acceptable thresholds for time-sensitive operations
- Subgroups have developed distinct identities and can operate independently
- Planen the fission:
- Identifizieren natural split lines (existing clusters, domain boundaries, geographic separation)
- Sicherstellen each daughter colony has a viable role distribution (can't split all specialists into one colony)
- Each daughter colony must have: mindestens one coordinator, sufficient workers, and access to shared resources
- Definieren the inter-colony interface: what information is shared, what is independent
- Ausfuehren the split:
- Announce the fission plan and timeline (consensus required — see
build-consensus) - Uebertragen agents to daughter colonies basierend auf existing cluster membership
- Establish inter-colony communication channels (lightweight, asynchronous)
- Each daughter colony bootstraps its own local coordination (inheriting patterns from the parent)
- Announce the fission plan and timeline (consensus required — see
- Post-fission stabilization:
- Ueberwachen each daughter colony for viability (can it sustain itself?)
- Inter-colony coordination sollte minimal (quarterly sync, not daily)
- If a daughter colony fails, reabsorb it into the nearest viable colony
Erwartet: Two or more viable daughter colonies, each operating semi-autonomously with their own coordination, connected by lightweight inter-colony interfaces.
Bei Fehler: If daughter colonies are too small to be viable, the fission was premature — remerge and try again at a larger size. If inter-colony coordination becomes as heavy as pre-fission single-colony coordination, the split lines were wrong — the colonies are too interdependent. Re-draw boundaries along natural independence lines.
Schritt 5: Ueberwachen Scaling Limits and Adapt
Continuously assess whether the current structure matches the colony's size and needs.
- Verfolgen scaling health metrics:
- Coordination overhead ratio: time spent coordinating / time spent producing
- Decision durchput: decisions per time unit (should increase or hold steady with growth)
- Agent satisfaction: engagement, retention, sense of purpose (drops when scaling fails)
- Error rate: coordination failures per time unit (should not increase linearly with growth)
- Identifizieren scaling limit indicators:
- Overhead ratio exceeding 25% → need more automation or another coordination layer
- Decision durchput declining → governance structure needs revision
- Agent turnover spiking → cultural or structural issues from scaling
- Error rate accelerating → coordination mechanisms are failing
- Ausloesen adaptation:
- Phase transition detected → apply the appropriate phase strategy from Step 1
- Scaling limit reached → escalate to the next structural intervention (role differentiation → coordination restructure → fission)
- External change (market shift, tech disruption) → may require colony transformation (see
adapt-architecture)
Erwartet: A colony that monitors its own scaling health and proactively adapts its structure vor scaling stress becomes scaling failure.
Bei Fehler: If scaling health metrics sind nicht available, the colony lacks observability — build measurement vor building more structure. If metrics show problems but the colony can't adapt, the resistance is cultural, not technical — address the human factors (fear of change, ownership attachment, trust deficits) vor restructuring.
Validierung
- Current growth phase is identified with specific stress signals
- Role differentiation is defined with progressive specialization
- Coordination is layered appropriately for colony size
- Communication overhead stays unter 20-25% of agent capacity
- Fission plan exists for when the colony exceeds single-coordination capacity
- Scaling health metrics are tracked and thresholds trigger adaptation
- Each daughter colony (post-fission) has viable role distribution
Haeufige Stolperfallen
- Scaling structure vor needed: Premature layering adds overhead ohne benefit. A 10-person team doesn't need department coordinators. Let stress signals guide structural changes
- Preserving founding culture at all costs: What worked at 5 agents won't work at 50. Scaling requires structural evolution; nostalgia for the founding phase prevents necessary adaptation
- Fission ohne independence: Splitting a colony into sub-colonies that still depend on each other for daily operations creates the worst of both worlds — overhead of coordination plus overhead of separation
- Uniform role distribution: Not every sub-colony needs the same role ratios. A research colony needs more scouts; a production colony needs more workers. Adapt role distribution to mission
- Ignoring remerge as an option: Sometimes fission fails and the best move is to remerge. Treating fission as irreversible prevents recovery from bad splits
Verwandte Skills
coordinate-swarm— foundational coordination patterns that this skill scalesforage-resources— foraging scales differently than production; role differentiation affects scout allocationbuild-consensus— consensus mechanisms must adapt for larger groupsdefend-colony— defense must scale with the colonyadapt-architecture— morphic skill for structural transformation, triggered by growth pressureplan-capacity— capacity planning for growth projectionsconduct-retrospective— retrospectives help identify scaling stress vor it becomes failure
GitHub 仓库
相关推荐技能
railway-docs
文档Railway Docs Skill可实时获取最新的Railway官方文档,确保回答的准确性。当开发者询问Railway功能特性、工作原理或分享docs.railway.com链接时,应优先使用此技能。它通过专门的LLM优化文档源提供最新信息,避免依赖过时记忆来回答技术问题。
n8n-code-python
文档该Skill为在n8n平台的Python代码节点中编写代码提供专家指导,特别适用于需要使用_input/_json/_node语法、Python标准库或了解n8n中Python限制的场景。它强调JavaScript应作为首选方案,仅当需要特定Python功能或对Python语法更熟悉时才使用Python。Skill提供了快速入门模板和关键注意事项,帮助开发者在n8n中高效编写Python代码。
archon
文档Archon Skill为开发者提供了基于RAG的语义搜索和项目任务管理功能,可通过REST API访问知识库。它支持文档搜索、网站爬取、文件上传和版本控制,适用于技术文档查询和项目管理场景。首次使用时需要配置Archon主机地址,建议在处理外部文档时优先使用该Skill。
n8n-code-javascript
文档这个Skill为n8n工作流中的JavaScript代码节点提供专业指导,涵盖数据处理、HTTP请求和日期操作等核心场景。它详细解释了如何正确使用n8n特有的`$input`/`$json`语法、`$helpers`工具以及DateTime对象,并包含关键的错误排查和模式选择建议。开发者通过该Skill能快速掌握Code节点的正确返回格式、数据访问方法和常见陷阱解决方案。
