scale-colony
정보
이 스킬은 출아 및 역할 분화와 같은 기법을 사용하여 생물 군집을 모델로 분산 시스템과 팀의 확장 전략을 제공합니다. 이는 성장 단계를 인식하고, 분리 프로토콜을 구현하며, 커뮤니케이션 오버헤드가 생산성을 압도할 때 확장 한계를 감지하는 데 도움을 줍니다. 10개의 에이전트에서는 작동하던 시스템이 50개에서 고장 나거나, 작업 중복과 같은 조정 실패에 직면했을 때 사전에 사용하세요.
빠른 설치
Claude Code
추천npx skills add pjt222/agent-almanac -a claude-code/plugin add https://github.com/pjt222/agent-almanacgit clone https://github.com/pjt222/agent-almanac.git ~/.claude/skills/scale-colonyClaude Code에서 이 명령을 복사하여 붙여넣어 스킬을 설치하세요
문서
Scale Colony
Scale distributed systems, teams, orgs through colony budding (split), role differentiation (age polyethism), and growth-triggered architectural transitions — keep coordination quality as colony grows past initial design.
When Use
- Team or system that worked at 10 agents breaks down at 50
- Comms overhead grows faster than productive output
- Coordination patterns implicit must become explicit
- Planning growth phase, want scale proactive not reactive
- Coordination failures correlate with size (lost messages, duplicated work, unclear ownership)
- Existing system needs split into semi-autonomous sub-colonies
Inputs
- Required: Current colony size + target growth (or growth rate)
- Required: Current coordination mechanisms + stress points
- Optional: Colony structure (flat, hierarchical, clustered)
- Optional: Role differentiation already in place
- Optional: Growth timeline + constraints
- Optional: Inter-colony coordination needs (if splitting)
Steps
Step 1: Recognize Growth Phase
Identify which scaling phase colony is in to apply right strategies.
- Classify current growth phase.
Colony Growth Phases:
┌───────────┬──────────────┬───────────────────────────────────────────┐
│ Phase │ Size Range │ Characteristics │
├───────────┼──────────────┼───────────────────────────────────────────┤
│ Founding │ 1-7 agents │ Everyone does everything, direct comms, │
│ │ │ implicit coordination, high agility │
├───────────┼──────────────┼───────────────────────────────────────────┤
│ Growth │ 8-30 agents │ Roles emerge, some specialization, comms │
│ │ │ overhead increases, need for structure │
├───────────┼──────────────┼───────────────────────────────────────────┤
│ Maturity │ 30-100 agents│ Formal roles, layered coordination, │
│ │ │ sub-groups form, inter-group coordination │
├───────────┼──────────────┼───────────────────────────────────────────┤
│ Fission │ 100+ agents │ Colony too large for single coordination │
│ │ │ framework, must bud into sub-colonies │
└───────────┴──────────────┴───────────────────────────────────────────┘
- Identify growth stress signals.
- Communication overload: messages per agent per day growing faster than colony size
- Decision latency: time from proposal to decision growing
- Coordination failures: duplicated work, dropped tasks, conflicting actions growing
- Knowledge dilution: new agents take longer to become productive
- Identity loss: agents cannot describe colony's purpose consistent
- Determine if colony is about to cross phase boundary or has crossed it
Got: Clear identification of current growth phase + specific stress signals showing colony approaching or crossed phase boundary.
If fail: Phase not clear? Measure three concrete metrics: comm volume per agent, decision latency, coordination failure rate. Plot over time. Inflection points = phase transitions. No metrics? Colony likely Founding phase (where metrics not yet needed).
Step 2: Implement Role Differentiation (Age Polyethism)
Introduce progressive specialization where agents take different roles by experience + colony needs.
- Define role progression path.
- Newcomers: observation, learning, simple tasks (low autonomy, high guidance)
- Workers: standard task execution, signal following (moderate autonomy)
- Specialists: domain expertise, complex tasks, mentor newcomers (high autonomy)
- Foragers/Scouts: exploration, innovation, external interface (see
forage-resources) - Coordinators: inter-group communication, conflict resolution, quorum management
- Implement role transitions.
- Triggered by experience thresholds, not appointment
- Agent that completed threshold of tasks transitions to next role (calibrate by complexity, growth — 5-10 for simple, 20-30 for specialist)
- Reverse transitions possible (specialist → worker in new domain)
- Colony's role distribution adapts to needs.
- Growing colony → more newcomer slots, active mentoring
- Stable colony → balanced distribution across all roles
- Threatened colony → more defenders, fewer scouts (see
defend-colony)
- Preserve role flexibility.
- No agent permanently locked into role
- Emergency protocols can temp reassign any agent to any role
- Cross-training ensures agents can cover adjacent roles
Got: Role structure where agents progress from simple to complex responsibilities, colony's role distribution reflects current needs + phase.
If fail: Role differentiation creates rigid silos? Up cross-training + rotation frequency. Newcomers struggle to progress? Mentoring insufficient — pair each newcomer with specialist for first N tasks. Too many cluster in one role? Transition triggers miscalibrated — adjust by colony-wide role demand.
Step 3: Restructure Coordination for Scale
Adapt coordination from coordinate-swarm to handle bigger colony.
- Replace direct comms with layered signaling.
- Founding: everyone talks to everyone (N×N comms)
- Growth: cluster into squads of 5-8; direct comms within squads, signal-based between
- Maturity: squads form departments; intra-squad direct, inter-squad signal, inter-department broadcast
- Implement coordination layers.
- Local coordination: within squad, direct signal exchange (stigmergy)
- Regional coordination: between squads in same department, aggregated signals
- Colony coordination: between departments, broadcast only for colony-wide decisions
- Design inter-layer interfaces.
- Each squad has one designated communicator who aggregates + relays signals
- Communicators filter noise: not every local signal relayed up
- Colony-wide broadcasts rare, reserved for quorum decisions, alarm escalation, major state changes
- Communication overhead budget.
- Target: each agent spends <20% of capacity on coordination
- Measure actual overhead; if exceeds budget, add another coordination layer or split oversized squad
Got: Layered coordination structure where comms overhead grows logarithmic (not linear) with colony size. Local coordination fast + direct; colony-wide slower but functional.
If fail: Coordination layers create info bottlenecks (communicators overloaded)? Add redundant communicators or reduce relay frequency. Layers create isolation (squads do not know what others do)? Up inter-layer signal frequency or create cross-squad liaison roles.
Step 4: Execute Colony Budding (Fission)
Split colony into semi-autonomous sub-colonies when exceeds single-coordination capacity.
- Recognize fission triggers.
- Colony exceeds 100 agents (or coordination layer count exceeds 3)
- Comms overhead exceeds 30% of agent capacity despite layering
- Decision latency exceeds acceptable thresholds for time-sensitive ops
- Subgroups have distinct identities, can operate independently
- Plan fission.
- Identify natural split lines (existing clusters, domain boundaries, geographic separation)
- Each daughter colony has viable role distribution (cannot split all specialists into one)
- Each daughter colony must have: at least one coordinator, sufficient workers, access to shared resources
- Define inter-colony interface: what info shared, what independent
- Execute split.
- Announce fission plan + timeline (consensus required — see
build-consensus) - Transfer agents to daughter colonies based on existing cluster membership
- Establish inter-colony comm channels (lightweight, async)
- Each daughter colony bootstraps own local coordination (inheriting patterns from parent)
- Announce fission plan + timeline (consensus required — see
- Post-fission stabilization.
- Monitor each daughter for viability (can sustain itself?)
- Inter-colony coordination minimal (quarterly sync, not daily)
- If daughter fails, reabsorb into nearest viable colony
Got: Two or more viable daughter colonies, each operating semi-autonomous with own coordination, connected by lightweight inter-colony interfaces.
If fail: Daughter colonies too small to be viable? Fission was premature — remerge, try again at larger size. Inter-colony coordination as heavy as pre-fission single-colony? Split lines wrong — colonies too interdependent. Re-draw boundaries along natural independence lines.
Step 5: Monitor Scaling Limits and Adapt
Continuous assess if current structure matches colony's size + needs.
- Track scaling health metrics.
- Coordination overhead ratio: time coordinating / time producing
- Decision throughput: decisions per time unit (should grow or hold steady with growth)
- Agent satisfaction: engagement, retention, sense of purpose (drops when scaling fails)
- Error rate: coordination failures per time unit (should not grow linear with growth)
- Identify scaling limit indicators.
- Overhead ratio exceeding 25% → need more automation or another coordination layer
- Decision throughput declining → governance structure needs revision
- Agent turnover spiking → cultural or structural issues from scaling
- Error rate accelerating → coordination mechanisms failing
- Trigger adaptation.
- Phase transition detected → apply right phase strategy from Step 1
- Scaling limit reached → escalate to next structural intervention (role differentiation → coordination restructure → fission)
- External change (market shift, tech disruption) → may require colony transformation (see
adapt-architecture)
Got: Colony that monitors own scaling health, proactively adapts structure before scaling stress becomes scaling failure.
If fail: Scaling health metrics not available? Colony lacks observability — build measurement before more structure. Metrics show problems but colony cannot adapt? Resistance cultural not technical — address human factors (fear of change, ownership attachment, trust deficits) before restructuring.
Checks
- Current growth phase identified with specific stress signals
- Role differentiation defined with progressive specialization
- Coordination layered for colony size
- Comms overhead stays below 20-25% of agent capacity
- Fission plan exists for when colony exceeds single-coordination capacity
- Scaling health metrics tracked, thresholds trigger adaptation
- Each daughter colony (post-fission) has viable role distribution
Pitfalls
- Scale structure before needed: Premature layering adds overhead without benefit. 10-person team does not need department coordinators. Let stress signals guide structural changes
- Preserve founding culture at all costs: What worked at 5 agents will not work at 50. Scaling needs structural evolution; nostalgia for founding phase prevents necessary adaptation
- Fission without independence: Splitting colony into sub-colonies that still depend on each other for daily ops = worst of both worlds — overhead of coordination + overhead of separation
- Uniform role distribution: Not every sub-colony needs same role ratios. Research colony needs more scouts; production needs more workers. Adapt role distribution to mission
- Ignore remerge as option: Sometimes fission fails, best move is remerge. Treat fission as irreversible = prevents recovery from bad splits
See Also
coordinate-swarm— foundational coordination patterns this skill scalesforage-resources— foraging scales different than production; role differentiation affects scout allocationbuild-consensus— consensus mechanisms must adapt for larger groupsdefend-colony— defense must scale with colonyadapt-architecture— morphic skill for structural transformation, triggered by growth pressureplan-capacity— capacity planning for growth projectionsconduct-retrospective— retrospectives help identify scaling stress before failure
GitHub 저장소
연관 스킬
railway-docs
문서이 스킬은 Railway의 기능, 작동 방식 또는 특정 문서 URL에 대한 질문에 답하기 위해 최신 Railway 문서를 가져옵니다. 개발자들이 Railway의 공식 소스로부터 정확하고 최신 정보를 직접 받을 수 있도록 보장합니다. 사용자가 Railway의 작동 방식을 묻거나 Railway 문서를 참조할 때 사용하세요.
n8n-code-python
문서이 Claude Skill은 n8n의 Code 노드에서 Python 코드를 작성할 때 전문적인 지침을 제공하며, 특히 Python 표준 라이브러리 사용과 n8n의 특수 구문인 `_input`, `_json`, `_node` 작업에 중점을 둡니다. 이는 개발자가 n8n 내에서 Python의 제한 사항을 이해하도록 돕고, 대부분의 워크플로에는 JavaScript 사용을 권장하면서도 특정 데이터 변환 요구사항에 대한 Python 솔루션을 제안합니다.
archon
문서Archon 스킬은 REST API를 통해 RAG 기반 시맨틱 검색과 프로젝트 관리를 제공합니다. 이 스킬을 사용하여 문서 검색, 계층적 프로젝트/태스크 관리, 문서 업로드 기능을 갖춘 지식 검색을 수행할 수 있습니다. 외부 문서를 검색할 때는 다른 소스를 사용하기 전에 항상 Archon을 최우선으로 활용하세요.
n8n-code-javascript
문서이 Claude Skill은 n8n의 Code 노드에서 JavaScript 코드 작성에 대한 전문적인 지침을 제공합니다. `$input`/`$json` 변수, HTTP 헬퍼, DateTime 처리와 같은 필수적인 n8n 특정 구문을 다루며 일반적인 오류를 해결합니다. Code 노드에서 사용자 정의 JavaScript 처리가 필요한 n8n 워크플로우를 개발할 때 활용하세요.
