observe-guidance

Name: observe-guidance
Author: pjt222

Updated 1 month ago

Category: Tests. Tags: ai, design

About

This skill guides users through systematic observation so they understand a system before acting on it. It teaches neutral data collection, pattern recognition, and structured reporting for debugging or research. Use it when users need to gather evidence before forming conclusions, or when they are preparing an evidence-based analysis.

Quick install

Claude Code (recommended)

Primary

npx skills add pjt222/agent-almanac -a claude-code

Plugin command (alternative)

/plugin add https://github.com/pjt222/agent-almanac

Git clone (alternative)

git clone https://github.com/pjt222/agent-almanac.git ~/.claude/skills/observe-guidance

Copy and paste this command into Claude Code to install this skill

Documentation

Observe (Guidance)

Coach human in field study: frame → protocol → witness → record → analyze → report. Separate fact from interpretation.

Use When

Person wants understand system before intervene (debug by obs, not trial-error)
Conducting research / evidence → needs structured method
Person jumps to conclusions → needs obs discipline
Preparing evidence-based report (not opinion)
Team dynamics, user behavior, process effectiveness via direct obs
After meditate-guidance cultivated attention → direct it at system

In

Required: What to observe (system, process, behavior, codebase, team, phenomenon)
Required: Why (debug, research, audit, curiosity, improvement)
Optional: Time available (single vs multi-day)
Optional: Prior attempts
Optional: Specific Qs / hypotheses
Optional: Recording tools (notebook, screen capture, logging, metrics)

Do

Step 1: Frame

Help set bounded frame.

Ask what: "What system/behavior trying to understand?"
Narrow scope: "What specific aspect interests you most?"
Purpose: understanding / debug / improve / evidence / curiosity
Boundaries: in/out scope → prevents endless expansion
Hypothesis? state explicit, then set aside → "look for evidence both for + against"
Stance:
- Naturalist: no interfere (best for behavior)
- Controlled: change one var, observe effect (best for debug)
- Longitudinal: over time (best for trends)

→ Clear frame: target, scope, purpose, stance defined.

If err: can't narrow ("understand everything") → pick one entry point: "what behavior most confusing?" Already committed conclusion ("just prove X") → gently challenge: "what would disprove it?"

Step 2: Prep protocol

Systematic recording.

Method by type:
- Codebase/system: paths, line numbers, timestamps, log entries
- Behavior/process: time-stamped notes — actor, action, context
- Team/communication: quotes, speaker IDs, non-verbal cues
- Natural/physical: sketches, measurements, env conditions
Template:

Field Notes Template:
┌─────────────┬────────────────────────────────────────────────────────┐
│ Timestamp   │ When the observation occurred                          │
├─────────────┼────────────────────────────────────────────────────────┤
│ Observation │ What was seen/heard/measured (fact only)               │
├─────────────┼────────────────────────────────────────────────────────┤
│ Context     │ What was happening around the observation              │
├─────────────┼────────────────────────────────────────────────────────┤
│ Reaction    │ Observer's response (thoughts, emotions, surprises)    │
├─────────────┼────────────────────────────────────────────────────────┤
│ Hypothesis  │ Tentative interpretation (kept separate from fact)     │
└─────────────┴────────────────────────────────────────────────────────┘

Stress separation: "obs row = fact. hypothesis row = interpretation. Never mix."
Min count: "≥10 obs before any conclusion"
Set up monitoring tools if applicable

→ Recording method ready. Person gets obs↔interpretation distinction. Prepared.

If err: too formal → simplify: "write what you see, separately what you think it means." Resist recording ("I'll remember") → unrecorded = memory bias; writing makes obs accurate.
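
For people who record digitally, the field notes template could be sketched as a small Python record. This is a minimal sketch, not part of the skill itself: the names `FieldNote`, `MIN_OBSERVATIONS`, and `ready_to_conclude` are hypothetical, and the sample date and hypothesis text are placeholders. Fact and interpretation live in separate fields, and the minimum-count rule is checked against facts only.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass(frozen=True)
class FieldNote:
    """One field-notes entry; fields mirror the template rows."""
    timestamp: datetime   # when the observation occurred
    observation: str      # what was seen/heard/measured (fact only)
    context: str = ""     # what was happening around the observation
    reaction: str = ""    # observer's response (thoughts, emotions, surprises)
    hypothesis: str = ""  # tentative interpretation, kept separate from fact


MIN_OBSERVATIONS = 10  # the ">=10 obs before any conclusion" rule


def ready_to_conclude(notes: list[FieldNote]) -> bool:
    """True once enough non-empty factual observations are recorded."""
    factual = [n for n in notes if n.observation.strip()]
    return len(factual) >= MIN_OBSERVATIONS


# Placeholder entry: the date and the hypothesis text are illustrative only.
note = FieldNote(
    timestamp=datetime(2024, 1, 1, 14, 32),
    observation="System stopped responding after 47th request",
    hypothesis="Possible resource exhaustion",
)
```

A hypothesis alone never counts toward the minimum: an entry with an empty `observation` is ignored by `ready_to_conclude`.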

Step 3: Witness

Guide actual obs session.

Remind stance: "naturalist studying new species. No interfere — just watch"
First 5min: pure obs no recording — just attend
After immersion: begin recording w/ template
Coach neutral lang: instead "system crashed" → "system stopped responding 14:32 after 47th request"
Watch interpretation creeping: "that's interpretation — record in hypothesis row"
Note surprises: "what surprised? surprises = most valuable data"
Check frame: "still observing what set out, or drifted?"
Wants to intervene: "note what + why, but don't change yet — keep observing"

→ ≥5-10 concrete obs w/ specific evidence. Experiences obs vs interpret diff. Finds harder than expected.

If err: keep interpreting → exercise: "describe as if to someone never seen this. Only verifiable facts." Run out fast → too high level → zoom in: timing, ordering, edge cases, exceptions.
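
The neutral-language coaching above can be roughly approximated with a word check. This is only a sketch under a strong assumption: the word list `INTERPRETIVE_WORDS` is hypothetical and deliberately short, and `flag_interpretation` is an illustrative helper, not part of the skill. A match is a prompt to move the phrase into the hypothesis row, not proof that the note is wrong.

```python
import re

# Hypothetical, deliberately short word list; extend it for your own domain.
INTERPRETIVE_WORDS = {"crashed", "broken", "because", "obviously", "probably", "should"}


def flag_interpretation(text: str) -> list[str]:
    """Return interpretive words found in an observation, in order of appearance."""
    words = re.findall(r"[a-z']+", text.lower())
    return [w for w in words if w in INTERPRETIVE_WORDS]


loaded = flag_interpretation("System crashed because of load")
neutral = flag_interpretation("System stopped responding 14:32 after 47th request")
```

Here `loaded` holds the two flagged words and `neutral` is empty, matching the rewrite suggested in the coaching step.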

Step 4: Record

Organize raw → structured.

Review together
Completeness: enough context for later?
Factual accuracy: verifiable, or hidden assumptions?
Group similar: "patterns forming?"
Frequencies: how often?
Absences: "what expected but not there?"
Strong (clear evidence) vs weak (ambiguous)

→ Organized field notes cleanly separate obs from interpretation. Detailed enough another can verify.

If err: too vague ("things slow") → add specifics: "how slow? compared to what? which conditions?" Too detailed (record everything) → which relate to frame, which noise.
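
The frequency, absence, and strong-vs-weak checks above could be sketched with `collections.Counter`. This is a minimal sketch assuming each observation has already been given a short tag and an evidence strength; the tags (`timeout`, `retry`, `error-log`) and the helper names are hypothetical.

```python
from collections import Counter

# Hypothetical tagged observations: (tag, evidence strength).
observations = [
    ("timeout", "strong"),
    ("timeout", "strong"),
    ("retry", "weak"),
    ("timeout", "weak"),
    ("retry", "strong"),
]


def tally(observations):
    """Count how often each tag occurs overall and with strong evidence."""
    freq = Counter(tag for tag, _ in observations)
    strong = Counter(tag for tag, strength in observations if strength == "strong")
    return freq, strong


def absences(expected_tags, observations):
    """Return expected tags that were never observed, sorted by name."""
    seen = {tag for tag, _ in observations}
    return sorted(set(expected_tags) - seen)


freq, strong = tally(observations)
missing = absences(["timeout", "retry", "error-log"], observations)
```

`missing` answers the "what expected but not there?" question: a tag that was expected but never recorded is itself a finding.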

Step 5: Analyze

Obs → structured analysis.

Look for patterns:
- Repetition: "happened many times — systematic?"
- Correlation: "X always w/ Y — related?"
- Sequence: "A always before B — A causes B?"
- Absence: "X never in condition Z — why?"
- Anomaly: "all follow P except this — what diff?"
Each pattern: "alternative explanation?"
2-3 hypotheses
Correlation ≠ causation: "co-occur ≠ proves cause"
Testable + what test confirms/refutes
Confidence levels: well-supported vs speculative

→ Raw obs → structured hypotheses, data/theory separation kept. ≥1 testable hypothesis for original Q.

If err: jumps single explanation → challenge: "one possibility. another?" No patterns → too few obs → continue. Every obs same conclusion → filtering → ask: "what would contradict your theory?"
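
Two of the pattern questions above ("X always w/ Y", "A always before B") can be checked mechanically once observations are grouped into ordered sessions. This is a sketch under that assumption; the session data, event names, and function names are hypothetical. A `True` result only marks a pattern worth questioning and says nothing about cause.

```python
def always_together(sessions, x, y):
    """Correlation check: x occurs at least once, and every session with x also has y."""
    with_x = [s for s in sessions if x in s]
    return bool(with_x) and all(y in s for s in with_x)


def always_before(sessions, a, b):
    """Sequence check: b occurs at least once, and a first appears before b each time."""
    with_b = [s for s in sessions if b in s]
    return bool(with_b) and all(a in s and s.index(a) < s.index(b) for s in with_b)


# Hypothetical sessions: each list is one session's events in observed order.
sessions = [
    ["login", "cache-miss", "slow-response"],
    ["login", "slow-response"],
    ["login", "cache-miss", "retry", "slow-response"],
]

together = always_together(sessions, "cache-miss", "slow-response")
reverse = always_together(sessions, "slow-response", "cache-miss")
ordered = always_before(sessions, "login", "slow-response")
```

The asymmetry between `together` and `reverse` is the useful part: every cache miss came with a slow response, but one slow response occurred without a cache miss, which already suggests an alternative explanation to look for.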

Step 6: Report

Communicate findings.

Structure:
- Context: what/when/why/conditions
- Method: protocol, tools, duration
- Findings: key obs w/ evidence (data, not interpretation)
- Analysis: patterns, hypotheses, confidence
- Recommendations: next steps (more obs, test, intervene)
- Limitations: not covered, potential biases
Findings in neutral lang separating fact from interpretation
Review for hidden assumptions / unsupported claims
Debug? translate hypotheses → concrete tests
Report? evidence cited specifically
Personal? summarize insights + remaining Qs

→ Clear report communicates obs/patterns/hypotheses, distinction kept. Reader can evaluate evidence independently.

If err: buries obs in interpretation → restructure: "facts one section, theories another." No confidence ("definitely because...") → calibrate: "how sure? what would change mind?"
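
The report structure above could be enforced with a small renderer that refuses to produce a report when a section is empty. This is a minimal sketch: `SECTIONS` follows the structure list in this step, while `render_report` and the plain-text layout are assumptions, not a format the skill prescribes.

```python
# Section order follows the report structure listed in this step.
SECTIONS = ["Context", "Method", "Findings", "Analysis", "Recommendations", "Limitations"]


def render_report(content: dict[str, list[str]]) -> str:
    """Render sections in fixed order; raise if any section is missing or empty."""
    missing = [s for s in SECTIONS if not content.get(s)]
    if missing:
        raise ValueError(f"Missing report sections: {', '.join(missing)}")
    lines = []
    for section in SECTIONS:
        lines.append(section)
        lines.extend(f"- {item}" for item in content[section])
        lines.append("")  # blank line between sections
    return "\n".join(lines).rstrip() + "\n"
```

Keeping Findings and Analysis as separate keys makes the "facts one section, theories another" restructuring hard to skip.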

Check

Frame set before obs (not wandering)
Recording protocol established + used consistently
Obs as facts, separate from interpretations
≥5 concrete evidence-backed obs
Patterns from analysis, not assumed
Hypotheses testable, stated confidence
Person experienced obs-before-interpret discipline

Traps

Confirmation bias: only obs supporting belief. Frame must include "look for evidence against your hypothesis"
Intervention urge: see + fix immediately → masks root cause → observe first
Recording fatigue: detail = taxing. Breaks + realistic lengths (30-60min focused = substantial)
Over-protocol: simple obs needs only notebook+timestamps. Protocol serves obs, not replaces
Obs ≠ surveillance: ethical boundaries matter. Visible behavior, no spy. People → transparency > secrecy
Skip frame: no target → attention scatters → unfocused. Rough frame > none

Related

observe — AI self-directed variant
learn-guidance — obs feeds learning
listen-guidance — focused obs of speaker; obs broader to any system
remote-viewing-guidance — shares method adapted for non-local
read-garden — garden obs uses similar CRV-adapted sensory protocols

GitHub repository

pjt222/agent-almanac

Path: i18n/caveman-ultra/skills/observe-guidance

Tags: agents, agentskills, ai-assisted-development, claude-code, skills, teams

FAQ

What is the observe-guidance skill?

observe-guidance is a Claude Skill by pjt222. Skills package instructions and resources that Claude loads on demand, so Claude can perform observe-guidance-related tasks without extra prompting.

How do I install observe-guidance?

Use the install commands on this page: add observe-guidance to Claude Code as a plugin, or clone its repository into your skills directory, then restart Claude so it picks up the skill.

What category does observe-guidance belong to?

observe-guidance is in the Testing category, tagged ai and design.

Is observe-guidance free to use?

Yes. observe-guidance is listed on AIMCP and free to install. It runs inside Claude, so no separate service account is required to use the skill itself.
