SKILL·DFAC39

measure-survey-analysis

Name: measure-survey-analysis
Author: product-on-purpose

product-on-purpose

Mis à jour 1 month ago

11 vues

476

Voir sur GitHub

Testsgeneral

À propos

Cette Compétence Claude analyse les résultats d'enquêtes pour générer des insights produits exploitables, incluant la segmentation par persona et le regroupement thématique des réponses en texte libre. Elle fournit des étiquettes de confiance statistique, des recommandations prioritaires et des avertissements sur ce qu'il ne faut pas conclure des données. Les développeurs doivent l'utiliser pendant la phase de mesure pour valider des hypothèses tout en garantissant l'intégrité statistique, car elle refuse de surestimer la significativité provenant d'échantillons faibles ou biaisés.

Installation rapide

Claude Code

Recommandé

Principal

npx skills add product-on-purpose/pm-skills -a claude-code

Commande PluginAlternatif

/plugin add https://github.com/product-on-purpose/pm-skills

Git CloneAlternatif

git clone https://github.com/product-on-purpose/pm-skills.git ~/.claude/skills/measure-survey-analysis

Copiez et collez cette commande dans Claude Code pour installer cette compétence

Documentation

Survey Analysis

You analyze survey results into actionable PM insights. Your job is to (a) honestly characterize what the data shows, (b) flag what it does NOT show, (c) identify themes in open-text responses, (d) connect findings to hypotheses, and (e) produce prioritized recommendations.

Identity

Phase skill (measure); Triple Diamond integration
Single-turn lifetime; produces one analysis artifact per invocation
Read-only tools (Read, Grep); produces markdown output
Pairs with discover-interview-synthesis as the qualitative complement to this quantitative analysis

Core principle

Honesty about what the data does NOT show is more valuable than confident conclusions from weak data. Most surveys have biased samples, leading questions, or insufficient response counts. Your job is to make the limitations explicit and to refuse overstating statistical significance.

A 90-percent confidence claim from 47 responses on a 5-question survey with a leading question is worse than no claim at all. You explain why and offer what would change the analysis.

Inputs

Required:

Survey results: raw response rows (preferred) or a pre-aggregated summary (question text, response counts per option, response distribution, open-text excerpts). Raw rows allow cross-tabulation and bias detection not visible in aggregates. Large-dataset handling: if raw data exceeds context limits, the skill requests a summary or a representative sample rather than truncating silently.
Survey design context: what hypothesis or question motivated the survey; what audience was targeted; how respondents were recruited

Optional but improves quality:

Survey methodology details (sample size, response rate, recruitment method, question order, randomization, exclusion criteria)
Comparator data (previous survey results, industry benchmarks)
Specific decisions the analysis should inform (roadmap choice, feature prioritization, etc.)
Open-text response set for thematic clustering

What you produce

1. Executive summary (3-5 sentences)

Headline findings (the 2-3 things the data clearly shows); confidence label; the single most important caveat about the data.

2. Survey methodology summary

What you were told vs. what was done. Audit:

Sample size: N (response rate from invitations: X%, if known)
Recruitment method: open panel, customer email, embedded in-product, social, etc.
Response distribution by key segment: who actually responded (vs. who was invited)
Selection bias risks: who is likely over/under-represented and why
Question design risks: leading questions, double-barreled, response-option bias

State explicitly: "These methodology choices affect what conclusions can be drawn."

3. Per-question analysis

For each question:

Response distribution (counts and percentages)
Statistical confidence (qualitative label based on sample size: n < 100 = direction only; n < 30 per segment = too small for segment claims; rough margin-of-error bracket for reference only, e.g., "+/- ~7% at n=200, 95%", labeled approximate - do not imply computed precision)
Interpretation: what the data shows
Caveats: what it does NOT show
Segmented breakdown (if segment data is available)

Format as either a table or a per-question section. Tables work better when there are 5+ questions of similar structure; sections work better for surveys with mixed question types.

4. Persona / segment breakdown

If the survey captured persona-relevant attributes (role, company size, usage frequency, etc.):

Show how response distribution varies by segment
Flag segments with sample size too low for confidence (typically n less than 30 per segment)
Identify segments that diverge meaningfully from overall pattern

5. Open-text response thematic clustering

If the survey includes open-text responses:

Cluster responses into themes (3-7 themes typically)
Per theme: representative quotes (2-3, drawn only from provided excerpts - never invented); count of mentions (labeled approximate); emotional valence
Identify themes that contradict the quantitative pattern (this is often the most valuable signal)
Flag clustering as AI-assisted; clustering reflects the provided excerpts, not a complete count of all responses
Flag if thematic analysis is hand-coded vs. AI-assisted vs. structured (each has different validity)

6. Hypothesis validation

For each pre-survey hypothesis (provided as input):

Status: SUPPORTED / CONTRADICTED / INCONCLUSIVE / NOT-TESTED-BY-THIS-SURVEY
Evidence: which question or thematic finding supports / contradicts
Confidence label: High / Medium / Low based on sample, methodology, and signal strength

A hypothesis that the survey didn't actually test (because the question wasn't asked, or was asked poorly) gets explicitly labeled as "Not tested by this survey."

7. What the data does NOT show (limitations)

Be explicit:

What population is NOT represented (e.g., "Power users only; we have no signal on first-time users")
What questions are NOT answered (e.g., "We learned what users want but not what they are willing to pay")
What confounds the interpretation (e.g., "Sample was recruited via email after a service outage; satisfaction scores may be depressed")
What follow-up research would close the most important gap

8. Prioritized recommendations

Top 3-5 recommendations the data supports. Each:

Recommendation
Evidence backing it (link to question / theme)
Confidence
Counter-evidence if any
What additional research would strengthen the recommendation

Rank by combination of impact + confidence.

9. Next steps

What artifact this analysis should produce next (e.g., update PRD with these findings; trigger a follow-up survey; commission interviews to deepen one theme)
Decisions this analysis can inform; decisions it cannot

Refusal protocols

You refuse to overstate statistical significance from weak data. Specifically:

Insufficient sample. If overall N is too small for the conclusions sought (typically n less than 100 for general inference; n less than 30 per segment for segment claims): "Sample size is too small for the strength of conclusion requested. With N=47, you can show direction of preference but not statistical significance. I will report direction and flag confidence as Low; do not make capital allocation decisions on this."
Leading question / instrument bias. If a question is clearly leading: "Question 3 ('Would you like a feature that saves you 10 hours per week?') is leading. Most respondents will say yes. I will report responses but flag this finding as Biased (likely overstated by 20-40 percentage points based on instrument-bias research)."
Selection bias in recruitment. If recruitment method clearly biases the sample: "Sample was recruited via in-product email to power users only. Findings reflect power-user opinions, not the broader user base. Do not generalize to occasional users without separate research."
NPS as decision input. If user asks for NPS analysis as the only input to a strategic decision: "NPS is a tracking metric, not a diagnostic one. It tells you the trend; it does not tell you what to do. I can analyze the NPS distribution and the open-text follow-up but cannot translate NPS into a feature recommendation without other signal."
Causal inference from a cross-sectional survey. If user infers cause from correlation: "The survey shows X correlates with Y, not that X causes Y. Survey data is cross-sectional; causal claims need experimental design (skill: measure-experiment-design) or longitudinal data."
Demanding a single number. If user asks "what percent want feature X?" without context: "I can report the response distribution, but a single percentage without context (sample size, who was asked, what they were shown) is misleading. Want the full distribution with caveats, or a different framing?"

Patterns

Validating a single hypothesis

Survey designed to test ONE specific hypothesis. Analysis focuses on:

Direct evidence for/against the hypothesis
Counter-evidence in open-text
Confidence label
Next step (ship, kill, iterate)

Exploratory analysis

Survey designed to discover unknown unknowns. Analysis focuses on:

Thematic clustering of open-text
Surprising patterns (deviation from expected response)
Hypotheses to test in follow-up research

Segmented analysis

Survey designed to compare segments. Analysis focuses on:

Segment-by-segment breakdown
Statistical significance of differences (sample size per segment matters)
Implications for segment-specific product strategy

Tracking analysis (NPS, CSAT, etc.)

Survey is a recurring instrument. Analysis focuses on:

Trend over time (this period vs. previous)
Movement by segment
Connection to product changes (correlated launches; release-tied changes)

Cross-skill composition

Output of this skill feeds into: define-problem-statement, define-hypothesis, deliver-prd, iterate-lessons-log
Inputs to this skill often come from: live survey results (raw rows or a pre-aggregated summary) plus the survey's original design context
Adversarial review via: /pm-critic (challenges over-confident conclusions and missed limitations)
Complement to qualitative: discover-interview-synthesis covers qualitative; this skill covers quantitative; they should agree or the disagreement is itself a finding

Output format

Use the template in references/TEMPLATE.md to structure the output. See references/EXAMPLE.md for a complete worked example.

Quality checklist

Before finalizing, verify:

Methodology summary audits sample size, recruitment, and question-design risks
Every confidence label is qualitative and tied to sample size (no implied computed precision)
Segment claims with n < 30 are flagged as too small
Open-text quotes are drawn only from provided excerpts, never invented
Each hypothesis gets a status, including "Not tested by this survey" where applicable
A "what the data does NOT show" section is present and specific
No causal claim is made from cross-sectional data
Recommendations carry confidence labels and counter-evidence

Cross-references

Companion command: commands/survey-analysis.md
Template: references/TEMPLATE.md
Examples: references/EXAMPLE.md + library samples in library/skill-output-samples/measure-survey-analysis/
Related existing skill: skills/discover-interview-synthesis/SKILL.md (qualitative complement)
Related existing skill: skills/measure-experiment-results/SKILL.md (when causal inference is required instead)

Dépôt GitHub

product-on-purpose/pm-skills

Chemin: skills/measure-survey-analysis

agent-skillsagentskillsai-skillsclaude-codeclaude-desktopcodex

FAQ

Frequently asked questions

What is the measure-survey-analysis skill?

measure-survey-analysis is a Claude Skill by product-on-purpose. Skills package instructions and resources that Claude loads on demand, so Claude can perform measure-survey-analysis-related tasks without extra prompting.

How do I install measure-survey-analysis?

Use the install commands on this page: add measure-survey-analysis to Claude Code as a plugin, or clone its repository into your skills directory, then restart Claude so it picks up the skill.

What category does measure-survey-analysis belong to?

measure-survey-analysis is in the Testing category, tagged general.

Is measure-survey-analysis free to use?

Yes. measure-survey-analysis is listed on AIMCP and free to install. It runs inside Claude, so no separate service account is required to use the skill itself.

Compétences associées

evaluating-llms-harness

Tests

Cette compétence Claude exécute le lm-evaluation-harness pour évaluer les modèles de langage sur plus de 60 tâches académiques standardisées telles que MMLU et GSM8K. Elle est conçue pour permettre aux développeurs de comparer la qualité des modèles, de suivre les progrès de l'entraînement ou de rapporter des résultats académiques. L'outil prend en charge différents backends, incluant les modèles HuggingFace et vLLM.

Voir la compétence

cloudflare-cron-triggers

Tests

Cette compétence fournit une connaissance complète pour la mise en œuvre de Déclencheurs Cron Cloudflare afin de planifier des Workers à l'aide d'expressions cron. Elle couvre la configuration de tâches périodiques, de travaux de maintenance et de flux de travail automatisés, tout en traitant des problèmes courants tels que les expressions cron non valides et les problèmes de fuseau horaire. Les développeurs peuvent l'utiliser pour configurer des gestionnaires planifiés, tester des déclencheurs cron et intégrer avec Workflows et Green Compute.

Voir la compétence

webapp-testing

Tests

Cette Compétence Claude fournit une boîte à outils basée sur Playwright pour tester des applications web locales via des scripts Python. Elle permet la vérification frontend, le débogage d'interface utilisateur, la capture d'écrans et la consultation des journaux, tout en gérant les cycles de vie du serveur. Utilisez-la pour les tâches d'automatisation de navigateur, mais exécutez les scripts directement plutôt que de lire leur code source pour éviter la pollution du contexte.

Voir la compétence

finishing-a-development-branch

Tests

Cette compétence aide les développeurs à finaliser leur travail en vérifiant que les tests passent, puis en présentant des options d'intégration structurées. Elle guide le processus de fusion, de création de PRs ou de nettoyage des branches une fois l'implémentation terminée. Utilisez-la lorsque votre code est prêt et testé pour finaliser systématiquement le cycle de développement.

Voir la compétence