when-testing-code-use-testing-framework

DNYoussef
Updated yesterday
Tag: testing

About

This skill provides a standardized procedure for running your testing framework to validate code changes with reproducible commands and evidence capture. It triggers when you need to test changes before a merge, reproduce bugs, or establish baselines. Key features include enforcing confidence ceilings, ensuring result reproducibility, and integrating with tools like Bash and Grep for execution.

Quick Install

Claude Code

Plugin command (recommended)
/plugin add https://github.com/DNYoussef/context-cascade
Git clone (alternative)
git clone https://github.com/DNYoussef/context-cascade.git ~/.claude/skills/when-testing-code-use-testing-framework

Copy and paste either command into Claude Code to install the skill.

Skill Documentation

STANDARD OPERATING PROCEDURE

Purpose

Guide reviewers and developers through running the testing framework to validate code, with captured evidence, reproducible commands, and confidence statements that stay within their ceilings.

Trigger Conditions

  • Positive: validating changes before merge, reproducing reported bugs, or establishing baselines for new features.
  • Negative: style-only polish (use style-audit) or verification of claims without execution (use verification-quality).

Guardrails

  • Confidence ceiling: Include Confidence: X.XX (ceiling: TYPE Y.YY) using ceilings {inference/report 0.70, research 0.85, observation/definition 0.95}.
  • Reproducibility: Document commands, environment, fixtures, and seeds; attach logs.
  • Structure-first: Maintain readme.md, process.md, and scripts to run/generate tests; keep examples/tests synced with the current framework.
  • Adversarial validation: Run boundary/negative cases in addition to happy paths.
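
The confidence-ceiling guardrail can be sketched as a small formatter. This is a minimal illustration, not part of the skill: the ceiling values come from the guardrail above, while the function name `format_confidence` and the choice to clamp (rather than reject) an over-ceiling value are assumptions.

```python
# Ceiling values are taken from the guardrail text; everything else here
# (names, clamping behaviour) is a hypothetical illustration.
CEILINGS = {
    "inference": 0.70,
    "report": 0.70,
    "research": 0.85,
    "observation": 0.95,
    "definition": 0.95,
}


def format_confidence(value: float, evidence_type: str) -> str:
    """Return a confidence statement, clamped to the ceiling for its type."""
    if evidence_type not in CEILINGS:
        raise ValueError(f"unknown evidence type: {evidence_type!r}")
    ceiling = CEILINGS[evidence_type]
    # Keep the value inside [0, ceiling] so the statement never overclaims.
    capped = min(max(value, 0.0), ceiling)
    return f"Confidence: {capped:.2f} (ceiling: {evidence_type} {ceiling:.2f})"


if __name__ == "__main__":
    print(format_confidence(0.90, "inference"))
```

Clamping keeps a report usable when the raw estimate is too high; a stricter variant could raise an error instead so the author has to justify the number.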

Execution Phases

  1. Setup
    • Review readme.md and scripts (slash-command-test-run.sh, slash-command-test-generate.sh).
    • Prepare environment per subagent-testing-framework.md and ensure dependencies are installed.
  2. Test Selection & Generation
    • Identify suites relevant to the change; generate missing cases using provided scripts if needed.
  3. Execution
    • Run tests with reproducible commands; capture outputs and failures with file:line references.
    • Re-run flaky tests to confirm stability; note nondeterminism.
  4. Reporting & Confidence
    • Summarize pass/fail counts, failing cases, and reproduction steps.
    • Recommend fixes or reruns; provide confidence with ceiling.
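
The execution phase above could be approximated with a wrapper that runs a command several times, keeps the logs, and flags nondeterminism. This is a sketch under stated assumptions: `run_recorded` is a hypothetical helper, the command shown is a placeholder rather than the skill's own scripts, and "flaky" is taken to mean that identical runs returned different exit codes.

```python
import platform
import subprocess
import sys


def run_recorded(command: list[str], reruns: int = 2) -> dict:
    """Run `command` 1 + reruns times and record what is needed to reproduce it."""
    attempts = []
    for _ in range(1 + reruns):
        proc = subprocess.run(command, capture_output=True, text=True)
        attempts.append(
            {"returncode": proc.returncode, "stdout": proc.stdout, "stderr": proc.stderr}
        )
    codes = {attempt["returncode"] for attempt in attempts}
    return {
        "command": command,
        "python": platform.python_version(),
        "platform": platform.platform(),
        "attempts": attempts,
        "passed": codes == {0},   # every attempt exited cleanly
        "flaky": len(codes) > 1,  # identical runs disagreed on the exit code
    }


if __name__ == "__main__":
    # Placeholder command; substitute the project's real test invocation.
    record = run_recorded([sys.executable, "-c", "print('ok')"])
    print(record["passed"], record["flaky"])
```

Storing the exact argument list alongside the interpreter and platform strings gives a reviewer enough to repeat the run; seeds and fixture versions would need to be added for tests that depend on them.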

Output Format

  • Environment and commands used.
  • Test results (pass/fail, logs, failing file:line).
  • Flaky cases and follow-up actions.
  • Confidence statement using ceiling syntax.
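
One way to assemble the four output items is shown below. It is a sketch, not the skill's reporting code: `failing_locations` and `render_report` are hypothetical names, and the regular expression assumes failure lines of the form `path/to/file.py:42: message`, which may not match the framework in use.

```python
import re

# Assumed log shape: "path/to/file.py:42: message" at the start of a line.
# Adjust the pattern to whatever the test framework actually prints.
FILE_LINE = re.compile(r"^(?P<path>[^\s:]+):(?P<line>\d+):", re.MULTILINE)


def failing_locations(log: str) -> list[str]:
    """Return unique "file:line" references in order of first appearance."""
    seen = []
    for match in FILE_LINE.finditer(log):
        ref = f"{match['path']}:{match['line']}"
        if ref not in seen:
            seen.append(ref)
    return seen


def render_report(environment, commands, passed, failed, log, flaky, confidence):
    """Build a plain-text report covering the four output items."""
    sections = [
        "Environment: " + environment,
        "Commands: " + "; ".join(commands),
        f"Results: {passed} passed, {failed} failed",
        "Failing locations: " + (", ".join(failing_locations(log)) or "none"),
        "Flaky cases: " + (", ".join(flaky) or "none"),
        confidence,
    ]
    return "\n".join(sections)
```

Extracting locations from the captured log, rather than typing them by hand, keeps the report tied to the evidence it cites.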

Validation Checklist

  • Environment and dependencies prepared.
  • Relevant suites selected and/or generated.
  • Tests executed with logs captured; flakiness noted.
  • Confidence ceiling provided; English-only output.
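
The last checklist item can be partly automated by checking a report's confidence line against the ceilings. The following is a rough sketch: `check_confidence` is a hypothetical helper, it only inspects the first statement it finds, and it does not attempt the English-only check.

```python
import re

# Ceilings as listed under Guardrails; the checker itself is illustrative.
CEILINGS = {
    "inference": 0.70,
    "report": 0.70,
    "research": 0.85,
    "observation": 0.95,
    "definition": 0.95,
}
STATEMENT = re.compile(r"Confidence: (\d\.\d{2}) \(ceiling: (\w+) (\d\.\d{2})\)")


def check_confidence(text: str) -> list[str]:
    """Return a list of problems with the first confidence statement in `text`."""
    match = STATEMENT.search(text)
    if match is None:
        return ["no confidence statement found"]
    value, kind, ceiling = float(match[1]), match[2], float(match[3])
    problems = []
    if kind not in CEILINGS:
        problems.append(f"unknown ceiling type: {kind}")
    elif abs(CEILINGS[kind] - ceiling) > 1e-9:
        problems.append(f"ceiling for {kind} should be {CEILINGS[kind]:.2f}")
    if value > ceiling:
        problems.append(f"confidence {value:.2f} exceeds ceiling {ceiling:.2f}")
    return problems
```

An empty list means the statement is well-formed and within its ceiling; any entries can be pasted into the follow-up actions of the report.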

Confidence: 0.70 (ceiling: inference 0.70) – SOP rewritten using Prompt Architect confidence discipline and Skill Forge structure-first testing workflow.

GitHub Repository

DNYoussef/context-cascade
Path: skills/quality/testing/when-testing-code-use-testing-framework
