running-mutation-tests
关于
This skill performs mutation testing to evaluate test suite quality by introducing code mutations and checking if tests detect them. It calculates a mutation survival rate to reveal test coverage gaps and effectiveness. Use it when developers need to assess test robustness using terms like "mutation testing" or "mutation score".
技能文档
Overview
This skill empowers Claude to execute mutation testing, providing insights into the effectiveness of a test suite. By introducing small changes (mutations) into the code and running the tests, it determines if the tests are capable of detecting these changes. This helps identify weaknesses in the test suite and improve overall code quality.
How It Works
- Mutation Generation: The plugin automatically introduces mutations (e.g., changing
+to-) into the code. - Test Execution: The test suite is run against the mutated code.
- Result Analysis: The plugin analyzes which mutations were "killed" (detected by tests) and which "survived" (were not detected).
- Reporting: A mutation score is calculated, and surviving mutants are identified for further investigation.
When to Use This Skill
This skill activates when you need to:
- Validate the effectiveness of a test suite.
- Identify gaps in test coverage.
- Improve the mutation score of a project.
- Analyze surviving mutants to strengthen tests.
Examples
Example 1: Improving Test Coverage
User request: "Run mutation testing on the validator module and suggest improvements to the tests."
The skill will:
- Execute mutation tests on the validator module.
- Analyze the results and identify surviving mutants, indicating areas where tests are weak.
- Suggest specific improvements to the tests based on the surviving mutants, such as adding new test cases or modifying existing ones.
Example 2: Assessing Test Quality
User request: "What is the mutation score for the user authentication service?"
The skill will:
- Execute mutation tests on the user authentication service.
- Calculate the mutation score based on the number of killed mutants.
- Report the mutation score to the user, providing a metric for test quality.
Best Practices
- Targeted Mutation: Focus mutation testing on critical modules or areas with high complexity.
- Analyze Survivors: Prioritize the analysis of surviving mutants to identify the most impactful improvements to test coverage.
- Iterative Improvement: Use mutation testing as part of an iterative process to continuously improve test suite quality.
Integration
This skill integrates well with other testing and code analysis tools. For example, it can be used in conjunction with code coverage tools to provide a more comprehensive view of test effectiveness.
快速安装
/plugin add https://github.com/jeremylongshore/claude-code-plugins-plus/tree/main/mutation-test-runner在 Claude Code 中复制并粘贴此命令以安装该技能
GitHub 仓库
相关推荐技能
llamaguard
其他LlamaGuard是Meta推出的7-8B参数内容审核模型,专门用于过滤LLM的输入和输出内容。它能检测六大安全风险类别(暴力/仇恨、性内容、武器、违禁品、自残、犯罪计划),准确率达94-95%。开发者可通过HuggingFace、vLLM或Sagemaker快速部署,并能与NeMo Guardrails集成实现自动化安全防护。
sglang
元SGLang是一个专为LLM设计的高性能推理框架,特别适用于需要结构化输出的场景。它通过RadixAttention前缀缓存技术,在处理JSON、正则表达式、工具调用等具有重复前缀的复杂工作流时,能实现极速生成。如果你正在构建智能体或多轮对话系统,并追求远超vLLM的推理性能,SGLang是理想选择。
evaluating-llms-harness
测试该Skill通过60+个学术基准测试(如MMLU、GSM8K等)评估大语言模型质量,适用于模型对比、学术研究及训练进度追踪。它支持HuggingFace、vLLM和API接口,被EleutherAI等行业领先机构广泛采用。开发者可通过简单命令行快速对模型进行多任务批量评估。
langchain
元LangChain是一个用于构建LLM应用程序的框架,支持智能体、链和RAG应用开发。它提供多模型提供商支持、500+工具集成、记忆管理和向量检索等核心功能。开发者可用它快速构建聊天机器人、问答系统和自主代理,适用于从原型验证到生产部署的全流程。
