SKILL·C054C4

simulation-failure-triage

Name: simulation-failure-triage
Author: HeshamFS

HeshamFS

更新于 1 month ago

9 次查看

开发ai

关于

This skill helps developers triage failed materials simulations by diagnosing common issues like nonconvergence, NaN/Inf errors, and unstable timesteps. It proposes safe, defensible retry ladders and immediate actions for recovery. Use it when you encounter a suspicious or failed simulation and need a structured first response.

快速安装

Claude Code

技能文档

Simulation Failure Triage

Goal

Classify common simulation failure signatures and return immediate actions, retry ladders, and stop conditions.

Requirements

Python 3.10+
No external dependencies
Works on Linux, macOS, and Windows

Inputs to Gather

Input	Description	Example
Code	Simulation code	`LAMMPS`, `VASP`, `MOOSE`, `QE`
Stage	Setup, runtime, postprocess	`runtime`
Symptoms	Failure signs	`nan,pressure-blowup`
Log text or file	Error evidence	`Lost atoms`, `ZBRENT`
Recent change	Last modified setting	`larger timestep`

Decision Guidance

First preserve evidence: logs, inputs, executable version, and scheduler output.
Separate setup errors from numerical instability and physical model issues.
Retry with a single controlled change.
Stop retrying when the result becomes scientifically meaningless or a required model input is missing.

Script Outputs

scripts/failure_triage.py emits:

likely_causes
immediate_actions
retry_ladder
stop_conditions
evidence

Workflow

python3 skills/robustness/simulation-failure-triage/scripts/failure_triage.py \
  --code LAMMPS \
  --stage runtime \
  --symptoms nan,pressure-blowup \
  --recent-change "increased timestep" \
  --json

Error Handling

Invalid stages or oversized log files stop with exit code 2. Unknown symptoms are retained as custom evidence.

Limitations

This skill gives first-response triage. It does not guarantee that a failed simulation can be repaired.

Security

Log files are read with a 10 MB size cap.
Log text is truncated and never executed.
The script does not run external solvers.
The skill uses Bash only to run its bundled script.

References

See references/failure_patterns.md for common failure signatures and retry ladders.

Version History

1.0.0: Initial cross-code simulation failure triage skill.

GitHub 仓库

HeshamFS/materials-simulation-skills

路径: skills/robustness/simulation-failure-triage

agent-skillsagentscli-toolscomputational-sciencellmmaterials-science

FAQ

Frequently asked questions

What is the simulation-failure-triage skill?

simulation-failure-triage is a Claude Skill by HeshamFS. Skills package instructions and resources that Claude loads on demand, so Claude can perform simulation-failure-triage-related tasks without extra prompting.

How do I install simulation-failure-triage?

Use the install commands on this page: add simulation-failure-triage to Claude Code as a plugin, or clone its repository into your skills directory, then restart Claude so it picks up the skill.

What category does simulation-failure-triage belong to?

simulation-failure-triage is in the Development category, tagged ai.

Is simulation-failure-triage free to use?

Yes. simulation-failure-triage is listed on AIMCP and free to install. It runs inside Claude, so no separate service account is required to use the skill itself.