nextflow
关于
This skill provides comprehensive support for developing, executing, and troubleshooting Nextflow and nf-core pipelines. It assists with writing modules, configuring executors and containers, scaling to HPC/cloud platforms, and debugging runs. Use it for any reproducible scientific workflow development, even if Nextflow isn't explicitly mentioned.
快速安装
Claude Code
推荐npx skills add K-Dense-AI/claude-scientific-skills -a claude-code/plugin add https://github.com/K-Dense-AI/claude-scientific-skillsgit clone https://github.com/K-Dense-AI/claude-scientific-skills.git ~/.claude/skills/nextflow在 Claude Code 中复制并粘贴此命令以安装该技能
技能文档
Nextflow
Overview
Nextflow is a workflow language and runtime for building reproducible, portable, scalable data pipelines. It is dominant in bioinformatics but works for any data-heavy computation. nf-core is a community curating production-grade Nextflow pipelines, reusable modules, and the nf-core tooling on top of Nextflow.
Key ideas:
- Dataflow programming: pipelines are
processtasks connected by channels. Nextflow infers execution order and parallelism from data dependencies — there is no explicit scheduler to write. - Write once, run anywhere: the same pipeline runs locally, on HPC (SLURM, SGE, LSF, PBS), and on cloud (AWS Batch, Google Batch, Azure Batch, Kubernetes) by changing config/profiles, not code.
- Reproducibility: per-task containers (Docker/Singularity/Apptainer/Conda/Wave) +
-resumecaching + pinned pipeline revisions. - DSL2 is the modern, required syntax: modular
process/workflow/includedefinitions.
This skill covers both running existing pipelines and developing your own (Nextflow language + nf-core conventions, testing with nf-test, configuration, and deployment).
When to Use This Skill
Use this skill when the user wants to:
- Run an nf-core or custom Nextflow pipeline, or debug a failing/resuming run.
- Write or modify
.nfscripts,nextflow.config, profiles, ornextflow_schema.json. - Author or test nf-core-style modules/subworkflows (
main.nf,meta.yml,tests/, nf-test). - Configure executors, containers, or resources; scale to HPC or cloud.
- Build a reproducible scientific/bioinformatics workflow (even if "Nextflow" is not named).
- Understand processes, channels, operators,
take/emit,publishDir,ext.args, meta maps.
Setup
Nextflow needs Bash and Java 17 or newer (17–25 supported). Verify with java -version.
# Install Nextflow (self-contained launcher)
curl -s https://get.nextflow.io | bash # creates ./nextflow
sudo mv nextflow /usr/local/bin/ # put on PATH
nextflow info # verify
# Or via conda/bioconda (also gets a managed Java)
conda create -n nf -c bioconda -c conda-forge nextflow nf-core
# nf-core tools (Python) for creating/linting/running nf-core assets
pip install nf-core # or: conda install -c bioconda nf-core
nf-core --version
Pin the engine for reproducibility: export NXF_VER=24.10.0 (use an [edge] release only if needed). For air-gapped/HPC, see references/running-pipelines.md (offline mode) and references/configuration.md.
Two Modes of Work
Decide which path the user is on — it changes everything:
| Goal | Start here |
|---|---|
Run an existing pipeline (nf-core or a .nf you were given) | references/running-pipelines.md |
| Develop a new pipeline / module / subworkflow | references/language.md + references/developing.md |
| Configure / scale (HPC, cloud, containers, resources) | references/configuration.md + references/containers.md |
| Test modules/pipelines | references/testing.md |
Quick Start
Run an nf-core pipeline
Always smoke-test with the bundled test profile first; it uses tiny data and proves your environment works.
# 1. Confirm setup works (downloads pipeline + tiny test data)
nextflow run nf-core/rnaseq -profile test,docker --outdir results
# 2. Real run: pin a revision (-r), pick a container engine, pass inputs
nextflow run nf-core/rnaseq -r 3.14.0 \
-profile docker \
--input samplesheet.csv \
--genome GRCh38 \
--outdir results \
-resume
-profile(single dash) selects bundled config profiles; combine them comma-separated, e.g.test,docker. Container/infra profiles (docker,singularity,conda) are mutually exclusive — pick one.--input,--genome,--outdir(double dash) are pipeline parameters. nf-core pipelines take a samplesheet CSV, not loose files.-resumereuses cached results from the last run.-r <version>pins a release for reproducibility.
Use nf-core pipelines launch <name> for an interactive, schema-validated way to build the command and a -params-file. See references/running-pipelines.md.
Write a minimal pipeline
#!/usr/bin/env nextflow
process SAYHELLO {
tag "$greeting"
publishDir "results", mode: 'copy'
input:
val greeting
output:
path "${greeting}.txt"
script:
"""
echo '$greeting world' > ${greeting}.txt
"""
}
workflow {
channel.of('hello', 'bonjour', 'hola') | SAYHELLO
}
nextflow run main.nf # add -resume on reruns
The full language (processes, channels, operators, DSL2 workflows with take/main/emit, modules) is in references/language.md.
Core Concepts at a Glance
- Process: a unit of work that runs a script (Bash by default). Declares
input:,output:, optionaldirectives(resources, container,publishDir,tag,errorStrategy), and ascript:/shell:/exec:block. Each task runs in its own isolated work directory (work/xx/yy…). - Channel: the async queues that connect processes. Queue channels are consumable streams; value channels hold a single reusable value. Created with factories like
channel.of,channel.fromPath,channel.fromFilePairs,channel.value. - Operator: transforms/combines channels —
map,filter,collect,groupTuple,join,combine,mix,flatten,branch,multiMap,splitCsv,view,set. - Workflow: composes processes. DSL2 workflows can declare
take:(inputs),main:(logic),emit:(named outputs) and beincluded as subworkflows. The unnamedworkflow {}is the entry point. - Module: a
.nffile exposing processes/workflows viainclude { NAME } from './path'(supportsasaliasing). - Configuration:
nextflow.configsetsparams,processdirectives,executor, container engines, and namedprofiles. SelectorswithName:/withLabel:target specific processes. Seereferences/configuration.md. - meta map (nf-core): the convention of carrying a metadata map (
[ id:'sample1', single_end:false ]) alongside files in input/output tuples so samples stay labeled through the pipeline. Seereferences/developing.md.
nf-core tools CLI
nf-core tools (v3+) group subcommands under pipelines, modules, and subworkflows. (Bare forms like nf-core lint still work but warn — prefer the grouped form.)
| Command | Purpose |
|---|---|
nf-core pipelines list | List/search nf-core pipelines (--json, keywords) |
nf-core pipelines create | Scaffold a new pipeline from the nf-core template |
nf-core pipelines launch <name> | Interactive, schema-driven run command + params file |
nf-core pipelines download <name> | Download pipeline + containers for offline/HPC use |
nf-core pipelines lint | Lint a pipeline against nf-core standards (run in repo root) |
nf-core pipelines schema build | Build/edit nextflow_schema.json via web GUI |
nf-core pipelines create-params-file <name> | Generate a documented YAML params file |
nf-core pipelines bump-version / sync | Bump version / sync with template updates |
nf-core modules list/info/install/update/remove | Manage modules from nf-core/modules |
nf-core modules create / lint / test | Author, lint, and nf-test a module |
nf-core modules patch / bump-versions | Patch an installed module / bump tool versions |
nf-core subworkflows install/create/lint/test | Same lifecycle for subworkflows |
Full command reference, flags, and examples: references/nf-core-tools.md.
Essential nextflow CLI
| Command | Purpose |
|---|---|
nextflow run <pipeline> -profile <p> --outdir <dir> | Run a pipeline (path, .nf, or user/repo) |
-resume | Reuse cached results from prior run |
-r <rev> | Run a specific git revision/tag/branch |
-params-file params.yml | Supply parameters from YAML/JSON |
-c custom.config | Layer in an extra config file |
-with-report -with-trace -with-timeline -with-dag flow.html | Execution report, trace, timeline, DAG |
-stub-run | Run stub: blocks only (dry-run plumbing) |
nextflow log | Inspect past runs |
nextflow clean -f -before <run> | Delete old work/ data |
nextflow pull / drop / list / info <repo> | Manage cached remote pipelines |
Config, executors, caching internals, and tracing details: references/configuration.md.
Best Practices (high-value habits)
- Always
testfirst:-profile test,docker(orsingularity/conda) before real data — fast and catches environment problems. - Pin everything: pipeline revision (
-r),NXF_VER, and tool versions (containers). Don't runlatestfor science you'll publish. - Use
-resumeand understand caching: a task re-runs if its inputs, script, or container change. See cache-debugging inreferences/configuration.md. - Parameterize via config/params-file, not hardcoded paths. Keep
paramsand profiles innextflow.config. - One container/conda env per process; never rely on tools installed on the host.
- For nf-core dev: reuse existing modules (
nf-core modules install) before writing new ones; pass tool flags throughext.args(not hardcoded in the script); always include astub:block and nf-test tests; runnf-core pipelines lintandprettierbefore committing. - Right-size resources with
process_low/medium/highlabels anderrorStrategy 'retry'with dynamictask.attemptscaling instead of one giant request. - Write forward-compatible syntax: the strict-syntax parser becomes the default in Nextflow 26.04. Prefer lowercase
channel.of(...), explicit closure params ({ v -> ... }),deffor all variables, andemit:-named outputs. Check withnextflow lint.
Reference Files
Read the relevant file when you need depth — each is self-contained:
references/language.md— DSL2 language: processes, directives, channels, operators, workflows (take/emit), modules, dynamic resources, error handling.references/configuration.md—nextflow.config, scopes,profiles,withName/withLabelselectors, executors (local/SLURM/cloud), caching/-resumeinternals, tracing/reports, thenextflowCLI.references/containers.md— Docker, Singularity/Apptainer, Podman, Conda, Wave containers; choosing and enabling engines; common gotchas.references/running-pipelines.md— finding/running nf-core pipelines, samplesheets, params files, reference genomes (iGenomes), offline runs, institutional configs, Seqera Platform.references/nf-core-tools.md— completenf-coreCLI reference (pipelines/modules/subworkflows), flags, and workflows.references/developing.md— authoring nf-core pipelines & modules: template layout, modulemain.nf/meta.yml, meta maps,ext.args/modules.config, subworkflows, resource labels, linting & Harshil alignment style.references/testing.md— nf-test for modules/subworkflows/pipelines: test structure, assertions, snapshots, tags, running tests, CI.
Official docs: Nextflow https://www.nextflow.io/docs/latest/ · nf-core https://nf-co.re/docs/ · Training https://training.nextflow.io/
GitHub 仓库
相关推荐技能
content-collections
元Content Collections 是一个 TypeScript 优先的构建工具,可将本地 Markdown/MDX 文件转换为类型安全的数据集合。它专为构建博客、文档站和内容密集型 Vite+React 应用而设计,提供基于 Zod 的自动模式验证。该工具涵盖从 Vite 插件配置、MDX 编译到生产环境部署的完整工作流。
polymarket
元这个Claude Skill为开发者提供完整的Polymarket预测市场开发支持,涵盖API调用、交易执行和市场数据分析。关键特性包括实时WebSocket数据流,可监控实时交易、订单和市场动态。开发者可用它构建预测市场应用、实施交易策略并集成实时市场预测功能。
creating-opencode-plugins
元该Skill帮助开发者创建OpenCode插件,用于接入命令、文件、LSP等25+种事件。它提供了插件结构、事件API规范和JavaScript/TypeScript实现模式,适合需要拦截操作、扩展功能或自定义事件处理的场景。开发者可通过它快速构建响应式模块来增强OpenCode AI助手的能力。
sglang
元SGLang是一个专为LLM设计的高性能推理框架,特别适用于需要结构化输出的场景。它通过RadixAttention前缀缓存技术,在处理JSON、正则表达式、工具调用等具有重复前缀的复杂工作流时,能实现极速生成。如果你正在构建智能体或多轮对话系统,并追求远超vLLM的推理性能,SGLang是理想选择。
