autoskill
关于
The autoskill analyzes your screen activity via the local screenpipe daemon to identify repeated research workflows and propose new automation skills. It matches patterns against existing skills and drafts new ones or compositions for uncovered workflows, running detection locally with only redacted summaries sent to the LLM. Use this when you want Claude to analyze your actual work patterns and suggest relevant automation skills based on your behavior.
快速安装
Claude Code
推荐npx skills add K-Dense-AI/claude-scientific-skills -a claude-code/plugin add https://github.com/K-Dense-AI/claude-scientific-skillsgit clone https://github.com/K-Dense-AI/claude-scientific-skills.git ~/.claude/skills/autoskill在 Claude Code 中复制并粘贴此命令以安装该技能
技能文档
autoskill
Requires a running screenpipe daemon. This skill has no alternate data source — it reads exclusively from the local screenpipe HTTP API (default
http://localhost:3030). If the daemon isn't running,run()raisesScreenpipeUnreachablewith install instructions.
Network access & environment variables. This skill makes authenticated HTTP requests to (a) the user's local screenpipe daemon on loopback, and (b) the user-configured LLM backend — one of
http://localhost:1234/v1(LM Studio, default),https://api.anthropic.com(opt-in Claude), or a user-supplied BYOK Foundry gateway. The skill reads three environment variables —SCREENPIPE_TOKEN,ANTHROPIC_API_KEY,FOUNDRY_API_KEY— and uses each only to authenticate to the single endpoint its name implies. No other network destinations, no telemetry, no data egress to any third party.
Overview
Turn the user's own workflow history — captured passively by the local screenpipe daemon — into new skills. This skill is on-demand: the user invokes it with a time window, it queries screenpipe's local HTTP API, clusters repeated workflow patterns, compares each pattern against the existing skills in this repo, and produces a staged folder of proposals the user can review, edit, and promote.
When to Use This Skill
Invoke this skill when the user asks to:
- "Analyze my last 4 hours / day / week and propose new skills."
- "Look at what I've been doing and tell me what's not covered yet."
- "Draft a skill from my recent workflow."
- "Find composition recipes for workflows I repeat."
Do not invoke it for one-off questions about screenpipe itself, for real-time screen queries, or without an explicit user request — the skill analyzes sensitive local content and must stay explicitly user-triggered.
Privacy Posture
- Screenpipe handles app/window filtering at capture time. Install a starter deny-list by copying
references/screenpipe-config.yamlinto the user's screenpipe config. Sensitive apps (password managers, messaging, banking) are never OCR'd in the first place. - Raw OCR never leaves the machine.
scripts/fetch_window.pypulls data over localhost HTTP.scripts/cluster.pyreduces the timeline to app/duration/title summaries.scripts/redact.pystrips emails, API keys, bearer tokens, and phone numbers as defense-in-depth before any cluster summary reaches the LLM. - LLM backend defaults to
local. The recommended setup is LM Studio runningGemma-4-31B-it— strong reasoning at a size that fits on most workstation GPUs, and no data ever leaves your machine. Cloud backends (claude,foundry) are opt-in and documented inconfig.yamlfor users who explicitly want them. Detection and embeddings always run locally regardless of backend choice. - Dry-run mode (
--plan) prints the exact timeline that will be analyzed before any LLM call. - TLS for localhost (optional, for corporate policy): see
references/https-proxy.mdfor the Caddy pattern.
Prerequisites
1. Screenpipe daemon
Either install the official release or build from source. Either way the daemon binds HTTP on localhost:3030 by default.
From source (recommended if you want the CLI daemon without the desktop GUI):
git clone --depth 1 https://github.com/mediar-ai/screenpipe.git
cd screenpipe
cargo build -p screenpipe-engine --release
# System deps (macOS): cmake + full Xcode.app (not just Command Line Tools).
# brew install cmake
# # if xcodebuild plug-ins error: sudo xcodebuild -runFirstLaunch
./target/release/screenpipe doctor # confirm permissions + ffmpeg
./target/release/screenpipe record --disable-audio --use-pii-removal
First run will prompt for macOS Screen Recording permission. Grant it and relaunch.
2. Screenpipe API token
The local API now requires bearer auth. Retrieve your token and export it:
export SCREENPIPE_TOKEN=$(screenpipe auth token)
(Or set screenpipe.token directly in config.yaml — env var is preferred since it keeps secrets out of version control.)
3. Python environment
Via pipenv from the repo root:
pipenv install httpx pyyaml sentence-transformers
The embedding model (sentence-transformers/all-MiniLM-L6-v2, ~80 MB) downloads on first run.
4. Local LLM (default path) — LM Studio
- Install LM Studio.
- Download
Gemma-4-31B-it(or another strong reasoning model; adjustlocal.modelinconfig.yaml). - Load it via the CLI for headless use (no GUI required):
lms load gemma-4-31b-it --context-length 131072 --gpu max -y
lms status # confirm server running on :1234
5. Cloud LLM backends (optional, opt-in)
Only if you explicitly opt out of local:
claude: setANTHROPIC_API_KEY, flipbackend: claudeinconfig.yaml.foundry: setFOUNDRY_API_KEY, flipbackend: foundry, setfoundry.endpointto your corporate gateway URL.
Architecture
screenpipe daemon (user-installed)
│ HTTP on localhost:3030
▼
scripts/fetch_window.py → normalized timeline events
scripts/redact.py → regex scrub (defense-in-depth)
scripts/cluster.py → sessions + clusters (local only)
scripts/match_skills.py → top-k vs existing 135 skills (local embeddings)
scripts/synthesize.py → LLM judge: reuse / compose / novel
│
▼
~/.autoskill/proposed/<timestamp>/ (default; override with --out)
├── report.md
├── composition-recipes/<name>/SKILL.md
└── new-skills/<name>/SKILL.md
scripts/promote.py → user-approved proposal → scientific-skills/<name>/
Workflow
The skill ships a unified CLI at scripts/autoskill.py with three subcommands:
python scripts/autoskill.py doctor --config config.yaml --skills-dir ../
python scripts/autoskill.py run --start ... --end ... --config config.yaml
python scripts/autoskill.py promote --proposed ~/.autoskill/proposed/<ts> --skills-dir ../ --name <skill>
0. Preflight with doctor
Before a full run, verify every dependency in one shot:
python scripts/autoskill.py doctor \
--config scientific-skills/autoskill/config.yaml \
--skills-dir scientific-skills
The report covers config (backend choice valid), skills_dir (exists), screenpipe (reachable + authed), and llm (LM Studio serving or API key present). Non-zero exit on any failure, with the offending line marked error.
1. Run the pipeline
export SCREENPIPE_TOKEN=$(screenpipe auth token)
python scripts/autoskill.py run \
--start "2026-04-17T00:00:00Z" \
--end "2026-04-17T23:59:59Z" \
--config scientific-skills/autoskill/config.yaml \
--skills-dir scientific-skills
Proposals land in ~/.autoskill/proposed/<timestamp>/ by default, keeping experimental output out of the skills repo. Pass --out PATH to override.
Internally:
- Fetch —
fetch_windowpaginates screenpipe's/searchendpoint, normalizes events to{ts, app, window_title, text, content_type}. - Redact —
redactscrubs emails, API keys, bearer tokens, phones from OCR text and window titles as defense-in-depth over screenpipe's own PII removal. - Cluster —
segment_sessionssplits on idle gaps (default 10 min) and drops short sessions;cluster_sessionsgroups sessions by app-signature and keeps clusters of sizemin_cluster_size(default 2). - Match —
load_skill_descriptionsreads frontmatter from everySKILL.mdinscientific-skills/;top_k_matchesranks each cluster against all skills using localsentence-transformersembeddings (cosine similarity). - Synthesize —
synthesizeprompts the configured LLM backend to classify each cluster asreuse,compose, ornoveland emit a SKILL.md body where appropriate. - Report — writes
<out_dir>/<ts>/report.md, plusnew-skills/<name>/SKILL.mdorcomposition-recipes/<name>/SKILL.mdfor each proposal.
Add --dry-run to stop after clustering; this skips the LLM (and the sentence-transformers load), writing only plan.md for inspection.
2. Review and promote
Open ~/.autoskill/proposed/<ts>/report.md, edit drafts in place, delete anything you don't want. Then:
python scripts/autoskill.py promote \
--proposed ~/.autoskill/proposed/2026-04-17T14-30-00 \
--skills-dir scientific-skills \
--name zotero-pubmed-helper
promote moves the directory into scientific-skills/<name>/, refusing to overwrite an existing skill. Exits non-zero with a friendly error if the proposal isn't found or the target already exists.
Configuration
See config.yaml for the full shape. Default values (local-first):
backend: local
local:
endpoint: http://localhost:1234/v1 # LM Studio's Developer server
model: Gemma-4-31B-it
screenpipe:
url: http://localhost:3030 # or https://screenpipe.local via Caddy
cluster:
min_session_minutes: 5
idle_gap_minutes: 10
min_cluster_size: 2
To opt into a cloud backend:
backend: claude # or foundry
claude:
model: claude-opus-4-7
Composition recipes vs new skills
- compose: the LLM judged that chaining existing skills covers the workflow. The emitted SKILL.md is intentionally thin — frontmatter + a "Workflow" section that invokes existing skills in order. The same agent runtime that discovered the skill can then invoke it end-to-end.
- novel: no combination of existing skills covers it. A fuller SKILL.md is drafted, still following repo conventions (frontmatter, Overview, When to Use, Workflow). The user should always review new-skill drafts before promoting.
Testing
The skill is covered by a small pytest suite at tests/. Each script is unit-tested in isolation with dependency injection (mock HTTP transport, stub backend, stub embedder):
cd scientific-skills/autoskill
python -m pytest tests/ -v
Composition with other skills in this repo
The autoskill's embedding index covers all 135 sibling skills. Workflows that look like scientific writing will match scientific-writing / literature-review / citation-management; figure work will match scientific-schematics / generate-image / infographics; slide prep matches scientific-slides / pptx; etc. When a cluster scores high against two or three sibling skills the emitted composition recipe names them explicitly, so the user's future agent invocations use the optimized paths already documented in this repo.
GitHub 仓库
相关推荐技能
content-collections
元Content Collections 是一个 TypeScript 优先的构建工具,可将本地 Markdown/MDX 文件转换为类型安全的数据集合。它专为构建博客、文档站和内容密集型 Vite+React 应用而设计,提供基于 Zod 的自动模式验证。该工具涵盖从 Vite 插件配置、MDX 编译到生产环境部署的完整工作流。
polymarket
元这个Claude Skill为开发者提供完整的Polymarket预测市场开发支持,涵盖API调用、交易执行和市场数据分析。关键特性包括实时WebSocket数据流,可监控实时交易、订单和市场动态。开发者可用它构建预测市场应用、实施交易策略并集成实时市场预测功能。
creating-opencode-plugins
元该Skill帮助开发者创建OpenCode插件,用于接入命令、文件、LSP等25+种事件。它提供了插件结构、事件API规范和JavaScript/TypeScript实现模式,适合需要拦截操作、扩展功能或自定义事件处理的场景。开发者可通过它快速构建响应式模块来增强OpenCode AI助手的能力。
sglang
元SGLang是一个专为LLM设计的高性能推理框架,特别适用于需要结构化输出的场景。它通过RadixAttention前缀缓存技术,在处理JSON、正则表达式、工具调用等具有重复前缀的复杂工作流时,能实现极速生成。如果你正在构建智能体或多轮对话系统,并追求远超vLLM的推理性能,SGLang是理想选择。
