Back to Skills

c-screen

daxaur
Updated 2 days ago
6 views
143
6
143
View on GitHub
Designscreenshotocrwebcampeekaboocamsnapvision

About

The c-screen skill enables Claude to capture screenshots with OCR text extraction and take webcam images for visual analysis. It provides tools for full/region-specific screenshots, window capture, and immediate OCR processing. Use this skill when you need to programmatically analyze screen content or camera input within your applications.

Quick Install

Claude Code

Recommended
Primary
npx skills add daxaur/openpaw -a claude-code
Plugin CommandAlternative
/plugin add https://github.com/daxaur/openpaw
Git CloneAlternative
git clone https://github.com/daxaur/openpaw.git ~/.claude/skills/c-screen

Copy and paste this command in Claude Code to install this skill

Documentation

What This Skill Does

Enables Claude to take screenshots, extract text from the screen via OCR, and capture webcam images for visual analysis using peekaboo and camsnap.

Available CLI Tools

peekaboo — Screenshots & OCR

# Take a full screenshot and save to file
peekaboo screenshot --output ~/Desktop/screen.png

# Take screenshot and extract all text via OCR
peekaboo ocr

# OCR a specific region (x, y, width, height)
peekaboo ocr --region 0,0,1280,720

# Screenshot a specific window by app name
peekaboo screenshot --app "Safari" --output window.png

# Screenshot and pipe to stdout for immediate analysis
peekaboo screenshot --stdout

camsnap — Webcam Capture

# Capture a single webcam frame
camsnap --output ~/Desktop/photo.jpg

# Capture with a specific camera device
camsnap --device 0 --output shot.jpg

# Capture and print file path
camsnap --output /tmp/cam.jpg && echo "Saved"

Usage Guidelines

  • Use peekaboo ocr when the user wants text extracted from the screen
  • Use camsnap only when the user explicitly wants a webcam image
  • After capturing, read the image file to analyze its contents visually

Notes

  • macOS screen recording permission required for peekaboo
  • Camera permission required for camsnap
  • OCR accuracy depends on font size and screen resolution

GitHub Repository

daxaur/openpaw
Path: skills/c-screen
0
ai-agentanthropicautomationclaudeclaude-codecli

Related Skills

executing-plans

Design

Use the executing-plans skill when you have a complete implementation plan to execute in controlled batches with review checkpoints. It loads and critically reviews the plan, then executes tasks in small batches (default 3 tasks) while reporting progress between each batch for architect review. This ensures systematic implementation with built-in quality control checkpoints.

View skill

requesting-code-review

Design

This skill dispatches a code-reviewer subagent to analyze code changes against requirements before proceeding. It should be used after completing tasks, implementing major features, or before merging to main. The review helps catch issues early by comparing the current implementation with the original plan.

View skill

connect-mcp-server

Design

This skill provides a comprehensive guide for developers to connect MCP servers to Claude Code using HTTP, stdio, or SSE transports. It covers installation, configuration, authentication, and security for integrating external services like GitHub, Notion, and custom APIs. Use it when setting up MCP integrations, configuring external tools, or working with Claude's Model Context Protocol.

View skill

web-cli-teleport

Design

This skill helps developers choose between Claude Code Web and CLI interfaces based on task analysis, then enables seamless session teleportation between these environments. It optimizes workflow by managing session state and context when switching between web, CLI, or mobile. Use it for complex projects requiring different tools at various stages.

View skill