vision

mattnigh
Updated 4 days ago
Documents, pdf

About

The vision skill processes images and PDFs to perform tasks like description, summarization, and analysis based on user prompts. A key feature is its ability to precisely recreate UI elements from screenshots or documents using CSS, HTML, and JavaScript. Developers can use it by executing a local Python script that takes a file path and a text request as inputs.
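The invocation described above, a script that receives a file path and a text request, can be sketched roughly as follows. This is a minimal illustration and not the skill's actual script: the function name `parse_request`, the list of supported extensions, and the returned dictionary layout are all assumptions made for the example.

```python
import argparse
from pathlib import Path

# Assumption: the set of accepted extensions is illustrative only;
# the real skill may accept a different set of file types.
SUPPORTED_SUFFIXES = {".png", ".jpg", ".jpeg", ".gif", ".webp", ".pdf"}


def parse_request(argv):
    """Parse a file path and a text request into a simple request dict.

    Hypothetical helper: it mirrors the two inputs named in the
    description, but is not taken from the skill's source code.
    """
    parser = argparse.ArgumentParser(
        description="Sketch of a vision-style command line interface."
    )
    parser.add_argument("file_path", type=Path, help="image or PDF to process")
    parser.add_argument("prompt", help="what to do with the file")
    args = parser.parse_args(argv)

    suffix = args.file_path.suffix.lower()
    if suffix not in SUPPORTED_SUFFIXES:
        # parser.error prints a usage message and exits with status 2.
        parser.error(f"unsupported file type: {suffix or '(none)'}")

    # Distinguish PDFs from images so a caller could route them differently.
    kind = "pdf" if suffix == ".pdf" else "image"
    return {"path": args.file_path, "kind": kind, "prompt": args.prompt}
```

With this sketch, `parse_request(["mockup.png", "Recreate this UI in HTML and CSS"])` yields a request marked as an image, while an unsupported extension such as `.txt` stops with a usage error before any processing would begin.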

Quick Install

Claude Code

Recommended (primary):
npx skills add mattnigh/skills_collection -a claude-code

Alternative (plugin command):
/plugin add https://github.com/mattnigh/skills_collection

Alternative (git clone):
git clone https://github.com/mattnigh/skills_collection.git ~/.claude/skills/vision

GitHub Repository

mattnigh/skills_collection
Path: collection/flyingtimes__podcast-using-skill__claude__skills__vision__SKILL.md
