qwen-edit

digitalsamba
Updated 2 days ago

About

This skill provides AI image editing with Qwen-Image-Edit: identity-preserving photo edits, reframing, clothing and pose changes, and style transfers. It includes prompt patterns, parameter-tuning guidance, and worked examples for developers implementing these edits. Avoid it for single-image background replacement, which tends to produce cut-out artifacts and halos.

Quick Install

Claude Code

Primary (recommended):
npx skills add digitalsamba/claude-code-video-toolkit -a claude-code

Alternative (plugin command):
/plugin add https://github.com/digitalsamba/claude-code-video-toolkit

Alternative (git clone):
git clone https://github.com/digitalsamba/claude-code-video-toolkit.git ~/.claude/skills/qwen-edit

Copy and paste this command in Claude Code to install this skill

Documentation

Qwen-Image-Edit Skill

AI-powered image editing using Qwen-Image-Edit-2511 via RunPod serverless.

Status: Evolving - learnings being captured as we experiment

When to Use This Skill

Use when the user wants to:

  • Edit/transform photos while preserving identity
  • Reframe cropped images (fix cut-off heads, etc.)
  • Change clothing, add accessories
  • Change pose (arm positions, hand placement)
  • Apply style transfers (cyberpunk, anime, oil painting)
  • Adjust lighting/color grading
  • Add/remove objects
  • Character transformations (Bond, Neo, etc.)

When NOT to Use

  • Background replacement (single image) - creates cut-out artifacts, halos
  • Face swapping - cannot preserve identity from reference
  • Outpainting - can't extend canvas reliably

Use With Care

  • Multi-image compositing - CAN work with explicit identity anchors (see examples.md for prompt patterns). Requires describing distinctive features (hair texture/color, ethnicity, outfit) and using guidance ~2.0
  • Camera angle changes - Inconsistent results. Vertical angles (low/high) work better than rotational (three-quarter view)
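
The identity-anchor pattern for multi-image compositing can be sketched as a small prompt builder. This is only an illustration under stated assumptions: the helper name `build_composite_prompt` and its parameters are hypothetical and not part of the toolkit, while the sentence template and negative prompt are copied from the multi-image example in the Quick Reference.

```python
# Hypothetical helper for illustration; it is not part of tools/image_edit.py.
# The template and negative prompt mirror the multi-image Quick Reference example.

COMPOSITE_NEGATIVE = (
    "different ethnicity, different hair color, "
    "different face shape, generic stock photo"
)


def build_composite_prompt(ethnicity, gender, hair, scene, features, outfit):
    """Fill the identity-anchor template with explicit, distinctive features."""
    fields = {
        "ethnicity": ethnicity,
        "gender": gender,
        "hair": hair,
        "scene": scene,
        "features": features,
        "outfit": outfit,
    }
    # Every anchor must be spelled out; an empty one weakens identity preservation.
    missing = [name for name, value in fields.items() if not value.strip()]
    if missing:
        raise ValueError("missing identity anchors: " + ", ".join(missing))
    return (
        f"The {ethnicity} {gender} with {hair} from first image is now in "
        f"{scene} from second image. Same {features}, {outfit}."
    )


if __name__ == "__main__":
    print(build_composite_prompt(
        "Japanese", "woman", "short black hair",
        "a sunlit office", "facial features", "navy blazer",
    ))
```

The resulting string would be passed as `--prompt`, with `COMPOSITE_NEGATIVE` as `--negative` and guidance near 2.0, as the bullet above suggests.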

Quick Reference

# Basic edit
python tools/image_edit.py --input photo.jpg --prompt "Add sunglasses"

# With negative prompt (recommended)
python tools/image_edit.py --input photo.jpg \
  --prompt "Reframe as portrait with full head visible" \
  --negative "blur, distortion, artifacts"

# Style transfer
python tools/image_edit.py --input photo.jpg --style cyberpunk

# Background (use cautiously - often fails)
python tools/image_edit.py --input photo.jpg --background office

# Higher quality
python tools/image_edit.py --input photo.jpg --prompt "..." --steps 16 --guidance 3.0

# Multi-image composite (identity-preserving)
python tools/image_edit.py --input person.jpg background.jpg \
  --prompt "The [ethnicity] [gender] with [hair description] from first image is now in [scene] from second image. Same [features], [outfit]." \
  --negative "different ethnicity, different hair color, different face shape, generic stock photo" \
  --steps 16 --guidance 2.0
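
When these commands are driven from a script rather than typed by hand, the same invocations can be assembled programmatically. The sketch below is a minimal, hypothetical wrapper (`build_edit_command` is not part of the toolkit). It uses only the flags shown above and assumes, as in the multi-image example, that `--input` accepts several space-separated paths.

```python
import shlex

# Path taken from the "Tool Location" section of this document.
SCRIPT = "tools/image_edit.py"


def build_edit_command(inputs, prompt=None, negative=None, style=None,
                       background=None, steps=None, guidance=None):
    """Return an argv list for tools/image_edit.py (hypothetical helper).

    Only flags that appear in the Quick Reference are emitted.
    """
    if not inputs:
        raise ValueError("at least one input image is required")
    cmd = ["python", SCRIPT, "--input", *inputs]
    optional = [
        ("--prompt", prompt),
        ("--negative", negative),
        ("--style", style),
        ("--background", background),
        ("--steps", steps),
        ("--guidance", guidance),
    ]
    for flag, value in optional:
        if value is not None:
            cmd += [flag, str(value)]
    return cmd


if __name__ == "__main__":
    cmd = build_edit_command(
        ["photo.jpg"],
        prompt="Add sunglasses",
        negative="blur, distortion, artifacts",
    )
    # Print a shell-quoted form for inspection.
    # To execute instead: subprocess.run(cmd, check=True)
    print(shlex.join(cmd))
```

Building an argument list rather than a single shell string avoids quoting problems when prompts contain commas, quotes, or spaces.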

Key Files

  • prompting.md - Prompt patterns and structure
  • examples.md - Good/bad examples from experiments
  • parameters.md - Tuning steps, guidance, negative prompts

Tool Location

tools/image_edit.py - CLI wrapper for RunPod endpoint

Related Docs

  • docs/qwen-edit-patterns.md - Character transformation patterns
  • .ai_dev/qwen-edit-research.md - Research notes

GitHub Repository

digitalsamba/claude-code-video-toolkit
Path: .claude/skills/qwen-edit
Topics: ai-video-generator, claude-code, developer-tools, elevenlabs, open-source, openclaw