data-catalog-creator

majiayu000

Updated Yesterday

1 views

Otherdata

About

The data-catalog-creator skill helps developers design and plan systems for managing metadata, data lineage, and discovery. It generates implementation plans, architectural specs, and required artifacts based on your stack and constraints. Use this skill when you need to establish or improve data governance, compliance, and discoverability within your infrastructure.

Quick Install

Claude Code

Recommended

Plugin CommandRecommended

/plugin add https://github.com/majiayu000/claude-skill-registry

Git CloneAlternative

git clone https://github.com/majiayu000/claude-skill-registry.git ~/.claude/skills/data-catalog-creator

Copy and paste this command in Claude Code to install this skill

Documentation

Data Catalog Creator

Purpose

Manage metadata, lineage, and discovery.

Preconditions

Access to system context (repos, infra, environments)
Confirmed requirements and constraints
Required approvals for security, compliance, or governance

Inputs

Problem statement and scope
Current architecture or system constraints
Non-functional requirements (performance, security, compliance)
Target stack and environment

Outputs

Design or implementation plan
Required artifacts (diagrams, configs, specs, checklists)
Validation steps and acceptance criteria

Detailed Step-by-Step Procedures

Clarify scope, constraints, and success metrics.
Review current system state, dependencies, and integration points.
Select patterns, tools, and architecture options that match constraints.
Produce primary artifacts (docs/specs/configs/code stubs).
Validate against requirements and known risks.
Provide rollout and rollback guidance.

Decision Trees and Conditional Logic

If compliance or regulatory scope applies -> add required controls and audit steps.
If latency budget is strict -> choose low-latency storage and caching.
Else -> prefer cost-optimized storage and tiering.
If data consistency is critical -> prefer transactional boundaries and strong consistency.
Else -> evaluate eventual consistency or async processing.

Error Handling and Edge Cases

Partial failures across dependencies -> isolate blast radius and retry with backoff.
Data corruption or loss risk -> enable backups and verify restore path.
Limited access to systems -> document gaps and request access early.
Legacy dependencies with limited change tolerance -> use adapters and phased rollout.

Tool Requirements and Dependencies

CLI and SDK tooling for the target stack
Credentials or access tokens for required environments
Diagramming or spec tooling when producing docs

Stack Profiles

Use Profile A, B, or C from skills/STACK_PROFILES.md.
Note selected profile in outputs for traceability.

Validation

Requirements coverage check
Security and compliance review
Performance and reliability review
Peer or stakeholder sign-off

Rollback Procedures

Revert config or deployment to last known good state.
Roll back database migrations if applicable.
Verify service health, data integrity, and error rates after rollback.

Success Metrics

Measurable outcomes (latency, error rate, uptime, cost)
Acceptance thresholds defined with stakeholders

Example Workflows and Use Cases

Minimal: apply the skill to a small service or single module.
Production: apply the skill to a multi-service or multi-tenant system.

GitHub Repository

majiayu000/claude-skill-registry

Path: skills/data-catalog-creator

Related Skills

content-collections

Meta

This skill provides a production-tested setup for Content Collections, a TypeScript-first tool that transforms Markdown/MDX files into type-safe data collections with Zod validation. Use it when building blogs, documentation sites, or content-heavy Vite + React applications to ensure type safety and automatic content validation. It covers everything from Vite plugin configuration and MDX compilation to deployment optimization and schema validation.

View skill

llamaindex

Meta

LlamaIndex is a data framework for building RAG-powered LLM applications, specializing in document ingestion, indexing, and querying. It provides key features like vector indices, query engines, and agents, and supports over 300 data connectors. Use it for document Q&A, chatbots, and knowledge retrieval when building data-centric applications.

View skill

hybrid-cloud-networking

Meta

This skill configures secure hybrid cloud networking between on-premises infrastructure and cloud platforms like AWS, Azure, and GCP. Use it when connecting data centers to the cloud, building hybrid architectures, or implementing secure cross-premises connectivity. It supports key capabilities such as VPNs and dedicated connections like AWS Direct Connect for high-performance, reliable setups.

View skill

polymarket

Meta

This skill enables developers to build applications with the Polymarket prediction markets platform, including API integration for trading and market data. It also provides real-time data streaming via WebSocket to monitor live trades and market activity. Use it for implementing trading strategies or creating tools that process live market updates.

View skill