register-ml-model
정보
이 스킬은 훈련된 모델을 MLflow 모델 레지스트리에 버전 관리와 함께 등록하고, 승인 워크플로우를 통해 단계 전환(스테이징, 프로덕션, 보관)을 관리합니다. 이를 통해 모델을 프로덕션으로 승격시키고, 버전을 롤백하며, 규정 준수를 위한 변경 사항을 감사할 수 있습니다. 모델 계보를 관리하고, 배포를 추적하며, 모델 라이프사이클 전반에 걸쳐 거버넌스를 구현하는 데 활용하세요.
빠른 설치
Claude Code
추천npx skills add pjt222/agent-almanac -a claude-code/plugin add https://github.com/pjt222/agent-almanacgit clone https://github.com/pjt222/agent-almanac.git ~/.claude/skills/register-ml-modelClaude Code에서 이 명령을 복사하여 붙여넣어 스킬을 설치하세요
문서
ML-Modell registrieren
See Extended Examples for complete configuration files and templates.
Implementieren MLflow Modellieren Registry for systematic model versioning, stage management, and deployment governance.
Wann verwenden
- Promoting a trained model from experimentation to production
- Managing multiple model versions across development stages
- Implementing model approval workflows for governance
- Tracking model lineage from training to deployment
- Rolling back to previous model versions
- Comparing deployed model versions for A/B testing
- Auditing model changes for compliance requirements
Eingaben
- Erforderlich: MLflow tracking server with Modellieren Registry enabled
- Erforderlich: Trained model logged with MLflow (from tracking runs)
- Erforderlich: Modellieren name for registry registration
- Optional: Approval workflow integration (email, Slack, Jira)
- Optional: CI/CD pipeline for automated promotion
- Optional: Modellieren validation metrics thresholds
Vorgehensweise
Schritt 1: Konfigurieren Modellieren Registry Backend
Einrichten MLflow Modellieren Registry with database backend (file-based registry not recommended for production).
# Start MLflow server with Model Registry support
mlflow server \
--backend-store-uri postgresql://user:pass@localhost:5432/mlflow \
--default-artifact-root s3://mlflow-artifacts/models \
--host 0.0.0.0 \
--port 5000
Python configuration:
# model_registry_config.py
import mlflow
from mlflow.tracking import MlflowClient
# Set tracking URI (must support Model Registry)
MLFLOW_TRACKING_URI = "http://mlflow-server.company.com:5000"
mlflow.set_tracking_uri(MLFLOW_TRACKING_URI)
# ... (see EXAMPLES.md for complete implementation)
Erwartet: Modellieren Registry UI tab appears in MLflow, search_registered_models() returns erfolgreich (even if empty), database contains registered_models table.
Bei Fehler: Verifizieren MLflow version ≥1.2 (Modellieren Registry introduced in 1.2), check database backend (SQLite not fully supported for Modellieren Registry), ensure --backend-store-uri points to database (not file://), verify database user has CREATE TABLE Berechtigungs, check MLflow server logs for migration errors.
Schritt 2: Registrieren Modellieren from Training Run
Registrieren a logged model to the Modellieren Registry with comprehensive metadata.
# register_model.py
import mlflow
from mlflow.tracking import MlflowClient
from model_registry_config import MLFLOW_TRACKING_URI
mlflow.set_tracking_uri(MLFLOW_TRACKING_URI)
client = MlflowClient()
# ... (see EXAMPLES.md for complete implementation)
Erwartet: New model version appears in Modellieren Registry UI, version includes description and tags, model artifacts are accessible via models:/<model-name>/<version> URI, model signature and input example are preserved.
Bei Fehler: Verifizieren run_id exists and has completed (client.get_run(run_id)), check model artifact path matches logged artifact (mlflow.search_runs() to inspect), ensure model was logged with proper framework flavor (mlflow.sklearn.log_model not mlflow.log_artifact), verify no special characters in model name (use hyphens not underscores), check artifact storage accessibility.
Schritt 3: Implementieren Stage Transitions with Validation
Move model versions durch stages (None → Staging → Production → Archived) with validation checks.
# stage_management.py
import mlflow
from mlflow.tracking import MlflowClient
from datetime import datetime
client = MlflowClient()
class ModelStageManager:
# ... (see EXAMPLES.md for complete implementation)
Erwartet: Modellieren version stage updates in registry, old versions archived automatisch, transition timestamps recorded in tags, rollback restores previous production version.
Bei Fehler: Check version exists and is in expected stage, verify archive_existing_versions flag behavior (may not archive if only one version), ensure database supports concurrent transactions for stage updates, check for stage transition locks (only one transition per version at a time), verify approval workflow integration.
Schritt 4: Implementieren Modellieren Aliasing and References
Use model aliases for stable deployment references (MLflow ≥2.0).
# model_aliases.py
from mlflow.tracking import MlflowClient
client = MlflowClient()
def set_model_alias(model_name, version, alias):
"""
Set an alias for a model version (MLflow 2.0+).
# ... (see EXAMPLES.md for complete implementation)
Erwartet: Aliases appear in Modellieren Registry UI, loading models by alias works (models:/name@alias), updating alias sofort affects new loads, A/B test infrastructure functional.
Bei Fehler: Upgrade MLflow to ≥2.0 for native alias support, use tag-based fallback for older versions, verify alias naming (alphanumeric and hyphens only), check for alias conflicts (one alias per model version).
Schritt 5: Implementieren Modellieren Lineage Tracking
Verfolgen full lineage from data to deployment with comprehensive metadata.
# model_lineage.py
import mlflow
from mlflow.tracking import MlflowClient
import json
client = MlflowClient()
def enrich_model_metadata(model_name, version, lineage_data):
# ... (see EXAMPLES.md for complete implementation)
Erwartet: Modellieren version tags include comprehensive lineage information, get_model_lineage() returns full history, JSON report contains Datenquelle, training details, and deployment info.
Bei Fehler: Verifizieren tag values are strings (convert dicts to JSON), check tag key naming (no spaces or special chars), ensure lineage data captured waehrend training, verify run_id is valid and accessible.
Schritt 6: Automate Registry Operations with CI/CD
Integrieren model registration into CI/CD pipelines for automated promotion.
# .github/workflows/model_promotion.yml
name: Model Promotion Pipeline
on:
workflow_dispatch:
inputs:
model_name:
description: 'Model name to promote'
# ... (see EXAMPLES.md for complete implementation)
Python automation script:
# scripts/promote_model.py
import argparse
from stage_management import ModelStageManager
def main():
parser = argparse.ArgumentParser()
parser.add_argument("--model-name", required=True)
parser.add_argument("--version", type=int, required=True)
# ... (see EXAMPLES.md for complete implementation)
Erwartet: GitHub Actions workflow triggers on manual dispatch, validation tests pass, model promoted to target stage, Slack notification sent, deployment pipeline triggered automatisch.
Bei Fehler: Check GitHub secrets configuration for MLFLOW_TRACKING_URI, verify network access from GitHub Actions to MLflow server (may need VPN or IP allowlist), ensure validation script has correct metric thresholds, check Slack webhook configuration, verify Python script executable Berechtigungs.
Validierung
- Modellieren Registry accessible and backend configured
- Models register erfolgreich from training runs
- Stage transitions work (None → Staging → Production → Archived)
- Validation checks enforce quality thresholds
- Modellieren aliases set and resolved korrekt
- Lineage metadata captured comprehensively
- Rollback functionality restores vorherige Versions
- CI/CD pipeline automates promotions
- Team notifications working for stage changes
- Modellieren URIs resolve korrekt in all stages
Haeufige Stolperfallen
- SQLite limitations: Modellieren Registry requires database backend (PostgreSQL/MySQL) for production - file-based registry causes concurrency issues
- Stage conflicts: Multiple versions in same stage cause confusion - use
archive_existing_versions=Trueto auto-archive - Missing run linkage: Registering models ohne run_id loses lineage - always register from MLflow runs, not raw files
- Alias confusion: Using stages as deployment targets stattdessen of aliases - stages are for workflow, aliases for deployment references
- Validation skipped: Promoting to Production ohne checks - implement mandatory validation in CI/CD pipeline
- No rollback plan: Production issues ohne rollback capability - maintain previous Production version in Archived stage
- Tag overload: Too many unstructured tags - standardize tag schema and naming conventions
- Manual processes: Human-driven promotions are error-prone and slow - automate with CI/CD and approval workflows
- Lost artifacts: Modellieren registered but artifacts deleted from storage - ensure artifact retention policies align with model lifecycle
Verwandte Skills
track-ml-experiments- Log models to MLflow vor registering themdeploy-ml-model-serving- Bereitstellen registered models to serving infrastructurerun-ab-test-models- A/B test models using registry aliasesorchestrate-ml-pipeline- Automate model training and registrationversion-ml-data- Version training data for model lineage
GitHub 저장소
연관 스킬
qmd
개발qmd는 BM25, 벡터 임베딩, 재순위화를 결합한 하이브리드 검색을 통해 로컬 파일을 색인화하고 검색할 수 있는 로컬 검색 및 색인화 CLI 도구입니다. 명령줄 사용과 Claude 통합을 위한 MCP(Model Context Protocol) 모드를 모두 지원합니다. 이 도구는 임베딩에 Ollama를 사용하고 색인을 로컬에 저장하여 터미널에서 직접 문서나 코드베이스를 검색하는 데 이상적입니다.
subagent-driven-development
개발이 스킬은 각 독립적인 작업마다 새로운 하위 에이전트를 배치하고 작업 사이에 코드 리뷰를 진행하여 구현 계획을 실행합니다. 이 리뷰 프로세스를 통해 품질 게이트를 유지하면서 빠른 반복 작업을 가능하게 합니다. 동일한 세션 내에서 대부분 독립적인 작업을 진행할 때 내장된 품질 검증과 함께 지속적인 진행을 보장하기 위해 사용하세요.
mcporter
개발mcporter 스킬은 개발자가 Claude에서 직접 Model Context Protocol(MCP) 서버를 관리하고 호출할 수 있도록 합니다. 이 스킬은 사용 가능한 서버를 나열하고, 인수를 사용해 해당 서버의 도구를 호출하며, 인증 및 데몬 생명주기를 처리하는 명령어를 제공합니다. 개발 워크플로우에서 MCP 서버 기능을 통합하고 테스트할 때 이 스킬을 사용하세요.
adk-deployment-specialist
개발이 스킬은 A2A 프로토콜을 사용하여 Vertex AI ADK 에이전트를 배포하고 오케스트레이션하며, AgentCard 검색, 작업 제출, 코드 실행 샌드박스 및 메모리 뱅크와 같은 지원 도구를 관리합니다. Python, Java 또는 Go 언어로 순차, 병렬 또는 루프 오케스트레이션 패턴을 갖춘 다중 에이전트 시스템 구축을 가능하게 합니다. Google Cloud에서 ADK 에이전트 배포 또는 에이전트 워크플로우 오케스트레이션을 요청받았을 때 사용하세요.
