track-ml-experiments

Name: track-ml-experiments
Author: pjt222

Updated 1 month ago

Category: Meta. Tags: ai, automation, design

About

This skill sets up an MLflow tracking server for experiment management with autologging for popular frameworks. It enables systematic comparison of training runs via metrics and visualizations while managing artifacts in remote storage. Use it when starting ML projects requiring reproducible workflows, migrating from manual logging, or comparing multiple runs with full lineage tracking.

Quick Install

Claude Code (Recommended)

Primary:

npx skills add pjt222/agent-almanac -a claude-code

Plugin Command (Alternative):

/plugin add https://github.com/pjt222/agent-almanac

Git Clone (Alternative):

git clone https://github.com/pjt222/agent-almanac.git ~/.claude/skills/track-ml-experiments

Copy and paste one of these commands into Claude Code to install this skill.

Documentation

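Track ML Experiments

See Extended Examples for complete configuration files and templates.

Set up an MLflow tracking server and implement comprehensive experiment tracking covering metrics, parameters, and artifacts.

When to Use

Starting a new ML project that needs experiment tracking
Migrating from manual logging to automated tracking
Systematically comparing multiple model training runs
Sharing experiment results with the team
Building reproducible ML workflows with full lineage tracking
Integrating experiment tracking into CI/CD pipelines

Inputs

Required: Python environment with an ML framework (sklearn, pytorch, tensorflow, xgboost)
Required: MLflow installed (pip install mlflow)
Optional: Remote storage backend (S3, Azure Blob, GCS) for artifacts
Optional: Database backend (PostgreSQL, MySQL) for metadata storage
Optional: Authentication credentials for the remote backend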
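
The prerequisites listed above can be checked before any tracking code runs. The following is a minimal sketch, assuming the usual import names for each package (scikit-learn as sklearn, PyTorch as torch); the helper names is_installed and check_prerequisites are hypothetical and not part of the skill.

```python
from importlib.util import find_spec

# Import names are assumptions: scikit-learn imports as "sklearn", PyTorch as "torch".
REQUIRED = ("mlflow",)
FRAMEWORKS = ("sklearn", "torch", "tensorflow", "xgboost")


def is_installed(module_name: str) -> bool:
    """Return True if the top-level module can be located without importing it."""
    return find_spec(module_name) is not None


def check_prerequisites(required=REQUIRED, frameworks=FRAMEWORKS) -> dict:
    """Report missing required packages and which ML frameworks are available."""
    missing = [name for name in required if not is_installed(name)]
    available = [name for name in frameworks if is_installed(name)]
    return {
        "missing_required": missing,
        "available_frameworks": available,
        # Ready only when every required package and at least one framework is found.
        "ready": not missing and bool(available),
    }
```

Calling check_prerequisites() with no arguments reports on the current environment; the optional remote storage and database backends are not covered by this check.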

Procedure

Step 1: Initialize the MLflow Tracking Server

Set up the MLflow tracking server with an appropriate backend.

# Option 1: Local file-based tracking (development)
mkdir -p mlruns
export MLFLOW_TRACKING_URI="file:./mlruns"

# Option 2: SQLite backend with local artifacts
mlflow server \
  --backend-store-uri sqlite:///mlflow.db \
  --default-artifact-root ./mlartifacts \
# ... (see EXAMPLES.md for complete implementation)

Create a configuration file to share with the team:

# mlflow_config.py
import os

MLFLOW_TRACKING_URI = os.getenv(
    "MLFLOW_TRACKING_URI",
    "http://mlflow-server.company.com:5000"
)

# ... (see EXAMPLES.md for complete implementation)
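
To illustrate how the environment variable overrides the shared default, here is a minimal sketch. The function name resolve_tracking_uri and the KNOWN_SCHEMES set are assumptions made for this example; the scheme list is a non-exhaustive subset, and this sketch rejects plain local paths even though MLflow itself accepts them.

```python
import os
from urllib.parse import urlparse

# Default taken from the shared config; the scheme set is an assumed,
# non-exhaustive subset used only for this illustration.
DEFAULT_TRACKING_URI = "http://mlflow-server.company.com:5000"
KNOWN_SCHEMES = {"file", "http", "https", "sqlite", "postgresql", "mysql"}


def resolve_tracking_uri(env=None, default=DEFAULT_TRACKING_URI) -> str:
    """Return MLFLOW_TRACKING_URI from the environment, else the team default."""
    env = os.environ if env is None else env
    uri = env.get("MLFLOW_TRACKING_URI") or default
    scheme = urlparse(uri).scheme
    if scheme not in KNOWN_SCHEMES:
        # Fail early so a typo does not surface later as a connection error.
        raise ValueError(f"Unexpected tracking URI scheme {scheme!r} in {uri!r}")
    return uri
```

Failing early on an unexpected scheme is a design choice of this sketch: it turns a mistyped URI into an immediate error instead of a timeout during training.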

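Expected: The MLflow UI is accessible at the specified host:port and shows an empty experiment list. The server logs confirm a successful startup with no errors.

On failure: Check that the port is available with netstat -tulpn | grep 5000, verify the database connection string, confirm that S3 credentials are configured (aws configure), and check firewall rules for remote access.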
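
The port check can also be done from Python, which helps inside containers where netstat is not installed. This is a minimal sketch using only the standard library; the function name is_port_in_use is hypothetical, and port 5000 is simply the port used in the examples above.

```python
import socket


def is_port_in_use(port: int, host: str = "127.0.0.1", timeout: float = 1.0) -> bool:
    """Return True if something accepts TCP connections on host:port."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        sock.settimeout(timeout)
        # connect_ex returns 0 on success instead of raising an exception.
        return sock.connect_ex((host, port)) == 0
```

Before starting the server, is_port_in_use(5000) returning True suggests another process already holds the port. After starting it, the same result suggests the server is listening, although it does not prove that the listener is MLflow.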

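Step 2: Configure Autologging for ML Frameworks

Enable framework autologging to capture metrics, parameters, and models automatically.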

# training_script.py
import mlflow
from mlflow_config import MLFLOW_TRACKING_URI, MLFLOW_EXPERIMENT_NAME

# Set tracking URI
mlflow.set_tracking_uri(MLFLOW_TRACKING_URI)
mlflow.set_experiment(MLFLOW_EXPERIMENT_NAME)

# ... (see EXAMPLES.md for complete implementation)

PyTorch:

import mlflow.pytorch

mlflow.pytorch.autolog(
    log_every_n_epoch=1,
    log_every_n_step=None,
    log_models=True,
    disable=False,
    exclusive=False,
# ... (see EXAMPLES.md for complete implementation)

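Expected: Runs appear in the MLflow UI with all hyperparameters, metrics (training/validation loss, accuracy), model artifacts, and input examples logged automatically.

On failure: Verify that the MLflow version is compatible with the ML framework (mlflow.sklearn.autolog() requires MLflow ≥1.20), check whether autologging supports the model type, disable autologging and fall back to manual logging, and confirm the URI passed to mlflow.set_tracking_uri() while checking the logs for connection errors.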
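
The version check in the failure guidance can be automated. The sketch below compares the installed MLflow version against the 1.20 minimum quoted above, using only the standard library; parse_version, meets_minimum, and can_autolog are hypothetical helper names, and the parser is deliberately simple (it reads only the leading integers of each dotted component).

```python
from importlib.metadata import PackageNotFoundError, version


def parse_version(text: str) -> tuple:
    """Turn '2.9.2' or '1.20.0rc1' into a comparable tuple of leading integers."""
    parts = []
    for piece in text.split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break  # stop at the first component with no leading digits, e.g. 'dev0'
        parts.append(int(digits))
    return tuple(parts)


def meets_minimum(installed: str, minimum: str = "1.20") -> bool:
    """Return True if the installed version is at least the minimum."""
    return parse_version(installed) >= parse_version(minimum)


def can_autolog(package: str = "mlflow", minimum: str = "1.20") -> bool:
    """Return False when the package is missing or too old, signalling manual logging."""
    try:
        return meets_minimum(version(package), minimum)
    except PackageNotFoundError:
        return False
```

A training script could call can_autolog() once at startup and switch to manual logging calls when it returns False.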

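Step 3: Implement Comprehensive Manual Logging

Add custom metrics, parameters, artifacts, and tags for complete experiment documentation.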

# comprehensive_tracking.py
import mlflow
import numpy as np
import matplotlib.pyplot as plt
from pathlib import Path

def train_and_log_model(params, X_train, y_train, X_test, y_test):
    """
# ... (see EXAMPLES.md for complete implementation)

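Expected: The MLflow UI shows rich experiment information, including step-by-step metrics, visualization artifacts, model signatures, input examples, and comprehensive tags for filtering and search.

On failure: Check artifact storage permissions (aws s3 ls s3://bucket/path), verify the matplotlib backend for figure logging (plt.switch_backend('Agg')), ensure that data passed to log_dict uses JSON-serializable types, and check disk space for local artifact storage.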
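
A common cause of log_dict failures is a value that JSON cannot encode, such as a Path, a set, or a NumPy scalar. The following is a minimal sketch of a converter, assuming it is acceptable to stringify dictionary keys and to fall back to repr() for unknown objects; the function name to_json_safe is hypothetical.

```python
import json
from datetime import date, datetime
from pathlib import Path


def to_json_safe(value):
    """Recursively convert common non-JSON types into JSON-serializable ones."""
    if isinstance(value, dict):
        return {str(key): to_json_safe(item) for key, item in value.items()}
    if isinstance(value, (list, tuple)):
        return [to_json_safe(item) for item in value]
    if isinstance(value, (set, frozenset)):
        # Sets have no order; sort by repr so the output is deterministic.
        return sorted((to_json_safe(item) for item in value), key=repr)
    if isinstance(value, (datetime, date)):
        return value.isoformat()
    if isinstance(value, Path):
        return str(value)
    if isinstance(value, (str, int, float, bool)) or value is None:
        return value
    # NumPy scalars and arrays expose .tolist(); duck-typed to avoid importing numpy.
    if hasattr(value, "tolist"):
        return to_json_safe(value.tolist())
    return repr(value)  # last resort: keep a readable string
```

The converted dictionary can then be passed to mlflow.log_dict(to_json_safe(summary), "summary.json"), where summary and the file name are placeholders.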

Step 4: Compare Runs and Generate Reports

Use MLflow's comparison tools to analyze multiple experiments.

# compare_runs.py
import mlflow
from mlflow.tracking import MlflowClient

client = MlflowClient()

def compare_experiments(experiment_name, metric_name="test_accuracy", top_n=5):
    """
# ... (see EXAMPLES.md for complete implementation)

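Command-line comparison:

# List the runs in an experiment with the MLflow CLI.
# EXPERIMENT_ID is a placeholder for the ID of the customer-churn experiment.
# The CLI has no sort option; to rank by metrics.test_accuracy DESC and keep the
# top 10, use mlflow.search_runs(order_by=[...], max_results=10) from Python.
mlflow runs list --experiment-id "$EXPERIMENT_ID"

# Export run data to CSV
mlflow experiments csv --experiment-id "$EXPERIMENT_ID" \
  --filename experiments.csv

Expected: Terminal output shows the ranked runs with their key metrics, an HTML report is generated with a formatted comparison table, and a CSV file contains all run data for deeper analysis.

On failure: Verify that the experiment exists with mlflow experiments search (mlflow experiments list on MLflow 1.x), check that metric names match exactly (they are case-sensitive), ensure the runs completed (check the run status), and verify file write permissions for the output files.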
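
The ranking step can be sketched as follows. This is not the implementation from EXAMPLES.md: the function names rank_runs and write_ranking_csv are hypothetical, the client is passed in so that any object with MlflowClient's get_experiment_by_name and search_runs methods works, and sorting is done locally instead of through order_by.

```python
import csv


def rank_runs(client, experiment_name, metric_name="test_accuracy", top_n=5):
    """Return (run_id, metric_value) pairs for the best finished runs."""
    experiment = client.get_experiment_by_name(experiment_name)
    if experiment is None:
        raise ValueError(f"Experiment {experiment_name!r} does not exist")
    # Only the first page of results is read here; large experiments need paging.
    runs = client.search_runs(
        experiment_ids=[experiment.experiment_id],
        filter_string="attributes.status = 'FINISHED'",
    )
    # Metric names are case-sensitive; runs that never logged the metric are skipped.
    scored = [
        (run.info.run_id, run.data.metrics[metric_name])
        for run in runs
        if metric_name in run.data.metrics
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_n]


def write_ranking_csv(ranking, path, metric_name="test_accuracy"):
    """Write the ranked (run_id, value) pairs to a CSV file with a header row."""
    with open(path, "w", newline="", encoding="utf-8") as handle:
        writer = csv.writer(handle)
        writer.writerow(["run_id", metric_name])
        writer.writerows(ranking)
```

With MLflow installed, rank_runs(MlflowClient(), "customer-churn") would return the top five finished runs by test_accuracy. Skipping runs that lack the metric mirrors the failure guidance about exact metric names.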

Step 5: Configure Remote Artifact Storage

Set up an S3/Azure/GCS backend for scalable artifact management.

# artifact_storage_config.py
import mlflow
import os

def configure_s3_backend():
    """
    Configure S3 for artifact storage.
    """
# ... (see EXAMPLES.md for complete implementation)

Docker Compose for MLflow with PostgreSQL and S3:

# docker-compose.yml
version: '3.8'

services:
  postgres:
    image: postgres:14
    environment:
      POSTGRES_DB: mlflow
# ... (see EXAMPLES.md for complete implementation)

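Expected: Artifacts upload successfully to remote storage, the MLflow UI shows artifact links pointing to S3/Azure/GCS URIs, and downloading artifacts from the UI works correctly.

On failure: Verify cloud credentials with aws s3 ls or az storage blob list, check bucket/container permissions (write access is required), ensure MLflow is installed with its cloud extras (pip install mlflow[extras]), test network connectivity to the storage endpoint, and check CORS settings for browser access.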
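
A quick pre-flight check can catch missing S3 settings before the first upload fails. This sketch assumes credentials are supplied through environment variables, as is typical in a Docker Compose setup; credentials stored by aws configure or provided by an IAM role are not detected by it. The helper names and the set of remote schemes are assumptions made for this example.

```python
import os
from urllib.parse import urlparse

# Standard AWS credential variables. MLFLOW_S3_ENDPOINT_URL is optional and
# only needed for S3-compatible services such as MinIO.
REQUIRED_S3_VARS = ("AWS_ACCESS_KEY_ID", "AWS_SECRET_ACCESS_KEY")
OPTIONAL_S3_VARS = ("AWS_DEFAULT_REGION", "MLFLOW_S3_ENDPOINT_URL")
# Assumed subset of remote artifact schemes: S3, GCS, Azure Blob.
REMOTE_SCHEMES = {"s3", "gs", "wasbs"}


def missing_s3_settings(env=None) -> list:
    """Return the required S3 variables that are unset or empty."""
    env = os.environ if env is None else env
    return [name for name in REQUIRED_S3_VARS if not env.get(name)]


def is_remote_artifact_root(uri: str) -> bool:
    """Return True if the artifact root points at remote object storage."""
    return urlparse(uri).scheme in REMOTE_SCHEMES
```

An empty list from missing_s3_settings() only shows that the variables are present; verifying that they actually grant write access still requires the aws s3 ls check described above.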

Step 6: Implement Experiment Lifecycle Management

Set up automated cleanup, archival, and organization policies.

# lifecycle_management.py
import mlflow
from mlflow.tracking import MlflowClient
from datetime import datetime, timedelta

client = MlflowClient()

def archive_old_experiments(days_old=90):
# ... (see EXAMPLES.md for complete implementation)

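Expected: Old experiments are moved to the deleted state, failed runs are removed from the active list, the best runs are tagged for easy filtering in the UI, and storage space is reclaimed.

On failure: Check experiment permissions (only the owner can delete), verify that the runs are in FAILED status, ensure the metric exists on every run being ranked, check the database connection for batch operations, and verify sufficient permissions to delete artifacts in remote storage.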
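
The cleanup policy can be sketched as a selection step followed by an optional delete. This is an illustration under stated assumptions, not the script from EXAMPLES.md: it treats a run as stale when it failed or started before the cutoff, takes the client as an argument (any object with MlflowClient's search_runs and delete_run methods), and defaults to a dry run. The names is_stale and delete_stale_runs are hypothetical.

```python
from datetime import datetime, timedelta, timezone


def is_stale(run, cutoff_ms: int) -> bool:
    """A run is stale if it failed, or if it started before the cutoff."""
    if run.info.status == "FAILED":
        return True
    # MLflow stores start_time as milliseconds since the Unix epoch.
    return run.info.start_time is not None and run.info.start_time < cutoff_ms


def delete_stale_runs(client, experiment_id, days_old=90, now=None, dry_run=True):
    """Return the IDs of stale runs, deleting them unless dry_run is True."""
    now = now or datetime.now(timezone.utc)
    cutoff_ms = int((now - timedelta(days=days_old)).timestamp() * 1000)
    stale = [
        run.info.run_id
        for run in client.search_runs([experiment_id])
        if is_stale(run, cutoff_ms)
    ]
    if not dry_run:
        for run_id in stale:
            # delete_run is a soft delete: the run moves to the deleted state.
            client.delete_run(run_id)
    return stale
```

Because delete_run only marks runs as deleted, storage is not reclaimed until the deleted runs are purged, for example with the mlflow gc command. Defaulting to dry_run=True lets the selection be reviewed before anything is removed.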

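Validation

Common Pitfalls

Connection timeouts: The MLflow server is unreachable from the training script. Verify the MLFLOW_TRACKING_URI environment variable, check the firewall, and confirm the server is running.
Artifact upload failures: S3/Azure credentials are not configured or the bucket does not exist. Test access with the cloud CLI first, then verify bucket permissions.
Missing metrics: Autologging is disabled or the framework version is unsupported. Check MLflow version compatibility and fall back to manual logging.
Run clutter: Too many experimental runs pollute the UI. Adopt a tagging strategy early and run the lifecycle management script regularly.
Large artifacts: Logging entire datasets bloats storage. Log only samples or references, and use external data versioning (DVC).
Inconsistent naming: Parameters are named differently across runs. Standardize naming conventions in the configuration file.
Database locks: SQLite does not support concurrent writes. Use PostgreSQL/MySQL in multi-user environments.
Autologging conflicts: Multiple autologging configurations interfere with each other. Use exclusive=True or disable the conflicting autologger.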
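
One way to address the inconsistent-naming pitfall is to keep an alias table in the shared configuration and normalize parameter names before logging. The table below is purely hypothetical (the aliases are invented for illustration), as is the function name normalize_params.

```python
# Hypothetical alias table: canonical parameter name -> spellings seen in older scripts.
PARAM_ALIASES = {
    "learning_rate": {"lr", "learningRate", "learning-rate"},
    "n_estimators": {"num_trees", "n_trees"},
}


def normalize_params(params: dict, aliases=PARAM_ALIASES) -> dict:
    """Rename aliased parameters to their canonical names, rejecting conflicts."""
    lookup = {alias: canonical for canonical, names in aliases.items() for alias in names}
    normalized = {}
    for name, value in params.items():
        canonical = lookup.get(name, name)  # unknown names pass through unchanged
        if canonical in normalized and normalized[canonical] != value:
            raise ValueError(f"Conflicting values for parameter {canonical!r}")
        normalized[canonical] = value
    return normalized
```

The normalized dictionary can be passed straight to mlflow.log_params(normalize_params(params)), so that runs from different scripts share the same columns in the comparison view.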

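Related Skills

register-ml-model - Register tracked models in the MLflow Model Registry
version-ml-data - Version datasets with DVC for reproducible experiments
setup-automl-pipeline - Integrate experiment tracking into AutoML pipelines
deploy-ml-model-serving - Deploy the best tracked model to production
orchestrate-ml-pipeline - Combine experiment tracking with pipeline orchestration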

GitHub Repository

pjt222/agent-almanac

Path: i18n/wenyan-ultra/skills/track-ml-experiments

Topics: agents, agentskills, ai-assisted-development, claude-code, skills, teams

FAQ

What is the track-ml-experiments skill?

track-ml-experiments is a Claude Skill by pjt222. Skills package instructions and resources that Claude loads on demand, so Claude can perform track-ml-experiments-related tasks without extra prompting.

How do I install track-ml-experiments?

Use the install commands on this page: add track-ml-experiments to Claude Code as a plugin, or clone its repository into your skills directory, then restart Claude so it picks up the skill.

What category does track-ml-experiments belong to?

track-ml-experiments is in the Meta category, tagged ai, automation and design.

Is track-ml-experiments free to use?

Yes. track-ml-experiments is listed on AIMCP and free to install. It runs inside Claude, so no separate service account is required to use the skill itself.
