SKILL·3C8C6E

torchserve

Name: torchserve
Author: cuba6112

cuba6112

Updated 1 month ago

10 views

Developmentapi

About

TorchServe is a production-ready model serving engine for PyTorch that packages models into MAR files and serves them via REST/gRPC APIs. It's ideal when you need custom preprocessing/inference logic via Python handlers and automatic multi-GPU worker scaling. Use it for handling request batching, load balancing, and managing multiple model versions in deployment.

Quick Install

Claude Code

Recommended

Primary

npx skills add cuba6112/skillfactory -a claude-code

Plugin CommandAlternative

/plugin add https://github.com/cuba6112/skillfactory

Git CloneAlternative

git clone https://github.com/cuba6112/skillfactory.git ~/.claude/skills/torchserve

Copy and paste this command in Claude Code to install this skill

GitHub Repository

cuba6112/skillfactory

Path: skills/torchserve

FAQ

Frequently asked questions

What is the torchserve skill?

torchserve is a Claude Skill by cuba6112. Skills package instructions and resources that Claude loads on demand, so Claude can perform torchserve-related tasks without extra prompting.

How do I install torchserve?

Use the install commands on this page: add torchserve to Claude Code as a plugin, or clone its repository into your skills directory, then restart Claude so it picks up the skill.

What category does torchserve belong to?

torchserve is in the Development category, tagged api.

Is torchserve free to use?

Yes. torchserve is listed on AIMCP and free to install. It runs inside Claude, so no separate service account is required to use the skill itself.

Related Skills

qmd

Development

qmd is a local search and indexing CLI tool that enables developers to index and search through local files using hybrid search combining BM25, vector embeddings, and reranking. It supports both command-line usage and MCP (Model Context Protocol) mode for integration with Claude. The tool uses Ollama for embeddings and stores indexes locally, making it ideal for searching documentation or codebases directly from the terminal.

View skill

subagent-driven-development

Development

This skill executes implementation plans by dispatching a fresh subagent for each independent task, with code review between tasks. It enables fast iteration while maintaining quality gates through this review process. Use it when working on mostly independent tasks within the same session to ensure continuous progress with built-in quality checks.

View skill

mcporter

Development

The mcporter skill enables developers to manage and call Model Context Protocol (MCP) servers directly from Claude. It provides commands to list available servers, call their tools with arguments, and handle authentication and daemon lifecycle. Use this skill for integrating and testing MCP server functionality in your development workflow.

View skill

adk-deployment-specialist

Development

This skill deploys and orchestrates Vertex AI ADK agents using A2A protocol, managing AgentCard discovery, task submission, and supporting tools like Code Execution Sandbox and Memory Bank. It enables building multi-agent systems with sequential, parallel, or loop orchestration patterns in Python, Java, or Go. Use it when asked to deploy ADK agents or orchestrate agent workflows on Google Cloud.

View skill