MCP HubMCP Hub
Zurück zu Fähigkeiten

openai-whisper-api

steipete
Aktualisiert Today
477 Ansichten
468
45
468
Auf GitHub ansehen
Entwicklungaiapi

Über

Dieses Claude Skill transkribiert Audiodateien in Text unter Verwendung der Whisper-API von OpenAI über ein curl-basiertes Skript. Es unterstützt verschiedene Audioformate, ermöglicht optionale Parameter wie Sprache und Prompts und kann Klartext oder JSON ausgeben. Nutzen Sie es, um schnell Spracherkennungsfunktionen in Ihren Workflow zu integrieren.

Schnellinstallation

Claude Code

Empfohlen
Primär
npx skills add steipete/clawdis
Plugin-BefehlAlternativ
/plugin add https://github.com/steipete/clawdis
Git CloneAlternativ
git clone https://github.com/steipete/clawdis.git ~/.claude/skills/openai-whisper-api

Kopieren Sie diesen Befehl und fügen Sie ihn in Claude Code ein, um diese Fähigkeit zu installieren

Dokumentation

OpenAI Whisper API (curl)

Transcribe an audio file via OpenAI’s /v1/audio/transcriptions endpoint.

Quick start

{baseDir}/scripts/transcribe.sh /path/to/audio.m4a

Defaults:

  • Model: whisper-1
  • Output: <input>.txt

Useful flags

{baseDir}/scripts/transcribe.sh /path/to/audio.ogg --model whisper-1 --out /tmp/transcript.txt
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --language en
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --prompt "Speaker names: Peter, Daniel"
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --json --out /tmp/transcript.json

API key

Set OPENAI_API_KEY, or configure it in ~/.clawdis/clawdis.json:

{
  skills: {
    "openai-whisper-api": {
      apiKey: "OPENAI_KEY_HERE"
    }
  }
}

GitHub Repository

steipete/clawdis
Pfad: skills/openai-whisper-api
relaywhatsapp

Verwandte Skills

polymarket

Meta

This skill enables developers to build applications with the Polymarket prediction markets platform, including API integration for trading and market data. It also provides real-time data streaming via WebSocket to monitor live trades and market activity. Use it for implementing trading strategies or creating tools that process live market updates.

Skill ansehen

creating-opencode-plugins

Meta

This skill helps developers create OpenCode plugins that hook into 25+ event types like commands, files, and LSP operations. It provides the plugin structure, event API specifications, and implementation patterns for JavaScript/TypeScript modules. Use it when you need to intercept, monitor, or extend the OpenCode AI assistant's lifecycle with custom event-driven logic.

Skill ansehen

himalaya-email-manager

Kommunikation

This Claude Skill enables email management through the Himalaya CLI tool using IMAP. It allows developers to search, summarize, and delete emails from an IMAP account with natural language queries. Use it for automated email workflows like getting daily summaries or performing batch operations directly from Claude.

Skill ansehen

sglang

Meta

SGLang is a high-performance LLM serving framework that specializes in fast, structured generation for JSON, regex, and agentic workflows using its RadixAttention prefix caching. It delivers significantly faster inference, especially for tasks with repeated prefixes, making it ideal for complex, structured outputs and multi-turn conversations. Choose SGLang over alternatives like vLLM when you need constrained decoding or are building applications with extensive prefix sharing.

Skill ansehen