SKILL·0380B9

video-translation

Name: video-translation
Author: NoizAI

NoizAI

업데이트됨 1 month ago

8 조회

517

GitHub에서 보기

기타general

정보

이 스킬은 소스를 다운로드하고, 자막을 추출하며, 텍스트를 번역하고, TTS로 새로운 오디오를 생성한 후 원본 오디오 트랙을 대체하는 방식으로 동영상을 번역 및 더빙합니다. 사용자가 언어 변경을 요청할 때 YouTube 동영상을 현지화하는 것과 같은 사용 사례를 위해 설계되었습니다. 주요 기능으로는 동영상 다운로드 처리, 자막 처리, 원본 동영상을 보존하면서 오디오 합성 등이 포함됩니다.

빠른 설치

Claude Code

문서

Video Translation

Translate a video's speech into another language, using TTS to generate the dubbed audio and replacing the original audio track.

Triggers

translate this video
dub this video to English
把视频从 X 语译成 Y 语
视频翻译

Use Cases

The user wants to watch a foreign language YouTube video but prefers to hear it in their native language.
The user provides a video link and explicitly requests changing the audio language.

Workflow

When the user asks to translate a video:

Download Video & Subtitles: Use the youtube-downloader skill to download the video and its subtitles as SRT. Make sure you specify the source language to fetch the correct subtitle.
```
python path/to/youtube-downloader/scripts/download_video.py "VIDEO_URL" --subtitles --sub-lang <source_lang_code> -o /tmp/video-translation
```
Translate Subtitles: Read the downloaded .srt file. Translate its contents sentence by sentence into the target language using the following fixed prompt. Keep the exact same SRT index and timestamp format!

Translation Prompt:

Translate the following subtitle text from <Source Language> to <Target Language>. Provide ONLY the translated text. Do not explain, do not add notes, do not add index numbers. The translation must be colloquial, natural-sounding, and suitable for video dubbing.

Save the translated text into a new file translated.srt.
Generate Dubbed Audio: Use the tts skill to render the timeline-accurate audio from the translated SRT. The Noiz backend automatically aligns the duration of each sentence to the original video's subtitle timestamps.

To ensure the cloned voice matches the original speaker's exact tone and emotion for each sentence, pass the original video file to --ref-audio-track. The TTS engine will automatically slice the original audio at each subtitle's exact timestamp and use it as the reference for that specific segment.

Create a basic voice_map.json:
```
{
  "default": {
    "target_lang": "<target_lang_code>"
  }
}
```
Render the timeline-accurate audio:
```
bash skills/tts/scripts/tts.sh render --srt translated.srt --voice-map voice_map.json --backend noiz --auto-emotion --ref-audio-track original_video.mp4 -o dubbed.wav
```
Replace Audio in Video: Use the replace_audio.sh script to merge the original video with the new dubbed audio. To keep the original video's non-speech audio background outside of translated segments, pass the --srt file.
```
bash skills/video-translation/scripts/replace_audio.sh --video original_video.mp4 --audio dubbed.wav --output final_video.mp4 --srt translated.srt
```
Present the Result: Return the final_video.mp4 file path to the user.

Inputs

Required inputs:
- VIDEO_URL: The URL of the video to translate.
- target_language: The language to translate the audio to.
Optional inputs:
- source_language: The language of the original video (if not auto-detected or specified).
- reference_audio: Specific audio file/URL to use for voice cloning instead of the dynamic original video track.

Outputs

Success: Path to the final video file with replaced audio.
Failure: Clear error message specifying whether download, TTS, or audio replacement failed.

Requirements

Dependencies (other skills)
- youtube-downloader (crazynomad/skills) — SKILL.md
  Install: clone or copy the skills/youtube-downloader directory from crazynomad/skills into your skills/ folder so that skills/youtube-downloader/scripts/download_video.py is available.
- tts (NoizAI/skills) — SKILL.md
  If not already in this repo: clone or copy the skills/tts directory from NoizAI/skills into your skills/ folder. Ensure skills/tts/scripts/tts.sh and related scripts are present.
NOIZ_API_KEY configured for the Noiz backend. If it is not set, first guide the user to get an API key from https://developers.noiz.ai/api-keys. After the user provides the key, ask whether they want to persist it; if they agree, either write/update NOIZ_API_KEY=... in the project's .env file or run bash skills/tts/scripts/tts.sh config --set-api-key YOUR_KEY to store it.
ffmpeg installed.

Limitations

The source video must have subtitles (or auto-generated subtitles) available on the platform for the source language.
Very long videos may take a significant amount of time to translate and dub.

GitHub 저장소

NoizAI/skills

경로: skills/video-translation

FAQ

Frequently asked questions

What is the video-translation skill?

video-translation is a Claude Skill by NoizAI. Skills package instructions and resources that Claude loads on demand, so Claude can perform video-translation-related tasks without extra prompting.

How do I install video-translation?

Use the install commands on this page: add video-translation to Claude Code as a plugin, or clone its repository into your skills directory, then restart Claude so it picks up the skill.

What category does video-translation belong to?

video-translation is in the Other category, tagged general.

Is video-translation free to use?

Yes. video-translation is listed on AIMCP and free to install. It runs inside Claude, so no separate service account is required to use the skill itself.

연관 스킬

llamaguard

기타

LlamaGuard는 폭력 및 혐오 발언 등 6가지 안전 범주에서 LLM 입력과 출력을 조정하기 위한 Meta의 70-80억 파라미터 모델입니다. 94-95% 정확도를 제공하며 vLLM, Hugging Face 또는 Amazon SageMaker를 사용해 배포할 수 있습니다. 이 기술을 사용하여 AI 애플리케이션에 콘텐츠 필터링 및 안전 가드레일을 손쉽게 통합하세요.

스킬 보기

cost-optimization

기타

이 Claude Skill은 리소스 적정화, 태깅 전략, 지출 분석을 통해 개발자들이 클라우드 비용을 최적화할 수 있도록 지원합니다. AWS, Azure, GCP에서 클라우드 비용을 절감하고 비용 거버넌스를 구현하기 위한 프레임워크를 제공합니다. 인프라 비용을 분석하거나, 리소스를 적정화하거나, 예산 제약을 충족해야 할 때 사용하세요.

스킬 보기

sports-betting-analyzer

기타

이 Claude Skill은 스프레드, 오버/언더, 프로프 베트를 포함한 스포츠 베팅 시장을 분석합니다. 역사적 추이와 상황별 통계를 검토하여 가치 베트를 발견하고, 교육적 목적으로 실행 가능한 권장 사항이 담긴 구조화된 마크다운 결과를 제공합니다. 개발자는 이 기능을 스포츠 베팅 분석 도구에 활용할 수 있으며, 단순히 엔터테인먼트/교육 목적으로만 설계되었음을 유의해야 합니다.

스킬 보기

quantizing-models-bitsandbytes

기타

이 스킬은 bitsandbytes를 사용하여 LLM을 8비트 또는 4비트 정밀도로 양자화하며, 최소한의 정확도 손실로 50-75%의 메모리 감소를 달성합니다. 제한된 GPU 메모리에서 더 큰 모델을 실행하거나 추론을 가속화하는 데 이상적이며, INT8, NF4, FP4와 같은 형식을 지원합니다. 이 스킬은 HuggingFace Transformers와 통합되어 QLoRA 학습 및 8비트 옵티마이저를 가능하게 합니다.

스킬 보기