基本信息

案例ID:224663

技术顾问:肖旭 - 1年经验 - 商汤

联系沟通

微信扫码,建群沟通

项目名称:多语言视频翻译系统

所属行业:工具 - 拍照修图

->查看更多案例

案例介绍

1.项目介绍:帮助视频创作者将中文视频快速翻译为多语言视频物料。
2.技术方向:ASR、NLP、TTS

translate video from zh to en
The pipeline for the video translation task includes the following steps:

First, extract the audio from the video, a process that utilizes ffmpeg.
Use spleeter to separate the human voice from the audio, *I think this will improve the accuracy of downstream ASR.
Employ the Whisper encoder-decoder model for ASR voice recognition and generate an SRT subtitle file, *in the example, the "base" model is used.
Translate the SRT file, *using the Helsinki-NLP/opus-mt-zh-en model for Chinese to English translation processing.
After translation, use speecht5_tts for voice generation.
Finally, merge the results from the upstream processing.
Main purpose: To demonstrate the end-to-end process of a video translation task. Optimization space:

发布任务

企业点击发布任务,工程师会在任务下报名,招聘专员也会在1小时内与您联系,1小时内精准确定人才

微信接收人才推送

关注猿急送微信平台,接收实时人才推送

接收人才推送
联系需求方端客服
联系需求方端客服