Genve AI + TranscribeToText: Sell a Video Localization Pack (Lip-Sync + Dubbing + Subtitles)
Sell a Video Localization Pack: Genve AI for dubbing + lip-sync and TranscribeToText for transcript/subtitle exports and QA. Productized deliverables, clear scope, no outcome guarantees.

Tongyi Bailong's Speech Twins Go Open-Source: Alibaba Drops Upgraded Fun-CosyVoice3 and Fun-ASR — 3-Second Voice Cloning Across 9 Languages and 18 Dialects
On December 15, 2025, Alibaba's Tongyi Lab unleashed major upgrades to its Bailong speech twins — Fun-CosyVoice3 (TTS) and Fun-ASR (speech recognition) — while simultaneously open-sourcing lightweight versions like Fun-CosyVoice3-0.5B and Fun-ASR-Nano-0.8B. The star feature? Zero-shot voice cloning from just 3 seconds of audio, seamlessly switching across 9 languages, 18 Chinese dialects, and 9 emotions with uncanny fidelity. Latency slashed by 50%, noisy environment accuracy hitting 93%, and full local deployment support — this duo crushes rivals like ElevenLabs and Whisper in multilingual realism, flooding ModelScope and Hugging Face with instant downloads.





