Tongyi Bailing's Dual Voice Models Receive Major Upgrades and Are Now Open-Source, Markedly Boosting Speech Synthesis and Recognition Abilities
2025-12-15 / Read about 0 minute
Author:小编   

As per the official announcement from Tongyi Large Models, Tongyi Bailing's speech large models, Fun-CosyVoice3 and Fun-ASR, have undergone substantial enhancements and are now available as open-source. Fun-CosyVoice3 has witnessed notable advancements in content coherence, speaker resemblance, and the natural flow of intonation. Remarkably, it can recreate the target voice using merely a 3-second audio clip, and it also supports dialect synthesis and tone customization. On the flip side, Fun-ASR has bolstered its contextual understanding and highly accurate speech transcription capabilities, leading to a more than 15% increase in recognition precision in specialized fields like home decoration and insurance. Additionally, it facilitates custom training for models exclusive to enterprises.