Alibaba Unveils Two Innovative Voice Models for Tailored Roles and Realistic Background Sound Simulation

23 hours ago

Author: Staff editor

Alibaba has released two voice models: Fun-CosyVoice3.5 and Fun-AudioGen-VD. Fun-CosyVoice3.5 is a voice cloning model that works from reference audio, whereas Fun-AudioGen-VD is a timbre design model that needs no reference audio. Both models support instruction following, which makes them usable across a wide range of applications.

Fun-CosyVoice3.5 performs strongly on the Chinese "difficult cases" section of the Seed-TTS benchmark, significantly lowering error rates for uncommon characters and phrases. It also supports free-style instruction control, which addresses common problems with traditional cloning models.

Fun-AudioGen-VD, by contrast, focuses on "creating something out of nothing" in timbre design: it generates voices without a reference recording. It supports personalized timbre and emotion customization, as well as the simulation of complex soundscapes.

Edited by Yang Juanjuan; proofread by Chen Diyan.
