StepFun Unveils Its Latest Speech Generation Model: StepAudio 2.5 TTS

5 days ago

Author: Editor

On April 16th, StepFun announced the official launch of its latest speech generation model, StepAudio 2.5 TTS. The model improves on its predecessor in four areas: global context control, in-text context control, zero-shot voice cloning, and comprehensive voice control. Together, these changes make generated speech more natural, adaptable, and expressive. StepAudio 2.5 TTS targets use cases such as character voiceovers, audio content creation, and intelligent voice interaction. Users can steer synthesis with natural-language instructions, which keeps operation simple and brings results closer to what they expect. The model is currently available on the StepFun Open Platform and through Step Plan.
