Alibaba Unveils End-to-End Speech Interaction Model Fun-Audio-Chat

2025-12-23

Author: Editor

According to Tongyi Large Model, Alibaba released its latest end-to-end speech interaction model, Fun-Audio-Chat, on December 23, 2025. Alibaba has open-sourced the 8B model weights together with the inference code and Function Call integration examples. The model reportedly ranks first among models of similar size on multiple leaderboards, with overall performance ahead of competitors such as GLM4-Voice and Kimi-Audio. Fun-Audio-Chat adopts an end-to-end S2S (Speech-to-Speech) architecture, which is said to deliver higher efficiency and lower latency.
