Alibaba Launches Qwen3.5-Omni, Outperforming Gemini-3.1 Pro in Multimodal Proficiency
1 week ago / Read about 0 minute
Author:小编   

On March 30, 2026, Alibaba introduced Qwen3.5-Omni, a cutting-edge, full-modal large-scale model within its Qianwen series. This model has attained state-of-the-art (SOTA) levels of performance across 215 diverse tasks, encompassing audio-video comprehension, recognition, and interactive capabilities. It has surpassed Gemini-3.1 Pro, cementing its position as one of the premier full-modal large-scale models globally. The novel model boasts support for the recognition of 113 languages and dialects, incorporates audio-video Vibe Coding functionalities, and is capable of generating intricate code for applications, webpages, and beyond. Presently, Alibaba Cloud's BaiLian platform has rolled out three APIs for Qwen3.5-Omni—Plus, Flash, and Light—tailored to sectors such as short videos, gaming, and self-media, with complimentary access extended to general users.