On September 24, 2025, Alibaba’s Tongyi Large Model Team unveiled the upgraded Qwen3-VL series and open-sourced its flagship version, the Qwen3-VL-235B-A22B series. As the most capable vision-language model in the Qwen family to date, Qwen3-VL not only perceives images and video but can also interpret the world, understand events, and take action.

In official demonstrations, the model showed strong visually driven reasoning and execution: it can operate devices such as smartphones and computers from natural language commands, carrying out tasks like launching applications, tapping buttons, and entering text, and it can complete end-to-end workflows such as searching for and booking flights. Qwen3-VL is also adept at recognizing a wide range of objects, with knowledge spanning celebrities, dishes, plants and animals, car brands, anime characters, and more.

In a comprehensive evaluation across ten capability categories, Qwen3-VL-235B-A22B-Instruct led on most metrics among non-reasoning models, outperforming proprietary models such as Gemini 2.5 Pro and GPT-5 and setting a new standard for open-source multimodal models. Both Qwen3-VL-235B-A22B-Instruct and Qwen3-VL-235B-A22B-Thinking are now available as open source on GitHub, Hugging Face, and ModelScope.
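For readers who want to try the open weights, the sketch below shows one plausible way to query the Instruct checkpoint through Hugging Face Transformers. It is a minimal sketch rather than the official quickstart: it assumes a recent Transformers release whose generic `AutoModelForImageTextToText` class and multimodal chat templates cover Qwen3-VL, and the image URL and question are placeholders. In practice, a 235B-parameter mixture-of-experts checkpoint will require multiple GPUs or a hosted endpoint rather than a single consumer machine.

```python
# A minimal sketch of loading the open-weights Instruct checkpoint with Hugging
# Face Transformers. The class names follow the generic image-text-to-text
# interface of recent Transformers releases; whether Qwen3-VL maps onto them is
# an assumption here, and the image URL is a placeholder.
from transformers import AutoModelForImageTextToText, AutoProcessor

model_id = "Qwen/Qwen3-VL-235B-A22B-Instruct"

processor = AutoProcessor.from_pretrained(model_id)
# device_map="auto" shards the weights across available GPUs (requires the
# accelerate package); a model of this size will not fit on one consumer GPU.
model = AutoModelForImageTextToText.from_pretrained(
    model_id, torch_dtype="auto", device_map="auto"
)

# A chat turn mixing an image with a text question, in the format that
# multimodal processor chat templates accept.
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://example.com/street_scene.jpg"},
            {"type": "text", "text": "What brand is the car in this photo?"},
        ],
    }
]

inputs = processor.apply_chat_template(
    messages,
    add_generation_prompt=True,
    tokenize=True,
    return_dict=True,
    return_tensors="pt",
).to(model.device)

output_ids = model.generate(**inputs, max_new_tokens=128)
# Decode only the newly generated tokens, dropping the echoed prompt.
answer = processor.batch_decode(
    output_ids[:, inputs["input_ids"].shape[1]:], skip_special_tokens=True
)[0]
print(answer)
```

The same chat-message format extends to multi-image and video inputs, which is how the recognition and reasoning capabilities described above are typically exercised; the ModelScope and GitHub releases ship their own usage examples that should be treated as authoritative.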