Over a weekend, Meta unexpectedly unveiled the Llama 4 series, its first family of models built on a Mixture-of-Experts (MoE) architecture. The series comprises Llama 4 Scout, Llama 4 Maverick, and the eagerly anticipated Llama 4 Behemoth. Scout and Maverick are available now, and each activates 17 billion parameters per token. Behemoth, positioned as the series' "teacher model," totals nearly 2 trillion parameters but has not yet been officially released. By adopting the MoE architecture, these models improve compute and inference efficiency, support multimodal capabilities, and make notable advances in long-context processing.
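
The efficiency claim rests on the gap between total and active parameters: in an MoE layer, a router sends each token to only a small number of experts, so most expert weights stay idle for any given token. Below is a minimal, hypothetical sketch of top-k expert routing in PyTorch to illustrate the idea; it is not Meta's implementation, and the sizes (`d_model`, `n_experts`, `top_k`) are arbitrary placeholders.

```python
# Minimal sketch of top-k Mixture-of-Experts routing (illustrative only;
# not Meta's implementation; all hyperparameters are hypothetical).
import torch
import torch.nn as nn
import torch.nn.functional as F

class MoELayer(nn.Module):
    def __init__(self, d_model=512, d_ff=2048, n_experts=16, top_k=1):
        super().__init__()
        # The router scores each token against every expert.
        self.router = nn.Linear(d_model, n_experts, bias=False)
        # Each expert is an independent feed-forward block.
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_ff), nn.GELU(), nn.Linear(d_ff, d_model))
            for _ in range(n_experts)
        )
        self.top_k = top_k

    def forward(self, x):  # x: (tokens, d_model)
        gate_logits = self.router(x)                           # (tokens, n_experts)
        weights, chosen = gate_logits.topk(self.top_k, dim=-1)  # pick top-k experts per token
        weights = F.softmax(weights, dim=-1)                   # normalize over chosen experts
        out = torch.zeros_like(x)
        # Only the selected experts run for each token, so the "active"
        # parameter count per token is a fraction of the total.
        for e, expert in enumerate(self.experts):
            mask = (chosen == e)
            if mask.any():
                rows, slots = mask.nonzero(as_tuple=True)
                out[rows] += weights[rows, slots].unsqueeze(-1) * expert(x[rows])
        return out

layer = MoELayer()
tokens = torch.randn(8, 512)
print(layer(tokens).shape)  # torch.Size([8, 512])
```

With 16 experts and `top_k=1`, roughly one sixteenth of the expert parameters participate in any single token's forward pass, which is how an MoE model can report a modest active-parameter count against a much larger total.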
