Xiaohongshu has released its first large language model, dots.llm1. The model uses a Mixture-of-Experts (MoE) architecture with 142 billion total parameters, of which only about 14 billion are activated during inference, preserving quality while substantially reducing both training and inference costs. The base model was pre-trained on 11.2 trillion tokens of non-synthetic data, and the instruction-tuned variant, dots.llm1.inst, performs comparably to Alibaba's Qwen3-32B on Chinese and English language tasks, mathematics, and alignment benchmarks.
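
To see how an MoE model can hold far more parameters than it uses per token, the sketch below shows generic top-k expert routing in PyTorch. It is a minimal illustration of the sparse-activation idea, not dots.llm1's actual implementation, and the layer sizes, expert count, and top-k value are toy assumptions rather than the model's real configuration.

```python
# Minimal sketch of top-k MoE routing: each token is sent to only a few
# experts, so most of the layer's parameters stay idle for that token.
# All sizes below are illustrative, not dots.llm1's real hyperparameters.
import torch
import torch.nn as nn
import torch.nn.functional as F


class ToyMoELayer(nn.Module):
    def __init__(self, d_model=64, d_ff=128, num_experts=8, top_k=2):
        super().__init__()
        self.top_k = top_k
        self.router = nn.Linear(d_model, num_experts, bias=False)
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_ff), nn.GELU(), nn.Linear(d_ff, d_model))
            for _ in range(num_experts)
        )

    def forward(self, x):  # x: (num_tokens, d_model)
        scores = self.router(x)                         # routing logits per expert
        weights, idx = scores.topk(self.top_k, dim=-1)  # keep only the top-k experts per token
        weights = F.softmax(weights, dim=-1)            # normalize the kept scores
        out = torch.zeros_like(x)
        for slot in range(self.top_k):
            for e, expert in enumerate(self.experts):
                mask = idx[:, slot] == e                # tokens routed to expert e in this slot
                if mask.any():
                    out[mask] += weights[mask, slot].unsqueeze(-1) * expert(x[mask])
        return out


if __name__ == "__main__":
    layer = ToyMoELayer()
    tokens = torch.randn(5, 64)
    print(layer(tokens).shape)  # torch.Size([5, 64]); only 2 of the 8 experts ran per token
```

With 8 experts and top-2 routing, roughly a quarter of the expert parameters are touched per token; dots.llm1 applies the same principle at a much larger scale, which is how 142 billion total parameters translate into only about 14 billion active ones at inference time.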