Code Leak: DeepSeek's Next-Generation "Blockbuster" Model Architecture Exposed - AI

2026-01-21

Author: Editor

On the first anniversary of DeepSeek-R1's release, a new DeepSeek model labeled "MODEL1" surfaced in the company's GitHub code. The identifier appears 28 times across 114 files in the FlashMLA optimization library, where it is referenced either alongside the existing V3.2 model or as distinct from it. Technical analysis indicates that MODEL1 adopts a new architecture, with changes in areas such as key-value cache layout, sparsity handling, and FP8 decoding. It may be the development codename for V4, DeepSeek's next-generation flagship model, which is expected as early as February.
