After a period of silence, DeepSeek has made new moves, updating its DeepGEMM codebase and launching a new project called Mega MoE. Contributed by DeepSeek's infrastructure team, the project fuses the previously separate stages of MoE computation into a single mega-kernel, allowing data communication and computation to run in parallel and thereby improving GPU utilization. The gains are especially pronounced in multi-GPU, large-scale MoE scenarios. DeepSeek is also exploring techniques such as mixed precision and is developing an FP4 indexer to further improve MoE efficiency. Mega MoE is still under development, and performance data has not yet been released. This update represents DeepSeek's attempt to restructure its infrastructure layer, with the aim of pushing MoE towards large-scale, highly efficient operation. Mega MoE may be the first step in that direction, and it could also hint that DeepSeek is training on NVIDIA's latest top-tier B-series cards.
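To illustrate the general idea of overlapping communication with computation in an MoE layer, here is a minimal sketch, not DeepSeek's implementation: instead of fusing everything into one mega-kernel, it approximates the same scheduling principle with two CUDA streams in PyTorch, launching the next chunk's token dispatch (all-to-all) while the current chunk's expert GEMM runs. The chunking scheme, the function name `moe_forward_overlapped`, and the plain matrix multiply standing in for a grouped expert GEMM are all hypothetical simplifications.

```python
import torch
import torch.distributed as dist

def moe_forward_overlapped(token_chunks, expert_weights):
    """Sketch: overlap token dispatch (communication) for chunk i+1
    with the expert computation for chunk i, so the GPU's compute units
    stay busy while data moves between ranks.
    Assumes dist.init_process_group() has already been called."""
    comm_stream = torch.cuda.Stream()                      # side stream for all-to-all
    ready = [torch.cuda.Event() for _ in token_chunks]     # per-chunk arrival markers
    received = [torch.empty_like(c) for c in token_chunks]
    outputs = []

    # Issue the first dispatch on the communication stream.
    with torch.cuda.stream(comm_stream):
        dist.all_to_all_single(received[0], token_chunks[0])
        ready[0].record()

    for i in range(len(token_chunks)):
        # Kick off the next chunk's all-to-all so it overlaps with this chunk's GEMM.
        if i + 1 < len(token_chunks):
            with torch.cuda.stream(comm_stream):
                dist.all_to_all_single(received[i + 1], token_chunks[i + 1])
                ready[i + 1].record()

        # The compute stream waits only for chunk i's data, not for later chunks.
        torch.cuda.current_stream().wait_event(ready[i])
        outputs.append(received[i] @ expert_weights)        # stand-in for grouped expert GEMM

    return torch.cat(outputs)
```

In the approach the article describes, this kind of overlap is achieved inside a single fused kernel rather than with separate streams and launches, but the goal is the same: keep the GPU computing while tokens are being exchanged across devices.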
