DeepSeek Teams Up with Peking University and Tsinghua University to Release a Paper: Concentrating on the Foundational Infrastructure for Intelligent Agents and Overcoming I/O Bottlenecks in Agent Rea - AI

7 x 24 Track global technological trends

Hot Topic

Day

News Topic

DeepSeek Teams Up with Peking University and Tsinghua University to Release a Paper: Concentrating on the Foundational Infrastructure for Intelligent Agents and Overcoming I/O Bottlenecks in Agent Rea

3 day ago / Read about 0 minute

Author：小编

DeepSeek, in a collaborative effort with Peking University and Tsinghua University, has unveiled a paper on ArXiv, introducing a novel framework for agent reasoning known as DualPath. This innovative framework is designed to tackle the I/O bottleneck problem that arises during long-text reasoning for agents. It achieves this by incorporating a 'storage-to-decode' pathway, which transforms the conventional single-path loading model. This transformation enables the global pooling of cluster storage bandwidth and facilitates dynamic load balancing. In practical tests utilizing a 660B-scale model, DualPath demonstrated a significant enhancement in performance. It boosted offline reasoning throughput by 1.87 times and online service throughput by an average of 1.96 times. Moreover, it optimized the latency of the first character without compromising the speed of token generation. DualPath establishes a dual-path model that encompasses a reasoning engine, a traffic manager, and a central scheduler. It also offers two optimization strategies: a compute NIC-centric traffic management system and an adaptive request scheduler. Experimental findings indicate that DualPath can effectively surmount I/O constraints in large-scale model reasoning and elevate the efficiency of LLM reasoning systems for intelligent agents. The paper's lead author is Wu Yongtong, a Ph.D. candidate at Peking University, who specializes in system software and large-scale model infrastructure research.

Previous page：Fintech Firm Block Slashes 40% of Workforce Amid A...

Next page：Actor Wang Jinsong Alleges AI Image Theft via WeCh...

Return to List

Hot Reading

1 day ago

Honor says its ‘Robot phone’ with moving camera can dance to music

2 day ago

Want the Most From Your Kindle? Try Out My Go-To Hacks

2 day ago

AI Robot 'Buddharoid' Brings 24/7 Spiritual Guidance to a Kyoto Temple

2 day ago

Apple says it has "a big week ahead." Here's what we expect to see.