Luo Fuli, a former DeepSeek researcher who now leads Xiaomi's MiMo large model effort, has worked with Peking University to develop ARL-Tangram, a unified resource management system. Using a unified action-level formulation and an elastic scheduling algorithm, the system handles heterogeneous resource constraints, reduces action completion time (ACT), and allows each type of resource to be managed in a tailored way. Evaluations indicate that the system improves average ACT and shortens reinforcement learning training phases, while yielding significant savings in external resources. This is the second piece of research Luo Fuli has published since joining Xiaomi, following her first paper in October of the previous year. Luo Fuli made her first appearance at the 2025 Xiaomi Human-Vehicle-Home Ecosystem Partner Conference and announced on social media that she had joined the Xiaomi MiMo large model team.
