In September 2025, Microsoft Research unveiled rStar2-Agent, an open-source AI agent reasoning framework. Despite having only 14 billion parameters, this framework attained an impressive 80.6% accuracy rate in the AIME24 mathematical reasoning test. This performance eclipses that of DeepSeek-R1, which boasts a massive 671 billion parameters (48 times larger than rStar2-Agent's). Similarly, in the scientific reasoning benchmark GPQA-Diamond test, rStar2-Agent achieved a 60.9% accuracy rate, again surpassing DeepSeek-V3. Moreover, in the BFCL v3 agent tool usage task, it reached a task completion rate of 60.8%, outstripping existing benchmarks.
The core technological innovations that underpin rStar2-Agent's success are as follows:
The rStar2-Agent project has been made open source on GitHub, with the aim of expediting the industrialization of agent technology and fostering innovation in the field.