Microsoft Unveils Open-Source Mathematical Reasoning Model: rStar2-Agent

2 weeks ago

Author: Editor

Microsoft has released rStar2-Agent, an open-source mathematical reasoning model. Despite having only 14 billion parameters, it matches the performance of models with 671 billion parameters, a result attributed to its agentic reasoning capabilities. rStar2-Agent can plan its own reasoning steps, call code tools, and check its ideas against the feedback those tools return. Three ingredients are credited for this: the GRPO-RoC algorithm, an efficient reinforcement learning infrastructure, and a multi-stage training approach. Together they allow rStar2-Agent to train efficiently with limited resources and to generalize well across a variety of tasks. The release offers a fresh perspective on how large models may evolve, suggesting that future models will increasingly prioritize reasoning quality and tool use.
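The article names GRPO-RoC without describing it. As a rough illustration of the GRPO family it builds on, the sketch below computes group-relative advantages: several solutions are sampled for the same problem, and each one is scored against the average of its own group. It is a minimal sketch of the generic GRPO normalization step only, not Microsoft's implementation, and it does not model the RoC part. The function name, the 0/1 reward scheme, and the use of the population standard deviation are all assumptions made for illustration.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-6):
    """Score each sampled solution relative to its own group (GRPO-style).

    `rewards` holds one scalar reward per sampled solution to the same
    problem. `eps` avoids division by zero when every reward is identical.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # assumption: population standard deviation
    return [(r - mu) / (sigma + eps) for r in rewards]


# Hypothetical group: four sampled solutions to one math problem,
# rewarded 1.0 when the final answer is correct and 0.0 otherwise.
rewards = [1.0, 0.0, 0.0, 1.0]
advantages = group_relative_advantages(rewards)

for reward, advantage in zip(rewards, advantages):
    print(f"reward={reward:.1f}  advantage={advantage:+.3f}")
```

Correct solutions receive a positive advantage and incorrect ones a negative advantage, so the policy update favors whatever the correct solutions did differently. When every solution in a group gets the same reward, all advantages are zero and the group contributes no learning signal.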


