MiniMax, a provider of large-scale AI models, recently established an in-depth partnership with Tencent Cloud. The two companies are collaborating on both the technical and business sides of Agent RL (reinforcement learning) training, building on Tencent Cloud’s Agent Runtime Sandbox product. Through this collaboration, the Forge framework now runs large-scale interactive environments for reinforcement learning with millisecond-level startup, throughput in the millions, concurrency in the hundreds of thousands, and instant teardown. This has notably improved training throughput and stability.
