Zhipu Unveils GLM-5.1 High-Speed API, Shattering Global Speed Benchmarks for Large-Scale Model APIs

1 day ago / Read about 0 minute

Author：小编

Zhipu has recently introduced the GLM-5.1 high-speed API, boasting an impressive model output velocity of up to 400 tokens per second. This achievement disrupts the prevailing industry trend where high-speed models are typically lightweight, marking the first instance in China where a large-scale model combines flagship-level capabilities with minimal latency. Thanks to the backing of the TileRT high-performance inference engine, practical tests demonstrate the model's outstanding performance in AI programming, 3D gaming, and interactive interfaces. This engine is the result of collaborative system-level optimizations by the Zhipu GLM team and the TileRT team. Presently, the GLM-5.1 high-speed API is tailored for scenarios demanding rapid responses and is accessible to a select group of enterprise clients via the Zhipu MaaS platform.

Previous page：Codex for Mac Updates Appshots Feature, Windows Ca...

Next page：Anthropic Predicts Second-Quarter Revenue to Soar ...

Return to List

Hot Reading

2 day ago

Yearslong fight over users' right to tweak smart TV software heads to trial

2 day ago

Waymo pauses Atlanta service as its robotaxis keep driving into floods

2 day ago

Google Stitch Launches Real-Time AI Agent, Multiplayer Editing: Figma Charges $15/Seat

2 day ago

Ryzen 7 5800X3D AM4 10th Anniversary Edition surfaces online for $310