Aliyun Bailian Unveils Price Cut for DeepSeek-V4-Pro Model's Implicit Caching

19 hour ago / Read about 0 minute

Author：小编

On April 29, 2026, Alibaba Cloud's large-scale model service platform, Bailian, made an announcement. It stated that starting from 23:59:59 on the same day, the billing rate for implicit caching of the DeepSeek-V4-Pro model would be slashed to 1 yuan per million Tokens. Implicit caching comes into play solely when a request successfully retrieves data from the cache. In such scenarios, input Tokens are charged as cached_tokens. Conversely, input Tokens that fail to hit the cache will continue to be billed at the standard input_token rate. It's important to note that this price adjustment is specific to the implicit caching aspect, leaving the model's base inference price untouched.

Previous page：Sinolink Securities: Large-scale AI Clusters Drive...

Next page：Microsoft’s Q3 Fiscal Revenue Hits $82.89 Billion,...

Return to List

Hot Reading

2 day ago

Russian-Chinese Irtysh 32-core CPU runs The Witcher 3 at 30+ FPS

2 day ago

AMD Ryzen 9950X3D tops Amazon best sellers as it falls to all-time low price of $573.99

2 day ago

China kills Meta’s acquisition of Manus as US-China AI rivalry deepens

2 day ago

OpenAI ends its exclusive partnership with Microsoft