Major Technological Breakthrough! OpenAI Achieves Cost Reduction in Inference by Over 50% and Drastically Cuts GPU Usage to Hundreds - AI

7 x 24 Track global technological trends

Hot Topic

Day

News Topic

Major Technological Breakthrough! OpenAI Achieves Cost Reduction in Inference by Over 50% and Drastically Cuts GPU Usage to Hundreds

2 day ago / Read about 0 minute

Author：小编

Sources familiar with the matter revealed that OpenAI showcased a major technological breakthrough earlier this month: through a new optimization solution, it successfully reduced model inference costs by over 50% while significantly improving computational efficiency. This solution has been applied to the ChatGPT scenarios for unregistered visitors, resulting in a substantial decrease in the number of GPUs required. Specific technical details have not been disclosed, but they may involve optimization techniques such as quantization compression, key-value caching, batch processing, and lightweight model shunting.

Previous page：Tim Cook to Prematurely Relinquish Apple CEO Post,...

Next page："Perception Era" Completes Ten-Million-Level Angel...

Return to List

Hot Reading

2 day ago

AMD confirms low-power CPU cores in Linux kernel patch — Zen 6 chips could follow in Intel's footsteps with new core type for background tasks

2 day ago

Samsung Patents a New HBM "Dummy Die" Structure for Taller Memory Stacks

2 day ago

Google's new Nano Banana 2 Lite image model is its fastest and cheapest yet

2 day ago

Samsung Display to Expand a Gen-6 OLED Line in Asan for Apple's Foldable and Future iPhones