NVIDIA Introduces Its New GPU, Rubin CPX, Tailored for Long-Context AI Inference

2025-09-10

Author: Editor

On September 9, NVIDIA unveiled the Rubin CPX, a new GPU designed specifically for long-context inference and video generation. It targets applications that need very long context windows, such as software coding and video generation, and NVIDIA says it will markedly improve the efficiency of AI inference.

Built on the Rubin architecture, the Rubin CPX is designed for disaggregated inference, which splits AI inference into two phases: a context phase and a generation phase. The context phase is compute-bound, while the generation phase is limited mainly by memory bandwidth, so handling them separately lets compute and memory resources be allocated to each phase more efficiently.

The Rubin CPX delivers up to 30 petaflops of NVFP4 compute and carries 128GB of GDDR7 memory. NVIDIA says it processes attention, a critical operation for long-context models, three times faster than its GB300 NVL72 systems.

At rack scale, the Rubin CPX is part of the Vera Rubin NVL144 CPX platform. According to NVIDIA, the platform delivers 8 exaflops of AI performance, 7.5 times that of GB300 NVL72 systems, along with 100TB of high-speed memory and 1.7PB/s of memory bandwidth.

NVIDIA claims that every $100 million invested in the new hardware can generate as much as $5 billion in revenue. The Rubin CPX is expected to be available at the end of 2026.
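
To put the quoted figures in perspective, the short Python sketch below works out two numbers that the article implies but does not state: the performance of the system that the 7.5x comparison refers to, and the revenue multiple behind the $100 million claim. The inputs are NVIDIA's stated figures as reported above, not independent measurements, and the constant and function names are hypothetical, chosen only for this illustration.

```python
# Figures as quoted in the article (vendor claims, not measurements).
RACK_EXAFLOPS = 8.0          # Vera Rubin NVL144 CPX, stated AI performance
SPEEDUP_VS_PREVIOUS = 7.5    # stated multiple over the previous system
INVESTMENT_USD = 100e6       # hardware spend in NVIDIA's example
REVENUE_USD = 5e9            # revenue NVIDIA claims that spend can generate


def implied_baseline_exaflops(rack_exaflops: float, speedup: float) -> float:
    """Performance of the comparison system implied by the stated multiple."""
    return rack_exaflops / speedup


def revenue_multiple(revenue_usd: float, investment_usd: float) -> float:
    """Revenue per dollar of hardware spend implied by the claim."""
    return revenue_usd / investment_usd


baseline = implied_baseline_exaflops(RACK_EXAFLOPS, SPEEDUP_VS_PREVIOUS)
multiple = revenue_multiple(REVENUE_USD, INVESTMENT_USD)

print(f"Implied previous-system performance: {baseline:.2f} exaflops")
print(f"Implied revenue multiple: {multiple:.0f}x")
```

Under these figures, the comparison system works out to roughly 1.07 exaflops, and the revenue claim amounts to a 50x multiple on hardware spend.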
