Google’s Eighth-Gen TPU, with 2PB of HBM per Pod, Shatters the Memory Bottleneck Holding Back AI Progress
Author: Editor

Over the past year, memory prices have risen three to five times, dampening consumer appetite for new PCs and smartphones. The root cause is the explosive growth in demand for artificial intelligence (AI), which places enormous pressure on both memory capacity and memory bandwidth.

Google’s eighth-generation Tensor Processing Unit (TPU) illustrates the scale of that demand. The training-focused TPU 8t carries 216GB of High Bandwidth Memory (HBM) per chip, with a memory bandwidth of 6.5 terabytes per second (TB/s). A single TPU Pod interconnects 9,600 of these chips, which collectively share roughly 2 petabytes (PB) of memory and deliver 121 exaflops of compute at FP4 precision. The inference-optimized TPU 8i goes even further on the memory side: 288GB of HBM per chip, 8.6TB/s of memory bandwidth, and 384 megabytes (MB) of on-chip SRAM (triple that of its predecessor), alongside 10.1 petaflops of FP4 compute.

Dell’s CEO has sounded the alarm as well, predicting that memory requirements for AI accelerators will grow 625-fold between 2023 and 2028, with the supply-demand gap likely persisting until at least that year.
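For readers who want to sanity-check the pod-level figures, here is a minimal back-of-the-envelope sketch in Python that simply aggregates the per-chip numbers quoted above. The variable names and the decimal GB-to-PB conversion are assumptions of this illustration, not anything Google publishes.

```python
# Back-of-the-envelope check of the pod-level figures quoted in the article.
# Per-chip HBM, pod size, and pod compute are taken from the text above;
# the aggregation itself is plain arithmetic.

HBM_PER_CHIP_GB = 216        # training chip ("TPU 8t"), per the article
CHIPS_PER_POD = 9_600        # chips interconnected in one TPU Pod
POD_FP4_EXAFLOPS = 121       # aggregate pod compute at FP4 precision

# Aggregate pod memory, using decimal units (1 PB = 1,000,000 GB).
pod_memory_pb = HBM_PER_CHIP_GB * CHIPS_PER_POD / 1_000_000

# Compute implied per chip, derived from the pod-level figure.
per_chip_fp4_pflops = POD_FP4_EXAFLOPS * 1_000 / CHIPS_PER_POD

print(f"Aggregate pod HBM: {pod_memory_pb:.2f} PB")          # ~2.07 PB, i.e. the ~2 PB quoted
print(f"Implied FP4 per chip: {per_chip_fp4_pflops:.1f} PF")  # ~12.6 petaflops per training chip
```

The 216GB-per-chip and 9,600-chip figures multiply out to about 2.07 PB, which matches the roughly 2 PB of shared pod memory cited above.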