GTC 2026 Hardware

NVIDIA Vera Rubin Achieves 40 Million Times More Compute In 10 Years: 288 GB HBM4, 22 TB/s Bandwidth, 50 PFLOPs of AI Horsepower

Hassan Mujtaba • Mar 16, 2026 at 04:12pm EDT

A lineup of NVIDIA hardware including 'Rubin,' 'Vera,' 'CX9,' 'BlueField-4,' 'NVLink-6 Switch,' 'Spectrum-X CPO,' 'Groq 3 LPU,' and various compute trays and servers displayed against a black background.

NVIDIA has officially unveiled its next-gen AI data center platform called Vera Rubin, powered by the Rubin GPU and Vera CPU architectures.

NVIDIA Vera Rubin AI Data Center Offers A Stunning 40,000,000x Compute Growth Within A Decade

The NVIDIA Vera Rubin platform is designed with a total of 7 chips and six different racks, each serving a singular purpose, to power next-gen AI datacenters. Those seven chips that have been announced today are:

Rubin (GPU)
Vera (CPU)
CX9 (Connectivity)
BlueField-4 (DPU)
NVLINK-6 Switch (Interconnect)
Spectrum-X CPO (Optics)
Groq 3 (LPU)

First up, we have the Vera Rubin Compute tray, and what has changed is the mounting system, with which AI data centers now take just 2 hours to install instead of 2 days. The Vera Rubin Compute Tray is entirely liquid-cooled, and liquid-cooled by hot water (45 °C), which takes pressure off the data center. This is also the main compute tray, which houses the new Rubin GPUs, featuring two massive reticle-sized dies and 8 HBM sites.

Each NVIDIA Rubin GPU features 288 GB of HBM4 memory, offering up to 22 TB/s of total bandwidth, and 50 PFLOPs of NVFP4 compute performance. Each chip packs 336B transistors, with an additional 2.5 trillion transistors from the HBM4 memory.

Jensen Huang on stage presenting the Rubin GPU and Groq 3 LPU with specs like '288 GB HBM4' and '500 MB SRAM,' captioned 'Uniting Processors of Extreme Performance.'

A presentation slide features the Rubin GPU and Groq 3 LPU with specifications, alongside a person on stage under the headline 'Uniting Processors of Extreme Performances'.

NVIDIA also has a few things to say about its Vera CPU, which offers extremely high single-threaded core performance, incredibly high data output, and extreme levels of energy efficiency. Vera is the world's first and only data center CPU to utilize LPDDR5 memory and offers unrivaled performance per watt. NVIDIA is not just integrating Vera CPUs into its Vera Rubin platform; these will also be shipped standalone, & the company expects this to open another multi-billion-dollar business front for it.

Next comes the NVLink Switch Tray, which is designed with 6th Gen NVLINK, and this rack is a scale-up switching system that is also entirely liquid-cooled.

The Groq 3 LPX compute tray is comprised of 8 GROK LPUs and is codenamed LP30. Each Groq 3 LPU offers 500 MB of SRAM, 150 TB/s of SRAM bandwidth, and 1.2 PFLOPs of FP8 performance. Each chip houses 98B transistors.

NVIDIA's Spectrum-X CPO Switch is the world's first co-packaged optics switch, which is made at TSMC using NVIDIA's Cu-Litho technology. The Spectrum-X switch is now in full production.

The Vera Compute Tray, or ConnectX-9, is also powered by the Vera CPU, and NVIDIA has also adopted a new storage platform to meet the demands of Vera Rubin, called the Bluefield-4 STX storage platform.

	NVIDIA Vera Rubin NVL72	NVIDIA Vera Rubin Superchip	NVIDIA Rubin GPU
Configuration	72 NVIDIA Rubin GPUs \| 36 NVIDIA Vera CPUs	2 NVIDIA Rubin GPUs \| 1 NVIDIA Vera CPU	1 NVIDIA Rubin GPU
NVFP4 Inference	3,600 PFLOPS	100 PFLOPS	50 PFLOPS
NVFP4 Training²	2,520 PFLOPS	70 PFLOPS	35 PFLOPS
FP8/FP6 Training²	1,260 PFLOPS	35 PFLOPS	17.5 PFLOPS
INT8²	18 POPS	0.5 POPS	0.25 POPS
FP16/BF16²	288 PFLOPS	8 PFLOPS	4 PFLOPS
TF32²	144 PFLOPS	4 PFLOPS	2 PFLOPS
FP32	9,360 TFLOPS	260 TFLOPS	130 TFLOPS
FP64	2,400 TFLOPS	67 TFLOPS	33 TFLOPS
FP32 SGEMM³	28,800 TFLOPS	800 TFLOPS	400 TFLOPS
FP64 DGEMM³	14,400 TFLOPS	400 TFLOPS	200 TFLOPS
GPU Memory \| Bandwidth	20.7 TB HBM4 \| 1,580 TB/s	576 GB HBM4 \| 44 TB/s	288 GB HBM4 \| 22 TB/s
NVLink Bandwidth	260 TB/s	7.2 TB/s	3.6 TB/s
NVLink-C2C Bandwidth	65 TB/s	1.8 TB/s	-
CPU Core Count	3,168 custom NVIDIA Olympus cores (Arm® compatible)	88 custom NVIDIA Olympus cores (Arm compatible)	-
CPU Memory	54 TB LPDDR5X	1.5 TB LPDDR5X	-
Total NVIDIA + HBM4 Chips	1,296	30	12

All of these come together in the NVIDIA Vera Rubin NVL72, which will be offered by various partners. Each NVL72 offers a 10x performance per watt increase, 3.6 ExaFlops of NVFP4 performance, 1.6 PB/s of HBM4 bandwidth, and 260 TB/s of NVLINK6 interconnect speeds. The NVIDIA Vera Rubin platform is opening the next AI frontier with:

NVIDIA Spectrum-6 SPX Ethernet racks
Vera Rubin NVL72 GPU racks
Vera CPU racks
NVIDIA Groq 3 LPX inference accelerator racks
NVIDIA BlueField-4 STX storage racks

Also, as mentioned above, NVIDIA Vera CPUs will also come in a 256 Vera CPU rack, offering 300 TB/s of LPDDR5X bandwidth, all connected together using an ETL Spine, and with 6.5x the throughput versus the last-gen solution.

Broad Ecosystem Support
Vera Rubin-based products will be available from partners starting the second half of this year. This includes leading cloud providers Amazon Web Services, Google Cloud, Microsoft Azure and Oracle Cloud Infrastructure, along with NVIDIA Cloud Partners CoreWeave, Crusoe, Lambda, Nebius, Nscale and Together AI.

Global system manufacturers Cisco, Dell Technologies, HPE, Lenovo and Supermicro are expected to deliver a wide range of servers based on Vera Rubin products, as well as Aivres, ASUS, Foxconn, GIGABYTE, Inventec, Pegatron, Quanta Cloud Technology (QCT), Wistron and Wiwynn.

AI labs and frontier model developers including Anthropic, Meta, Mistral AI and OpenAI are looking to use the NVIDIA Vera Rubin platform to train larger, more capable models and to serve long-context, multimodal systems at lower latency and cost than with prior GPU generations.

About the author: A Software Engineer by training and a PC enthusiast by passion, Hassan Mujtaba serves as Wccftech's Senior Editor for hardware section. With years of experience in the industry, he specializes in deep-dive technical analysis of next-generation CPU and GPU architectures, motherboards, and cooling solutions. His work involves not only breaking news on upcoming technologies but also extensive hands-on reviews and benchmarking.

Follow Wccftech on Google to get more of our news coverage in your feeds.

Read all comments on NVIDIA Vera Rubin Achieves 40 Million Times More Compute In 10 Years: 288 GB HBM4, 22 TB/s Bandwidth, 50 PFLOPs of AI Horsepower

NVIDIA Vera Rubin Achieves 40 Million Times More Compute In 10 Years: 288 GB HBM4, 22 TB/s Bandwidth, 50 PFLOPs of AI Horsepower

NVIDIA Vera Rubin AI Data Center Offers A Stunning 40,000,000x Compute Growth Within A Decade

Trending Stories

Square Enix’s Final Fantasy VII Rebirth Looks Like a Remaster on PC, as Shader Injector 2.0 Delivers Series’ Best Visuals

Ubisoft Copies The Crimson Desert’s Playbook, As Assassin’s Creed Black Flag Resynced Ditches Roadmap For Community Feedback

Intel’s Former CEO Gelsinger Admits Firm ‘Scoffed’ at NVIDIA’s GPUs While Riding High on CPU Dominance & Makes Big Quantum Computing Claims

PlayStation 6 Patent Scraps Liquid Metal Cooling After PS5 Leaks Fried APUs And Motherboards For Years

GameStop May Have Leaked Zelda: Ocarina of Time Remake Pre-Orders for August 4, Hinting First Real Footage Isn’t Far

Popular Discussions

AMD Radeon Drivers Silently Add Multi Frame Generation “MFG 8x”, Ray Regeneration, and Neural Radiance Overrides, Hinting At A Bigger FSR Push

AMD Ryzen 7 7700X3D 4.5 GHz “3D V-Cache” CPU Review: The Budget X3D Champ For AM5

NVIDIA GeForce RTX 50 SUPER GPUs Have Reportedly Arrived At AIBs, But Are On Hold Due To Undecided Memory Prices

AMD Ryzen 7 5800X3D Outsells Ryzen 7 7800X3D For The Same Price On Amazon Despite Being Weaker

AMD Ryzen 7 7800X3D CPU Drops To $299 A Day Ahead of 7700X3D’s Launch, Bringing 3D V-Cache Goodness To Mainstream Gamers

NVIDIA Vera Rubin Achieves 40 Million Times More Compute In 10 Years: 288 GB HBM4, 22 TB/s Bandwidth, 50 PFLOPs of AI Horsepower

NVIDIA Vera Rubin AI Data Center Offers A Stunning 40,000,000x Compute Growth Within A Decade

Related Story AMD’s Instinct MI355X Snags a $350 Million Customer as TensorWave Doubles Down Ahead of the MI455X vs Vera Rubin Showdown

Further Reading

Trending Stories

Popular Discussions