NVIDIA has officially unveiled its next-gen AI data center platform called Vera Rubin, powered by the Rubin GPU and Vera CPU architectures.
NVIDIA Vera Rubin AI Data Center Offers A Stunning 40,000,000x Compute Growth Within A Decade
The NVIDIA Vera Rubin platform is designed with a total of 7 chips and six different racks, each serving a singular purpose, to power next-gen AI datacenters. Those seven chips that have been announced today are:
- Rubin (GPU)
- Vera (CPU)
- CX9 (Connectivity)
- BlueField-4 (DPU)
- NVLINK-6 Switch (Interconnect)
- Spectrum-X CPO (Optics)
- Groq 3 (LPU)

First up, we have the Vera Rubin Compute tray, and what has changed is the mounting system, with which AI data centers now take just 2 hours to install instead of 2 days. The Vera Rubin Compute Tray is entirely liquid-cooled, and liquid-cooled by hot water (45 °C), which takes pressure off the data center. This is also the main compute tray, which houses the new Rubin GPUs, featuring two massive reticle-sized dies and 8 HBM sites.
Each NVIDIA Rubin GPU features 288 GB of HBM4 memory, offering up to 22 TB/s of total bandwidth, and 50 PFLOPs of NVFP4 compute performance. Each chip packs 336B transistors, with an additional 2.5 trillion transistors from the HBM4 memory.
NVIDIA also has a few things to say about its Vera CPU, which offers extremely high single-threaded core performance, incredibly high data output, and extreme levels of energy efficiency. Vera is the world's first and only data center CPU to utilize LPDDR5 memory and offers unrivaled performance per watt. NVIDIA is not just integrating Vera CPUs into its Vera Rubin platform; these will also be shipped standalone, & the company expects this to open another multi-billion-dollar business front for it.
Next comes the NVLink Switch Tray, which is designed with 6th Gen NVLINK, and this rack is a scale-up switching system that is also entirely liquid-cooled.
The Groq 3 LPX compute tray is comprised of 8 GROK LPUs and is codenamed LP30. Each Groq 3 LPU offers 500 MB of SRAM, 150 TB/s of SRAM bandwidth, and 1.2 PFLOPs of FP8 performance. Each chip houses 98B transistors.
NVIDIA's Spectrum-X CPO Switch is the world's first co-packaged optics switch, which is made at TSMC using NVIDIA's Cu-Litho technology. The Spectrum-X switch is now in full production.
The Vera Compute Tray, or ConnectX-9, is also powered by the Vera CPU, and NVIDIA has also adopted a new storage platform to meet the demands of Vera Rubin, called the Bluefield-4 STX storage platform.
| NVIDIA Vera Rubin NVL72 | NVIDIA Vera Rubin Superchip | NVIDIA Rubin GPU | |
|---|---|---|---|
| Configuration | 72 NVIDIA Rubin GPUs | 36 NVIDIA Vera CPUs | 2 NVIDIA Rubin GPUs | 1 NVIDIA Vera CPU | 1 NVIDIA Rubin GPU |
| NVFP4 Inference | 3,600 PFLOPS | 100 PFLOPS | 50 PFLOPS |
| NVFP4 Training² | 2,520 PFLOPS | 70 PFLOPS | 35 PFLOPS |
| FP8/FP6 Training² | 1,260 PFLOPS | 35 PFLOPS | 17.5 PFLOPS |
| INT8² | 18 POPS | 0.5 POPS | 0.25 POPS |
| FP16/BF16² | 288 PFLOPS | 8 PFLOPS | 4 PFLOPS |
| TF32² | 144 PFLOPS | 4 PFLOPS | 2 PFLOPS |
| FP32 | 9,360 TFLOPS | 260 TFLOPS | 130 TFLOPS |
| FP64 | 2,400 TFLOPS | 67 TFLOPS | 33 TFLOPS |
| FP32 SGEMM³ | 28,800 TFLOPS | 800 TFLOPS | 400 TFLOPS |
| FP64 DGEMM³ | 14,400 TFLOPS | 400 TFLOPS | 200 TFLOPS |
| GPU Memory | Bandwidth | 20.7 TB HBM4 | 1,580 TB/s | 576 GB HBM4 | 44 TB/s | 288 GB HBM4 | 22 TB/s |
| NVLink Bandwidth | 260 TB/s | 7.2 TB/s | 3.6 TB/s |
| NVLink-C2C Bandwidth | 65 TB/s | 1.8 TB/s | - |
| CPU Core Count | 3,168 custom NVIDIA Olympus cores (Arm® compatible) | 88 custom NVIDIA Olympus cores (Arm compatible) | - |
| CPU Memory | 54 TB LPDDR5X | 1.5 TB LPDDR5X | - |
| Total NVIDIA + HBM4 Chips | 1,296 | 30 | 12 |

All of these come together in the NVIDIA Vera Rubin NVL72, which will be offered by various partners. Each NVL72 offers a 10x performance per watt increase, 3.6 ExaFlops of NVFP4 performance, 1.6 PB/s of HBM4 bandwidth, and 260 TB/s of NVLINK6 interconnect speeds. The NVIDIA Vera Rubin platform is opening the next AI frontier with:
- NVIDIA Spectrum-6 SPX Ethernet racks
- Vera Rubin NVL72 GPU racks
- Vera CPU racks
- NVIDIA Groq 3 LPX inference accelerator racks
- NVIDIA BlueField-4 STX storage racks

Also, as mentioned above, NVIDIA Vera CPUs will also come in a 256 Vera CPU rack, offering 300 TB/s of LPDDR5X bandwidth, all connected together using an ETL Spine, and with 6.5x the throughput versus the last-gen solution.
Broad Ecosystem Support
Vera Rubin-based products will be available from partners starting the second half of this year. This includes leading cloud providers Amazon Web Services, Google Cloud, Microsoft Azure and Oracle Cloud Infrastructure, along with NVIDIA Cloud Partners CoreWeave, Crusoe, Lambda, Nebius, Nscale and Together AI.Global system manufacturers Cisco, Dell Technologies, HPE, Lenovo and Supermicro are expected to deliver a wide range of servers based on Vera Rubin products, as well as Aivres, ASUS, Foxconn, GIGABYTE, Inventec, Pegatron, Quanta Cloud Technology (QCT), Wistron and Wiwynn.
AI labs and frontier model developers including Anthropic, Meta, Mistral AI and OpenAI are looking to use the NVIDIA Vera Rubin platform to train larger, more capable models and to serve long-context, multimodal systems at lower latency and cost than with prior GPU generations.
Follow Wccftech on Google to get more of our news coverage in your feeds.







