NVIDIA Gears Up for Next-Gen Vera Rubin AI Servers While Rolling Out Blackwell Ultra GB300 Units; A Relentless Standard Only Jensen Dares to Set

Muhammad Zuhair
NVIDIA's CEO holding a Grace Blackwell die | Image Credits: NVIDIA

NVIDIA is now preparing for the next generation of AI servers, at a time when the company has just started low-volume production of Blackwell Ultra, which shows that there won't be another NVIDIA at all.

NVIDIA's Rubin Design to Be Finalized By This Month, Mass Production Expected Next Year

Team Green has been accelerating through the AI hype at a rate that no other competitor has managed to replicate. NVIDIA is operating at a six to eight-month product cycle for now, and these are ordinary GPUs; they are massive AI clusters that are worth billions of dollars, and the firm is producing them at a relentless pace, which is simply astonishing. In a report by the Taiwan Economic Daily, it is revealed that NVIDIA is preparing the design for Vera Rubin servers racks, which will be introduced to mainstream suppliers by the end of this month, marking the first step towards production.

Related Story Intel Crescent Island “Xe3P” GPU Scales To 480 GB of “Cost-Optimized” LPDDR5X Memory, Beating NVIDIA Rubin & AMD MI450X With Highest Capacity

NVIDIA's Rubin architecture is seen as the next leap in computing capabilities since Team Green is expected to make changes from the ground up, starting with HBM, process node, design, and much more. It can be called as a landmark similar to the Hopper generation, and how significant of a leap it took with Ampere AI accelerators. It is claimed that Vera Rubin AI server racks will be available in the market from 2026 to 2027, and it will keep the AI bandwagon up and running, to say the least.

For a quick rundown on what to expect with NVIDIA's Rubin, the firm will utilize the next-generation HBM4 chips to power its R100 GPUs, which are said to be a significant upgrade from the modern-day HBM3E standard. Team Green will also adopt TSMC's 3nm (N3P) process and CoWoS-L packaging, which means that Rubin will adopt newer industry standards that will likely take performance to greater levels. More importantly, Rubin will adopt a chiplet design, a first-of-a-kind NVIDIA implementation, and a 4x reticle design (versus 3.3x of Blackwell).

While the Rubin release certainly shows optimism, there's always a question about how NVIDIA would release independent architectures in such short intervals, given that the supply chain gets so little time to adopt newer frameworks. We saw a similar situation with the GB300 AI platform as well, and Team Green ultimately had to resort to using the older Bianca board, which came with the GB200 platform. So, it would be interesting to see how the situation pans out for NVIDIA, but one thing is certain, no company can match the "Jensen speed".

Muhammad Zuhair Photo

About the author: Muhammad Zuhair is a hardware and technology reporter for Wccftech, specializing in the semiconductor industry and the complex interplay between technology, manufacturing, and geopolitics. His coverage focuses on the corporate strategies and technological roadmaps of industry giants like TSMC, NVIDIA, Samsung, and Intel. Zuhair's expertise lies in deconstructing complex topics such as fabrication nodes (e.g., 2nm process), the economic impact of policies like the CHIPS Act, and the strategic development of AI infrastructure from NVIDIA, AMD and Intel.

Follow Wccftech on Google to get more of our news coverage in your feeds.

Button