Hardware Industry

NVIDIA’s Next-Gen Rubin AI Servers to Ship to Customers by August This Year, as Key Supplier Quanta Confirms No Complications in Mass Production

Muhammad Zuhair

• Jan 19, 2026 at 06:09am EST

NVIDIA Prepares For Next-Gen GPU Architectures As "Blackwell-Next" Spotted In Linux 7.2 Kernel Patch — Image Credits: NVIDIA

NVIDIA's next-gen Vera Rubin AI racks are expected to ship to customers by Q3 of this year, but initial production levels aren't likely to be "too high".

Suppliers Face No Issue Ramping Up Vera Rubin Production For Now, Given How NVIDIA Has Retained Several Elements From Blackwell

Team Green surprised us by announcing that Vera Rubin has entered "full production" in Q1 2026, which was seen as a massive achievement, given that NVIDIA delivered earlier than its product cadence. However, NVIDIA didn't reveal when hyperscalers would receive the first batch of the new AI racks, but according to Quanta's Executive VP Mike Yang (via Ctee), initial Rubin units could be delivered to customers by August, ensuring complete hyperscaler integration by the last quarter of this year.

Given that the Rubin architecture has shifted from chiplet design to advanced packaging and HBM integration, there were concerns about whether NVIDIA could deliver on its promised "H2 2026" timeline. However, Yang has revealed that several components within NVIDIA's Rubin infrastructure are the same as those in the Blackwell series, which means transitioning production lines doesn't pose as big a risk for now. NVIDIA is expected to follow a similar trend to what we saw with Blackwell Ultra, limiting initial shipments and then ramping up volume in Q1 of next year.

For those unaware, the Rubin lineup was unveiled extensively at this year's CES, with a key focus on "six different" chips involving networking and compute power. Here's what these chips look like:

Rubin GPU (with 336 Billion Transistors)
Vera CPU (with 227 Billion Transistors)
NVLINK 6 Switch for Interconnect
CX9 & BF4 for Networking
Spectrum-X 102.4T CPO for silicon photonics

It is anticipated that Rubin shipments will be limited to the NVL72 configuration for now, despite NVIDIA revealing an NVL144 configuration at last year's GTC. By keeping chip count in check, the company aims to control thermal draw and ease the production process. It appears that Team Green has learned that the supply chain does take time to adjust to the company's architectural changes, as we saw significant complications when switching from Blackwell to Blackwell Ultra due to the use of Cordeila board design and similar changes.

NVIDIA's Rubin AI series is expected to be fully adopted by hyperscalers within the Q4 2026 to Q1 2027 timeline, and frontier models will benefit from the significant performance coming on board soon. Early customers include OpenAI, Microsoft, Google, and neoclouds like CoreWeave.

About the author: Muhammad Zuhair is a hardware and technology reporter for Wccftech, specializing in the semiconductor industry and the complex interplay between technology, manufacturing, and geopolitics. His coverage focuses on the corporate strategies and technological roadmaps of industry giants like TSMC, NVIDIA, Samsung, and Intel. Zuhair's expertise lies in deconstructing complex topics such as fabrication nodes (e.g., 2nm process), the economic impact of policies like the CHIPS Act, and the strategic development of AI infrastructure from NVIDIA, AMD and Intel.

Follow Wccftech on Google to get more of our news coverage in your feeds.

Read all comments on NVIDIA’s Next-Gen Rubin AI Servers to Ship to Customers by August This Year, as Key Supplier Quanta Confirms No Complications in Mass Production

NVIDIA’s Next-Gen Rubin AI Servers to Ship to Customers by August This Year, as Key Supplier Quanta Confirms No Complications in Mass Production

Suppliers Face No Issue Ramping Up Vera Rubin Production For Now, Given How NVIDIA Has Retained Several Elements From Blackwell

Trending Stories

Apple Ran Out Of Patience In Arranging An M2 Max MacBook Pro Replacement Part, Customer Receives M5 Max Replacement Plus Store Credit

PlayStation 6 Delay Fears May Be Overblown, As Known Leaker Silently Pushes Back Against Latest Reports

Final Fantasy VII Revelation Brings Back Rebirth’s 8 Regions, but Hamaguchi Insists the World Map Now Feels Truly Substantial

God of War Laufey Star Deborah Ann Woll Breaks Silence on a Massive 10-Year Secret: “I Still Pinch Myself”

Epic Games Loses Level Design Legend Sjoerd de Jong After 12 Years As Unreal Engine 6 Embraces AI

Popular Discussions

Apple To Design & Build Chips At Intel on American Soil, US President Confirms

AMD Olympic Ridge “Zen 6” Ryzen CPUs Get Integrated NPU At The Cost of iGPU, CUDIMM Ready Platform

NVIDIA Blackwell Sweeps Every MLPerf 6.0 Benchmark With No Competition In Sight, While GB300 Systems Run Up to 60% Faster Than GB200

AMD Reportedly Plots Another 10-15% RX 9000 Price Hike As The RAMpocalypse Swallows The GPU Market

AMD’s Next-Gen Threadripper “Mustang Peak” Confirmed: Built For TR6 Platform, Bringing 2nm Zen 6 Cores and PCIe Gen6

NVIDIA’s Next-Gen Rubin AI Servers to Ship to Customers by August This Year, as Key Supplier Quanta Confirms No Complications in Mass Production

Suppliers Face No Issue Ramping Up Vera Rubin Production For Now, Given How NVIDIA Has Retained Several Elements From Blackwell

Related Story Tensordyne’s 3nm Napier AI Chip Promises 13x Higher Token Throughput Than Blackwell & Blazes Past Rubin With 1000 Tokens/s In Multi-Trillion Parameter Models

Further Reading

Trending Stories

Popular Discussions