NVIDIA Hopper GH100 GPU Rumored To House Over 140 Billion Transistors In A Massive 5nm Package

Hassan Mujtaba • Feb 7, 2022 at 05:52am EST

NVIDIA Hopper GPUs Featuring MCM Technology Rumored To Tape Out Soon

NVIDIA's next-generation Hopper GH100 GPU is going to be a monster of a chip based on its die size and transistor count.

NVIDIA Hopper GH100 GPU For Next-Gen Data Centers Rumored To Feature Over 140 Billion Transistors In A Monster 5nm Package

A few weeks ago, it was reported in a rumor that NVIDIA's Hopper GH100 flagship GPU would be based on a 5nm process node with a die size measuring close to 900mm2. This would make it the largest GPU ever produced, not only on the 5nm process node but also in all existence. But that's not all, now a new rumor has popped up over at Chiphell Forums which alleges that the GPU could feature over 140 Billion transistors.

Well, just how much are 140 Billion transistors? For comparison, the current flagship data center chips such as AMD's Aldebaran for Instinct MI200 series and NVIDIA Ampere GA100 for the A100 accelerators feature just 58.2 and 54.2 Billion transistors, respectively. That's almost a 2.5x overall transistor count bump for the Hopper GH100 GPU if the rumor holds true.

In terms of density, the NVIDIA Ampere A100 amounts to 65.6M transistors per mm2, while the Aldebaran GPU (based on its speculated die size of 790mm2) should have a density of 73.6M transistors per mm2. Assuming that the GH100 measures around 900mm2, its density should easily cross 150M transistors per mm2. That's more than twice the density increase on the 5nm process node.

But once again, these are all rumored figures and will only be applicable to the monolithic GH100 Hopper GPU. The MCM GPU is an entirely separate entity based on rumors and will come as the GH102 GPU. We don't know the exact specifications except what research papers & rumors have told us. But all in all, the NVIDIA Hopper GPU, both, in its monolithic and MCM form, will offer a serious increase in transistor count and feature advanced 5nm packaging solutions.

NVIDIA Hopper GPU - Everything We Know So Far

From previous information, we know that NVIDIA's GH100 accelerator would be based on TSMC's 5nm process node. Hopper is supposed to have two next-gen GPU modules so we are looking at 288 SM units in total.

We can't give a rundown on the core count yet since we don't know the number of cores featured in each SMs but if it's going to stick to 64 cores per SM, then we get 18,432 cores which are 2.25x more than the full GA100 GPU configuration. NVIDIA could also leverage more FP64, FP16 & Tensor cores within its Hopper GPU which would drive up performance immensely. And that's going to be a necessity to rival Intel's Ponte Vecchio which is expected to feature 1:1 FP64.

It is likely that the final configuration will come with 134 of the 144 SM units enabled on each GPU module and as such, we are likely looking at a single GH100 die in action. But it is unlikely that NVIDIA would reach the same FP32 or FP64 Flops as MI200's without using GPU Sparsity.

But NVIDIA may likely have a secret weapon in their sleeves and that would be the COPA-based GPU implementation of Hopper. NVIDIA talks about two Domain-Specialized COPA-GPUs based on next-generation architecture, one for HPC and one for DL segment. The HPC variant features a very standard approach which consists of an MCM GPU design and the respective HBM/MC+HBM (IO) chiplets but the DL variant is where things start to get interesting. The DL variant houses a huge cache on an entirely separate die that is interconnected with the GPU modules.

Architecture	LLC Capacity	DRAM BW	DRAM Capacity
Configuration	(MB)	(TB/s)	(GB)
GPU-N	60	2.7	100
COPA-GPU-1	960	2.7	100
COPA-GPU-2	960	4.5	167
COPA-GPU-3	1,920	2.7	100
COPA-GPU-4	1,920	4.5	167
COPA-GPU-5	1,920	6.3	233
Perfect L2	infinite	infinite	infinite

Various variants have been outlined with up to 960 / 1920 MB of LLC (Last-Level-Cache), HBM2e DRAM capacities of up to 233 GB, and bandwidth of up to 6.3 TB/s. These are all theoretical but given that NVIDIA has discussed them now, we may likely see a Hopper variant with such a design during the full unveil at GTC 2022.

NVIDIA Hopper GH100 'Official Specs':

NVIDIA Tesla Graphics Card	Tesla K40 (PCI-Express)	Tesla M40 (PCI-Express)	Tesla P100 (PCI-Express)	Tesla P100 (SXM2)	Tesla V100 (SXM2)	NVIDIA A100 (SXM4)	NVIDIA H100 (PCIe)	NVIDIA H100 (SMX5)
GPU	GK110 (Kepler)	GM200 (Maxwell)	GP100 (Pascal)	GP100 (Pascal)	GV100 (Volta)	GA100 (Ampere)	GH100 (Hopper)	GH100 (Hopper)
Process Node	28nm	28nm	16nm	16nm	12nm	7nm	4nm	4nm
Transistors	7.1 Billion	8 Billion	15.3 Billion	15.3 Billion	21.1 Billion	54.2 Billion	80 Billion	80 Billion
GPU Die Size	551 mm2	601 mm2	610 mm2	610 mm2	815mm2	826mm2	814mm2	814mm2
SMs	15	24	56	56	80	108	114	132
TPCs	15	24	28	28	40	54	57	66
FP32 CUDA Cores Per SM	192	128	64	64	64	64	128	128
FP64 CUDA Cores / SM	64	4	32	32	32	32	128	128
FP32 CUDA Cores	2880	3072	3584	3584	5120	6912	14592	16896
FP64 CUDA Cores	960	96	1792	1792	2560	3456	14592	16896
Tensor Cores	N/A	N/A	N/A	N/A	640	432	456	528
Texture Units	240	192	224	224	320	432	456	528
Boost Clock	875 MHz	1114 MHz	1329MHz	1480 MHz	1530 MHz	1410 MHz	TBD	TBD
TOPs (DNN/AI)	N/A	N/A	N/A	N/A	125 TOPs	1248 TOPs 2496 TOPs with Sparsity	1600 TOPs 3200 TOPs	2000 TOPs 4000 TOPs
FP16 Compute	N/A	N/A	18.7 TFLOPs	21.2 TFLOPs	30.4 TFLOPs	312 TFLOPs 624 TFLOPs with Sparsity	1600 TFLOPs	2000 TFLOPs
FP32 Compute	5.04 TFLOPs	6.8 TFLOPs	10.0 TFLOPs	10.6 TFLOPs	15.7 TFLOPs	19.4 TFLOPs 156 TFLOPs With Sparsity	800 TFLOPs	1000 TFLOPs
FP64 Compute	1.68 TFLOPs	0.2 TFLOPs	4.7 TFLOPs	5.30 TFLOPs	7.80 TFLOPs	19.5 TFLOPs (9.7 TFLOPs standard)	48 TFLOPs	60 TFLOPs
Memory Interface	384-bit GDDR5	384-bit GDDR5	4096-bit HBM2	4096-bit HBM2	4096-bit HBM2	6144-bit HBM2e	5120-bit HBM2e	5120-bit HBM3
Memory Size	12 GB GDDR5 @ 288 GB/s	24 GB GDDR5 @ 288 GB/s	16 GB HBM2 @ 732 GB/s 12 GB HBM2 @ 549 GB/s	16 GB HBM2 @ 732 GB/s	16 GB HBM2 @ 900 GB/s	Up To 40 GB HBM2 @ 1.6 TB/s Up To 80 GB HBM2 @ 1.6 TB/s	Up To 80 GB HBM2e @ 2.0 Gbps	Up To 80 GB HBM3 @ 3.0 Gbps
L2 Cache Size	1536 KB	3072 KB	4096 KB	4096 KB	6144 KB	40960 KB	51200 KB	51200 KB
TDP	235W	250W	250W	300W	300W	400W	350W	700W

News Source: HXL (@9550pro)

About the author: A Software Engineer by training and a PC enthusiast by passion, Hassan Mujtaba serves as Wccftech's Senior Editor for hardware section. With years of experience in the industry, he specializes in deep-dive technical analysis of next-generation CPU and GPU architectures, motherboards, and cooling solutions. His work involves not only breaking news on upcoming technologies but also extensive hands-on reviews and benchmarking.

Follow Wccftech on Google to get more of our news coverage in your feeds.

Read all comments on NVIDIA Hopper GH100 GPU Rumored To House Over 140 Billion Transistors In A Massive 5nm Package

NVIDIA Hopper GH100 GPU Rumored To House Over 140 Billion Transistors In A Massive 5nm Package

NVIDIA Hopper GH100 GPU For Next-Gen Data Centers Rumored To Feature Over 140 Billion Transistors In A Monster 5nm Package

NVIDIA Hopper GPU - Everything We Know So Far

NVIDIA Hopper GH100 'Official Specs':

Trending Stories

Thunderobot Packs An RTX 5070 Into A 1.64 Kg Carbon-Fiber Body, Undercutting LG And ASUS

Huawei Reveals Hybrid Bonding Process For The Kirin 2026’s 3D Stacking Design In New Paper To Intensify Competition; Perks Include Better Efficiency & Bandwidth

Square Enix Shareholder Derails 46th Meeting to Praise The Adventures of Elliot, As Publisher Hints At Future Of Final Fantasy

Apple’s A20 Pro To Break A 13-Year Convention With The 96-Bit LPDDR6, But Pinches Pennies On The iPhone 18 Pro Duo’s NAND As A Counter

Shuhei Yoshida Trashed Steam Machine’s Speed and Price, Yet Admits He Can’t Stop Enjoying Its Compact Design and Quietness

Popular Discussions

AMD Zen 6 Gains a New Low-Power Core Beyond Zen 6 and Zen 6C, Surfacing in Linux Kernel Patches

Intel Expected To Restart Supply Of 10th, 12th, 13th, And 14th Gen Processors In Mainland China

Intel’s Shot At Fabricating Apple’s A20 Chip For The Base iPhone 18 Collapses As A Credible Leaker Calls The Original Source A ‘Blowhard’

Intel Cites Rising Supply Chain Costs As The Reason For Raising Prices Of Intel Core Ultra 200S Plus Processors

Sony Just Killed the Disc for PlayStation 6, and Microsoft’s “Project Helix” Xbox Is Reportedly Following

NVIDIA Hopper GH100 GPU Rumored To House Over 140 Billion Transistors In A Massive 5nm Package

NVIDIA Hopper GH100 GPU For Next-Gen Data Centers Rumored To Feature Over 140 Billion Transistors In A Monster 5nm Package

Related Story NVIDIA GeForce RTX 5090D Becomes The First Blackwell GPU To Hit 4 GHz Overclock In A Spectacular Showcase By Team OGS Using GALAX HOC OC Lab Variant

NVIDIA Hopper GPU - Everything We Know So Far

NVIDIA Hopper GH100 'Official Specs':

Further Reading

Trending Stories

Popular Discussions