Hardware Rumor

Next-Gen NVIDIA GeForce RTX 4090 With Top AD102 GPU Could Be The First Gaming Graphics Card To Break Past 100 TFLOPs

Hassan Mujtaba • May 1, 2022 at 02:27pm EDT

Recent rumors regarding the next-generation NVIDIA GeForce RTX 4090 series suggest that the AD102-powered graphics card might be the first gaming product to break past the 100 TFLOPs barrier.

NVIDIA GeForce RTX 4090 Class Graphics Cards Might Become The First Gaming 'AD102' GPU To Break Past the 100 TFLOPs Barrier

Currently, the NVIDIA GeForce RTX 3090 Ti offers the highest compute performance amongst all gaming graphics cards, hitting anywhere between 40 to 45 TFLOPs of FP32 (Single-Precision) GPU compute. But with the next-generation GPUs arriving later this year, things are going to take a big boost.

To be honest, I don't have much information about AMD. Maybe Lisa and Jensen's competition will give us a 100TFLOPS gaming war in a few months.

— kopite7kimi (@kopite7kimi) April 29, 2022

https://twitter.com/greymon55/status/1520473548782927872

As per rumors from Kopite7kimi and Greymon55, the next-generation graphics cards, not only from NVIDIA but AMD too, are expected to reach the 100 TFLOPs mark. This would mark a huge milestone in the consumer graphics market which has definitely seen a major performance and also a power jump with the current generation of cards. We went straight from 275W being the limit to 350-400W becoming the norm and the likes of the RTX 3090 Ti are already sipping in over 500W of power. The next generation is going to be even more power-hungry but if the compute numbers are anything to go by, then we already know one reason why they are going to sip that much power.

As per the report, NVIDIA's Ada Lovelace GPUs, especially the AD102 chip, has seen some major breakthrough on TSMC's 4N process node. Compared to the previous 2.2-2.4 GHz clock speed rumors, the current estimates are that AMD and NVIDIA will have boost speeds similar to each other and that's around 2.8-3.0 GHz. For NVIDIA specifically, the company is going to fuse a total of 18,432 cores coupled with 96 MB of L2 cache and a 384-bit bus interface. These will be stacked in a 12 GPC die layout with 6 TPCs and 2 SMs per TPC for a total of 144 SMs.

Based on a theoretical clock speed of 2.8 GHz, you get up to 103 TFLOPs of compute performance and the rumors are suggesting even higher boost clocks. Now, these are definitely sounding like peak clocks, similar to AMD's peak frequencies which are higher than the average 'Game' clock. A 100+ TFLOPs compute performance means more than double the horsepower versus the 3090 Ti flagship. But one should keep in mind that compute performance doesn't necessarily indicate the overall gaming performance but despite that, it will be a huge upgrade for gaming PCs and an 8.5x increase over the current fastest console, the Xbox Series X.

FP32 Compute Horsepower Comparisons (Higher is Better)

Compute Power

100

120

100

120

RTX 4090 Ti (Theoretical)

RX 7900 XT (Theoretical)

RTX 3090 Ti

RX 6900 XTX

Xbox Series X

PlayStation 5

So at the end of the day, we are bound to see PC hardware, especially graphics cards, get more powerful but it will be great to see all that power put to good use to run the next generation of games, especially 8K titles with ray-tracing and future graphical effects.

Upcoming Flagship AMD, Intel, NVIDIA GPU Specs (Preliminary)

GPU Name	AD102	Navi 31	Xe2-HPG
Codename	Ada Lovelace	RDNA 3	Battlemage
Flagship SKU	GeForce RTX 4090 Series	Radeon RX 7900 Series	Arc B900 Series
GPU Process	TSMC 4N	TSMC 5nm+ TSMC 6nm	TSCM 5nm?
GPU Package	Monolithic	MCD (Multi-Chiplet Die)	MCM (Multi-Chiplet Module)
GPU Dies	Mono x 1	2 x GCD + 4 x MCD + 1 x IOD	Quad-Tile (tGPU)
GPU Mega Clusters	12 GPCs (Graphics Processing Clusters)	6 Shader Engines	10 Render Slices
GPU Super Clusters	72 TPC (Texture Processing Clusters)	30 WGPs (Per MCD) 60 WGPs (In Total)	40 Xe-Cores (Per Tile) 160 Xe-Cores (Total)
GPU Clusters	144 Stream Multiprocessors (SM)	120 Compute Units (CU) 240 Compute Units (in total)	1280 Xe VE (Per Tile) 5120 Xe VE (In Total)
Cores (Per Die)	18432 CUDA Cores	7680 SPs (Per GCD) 15360 SPs (In Total)	20480 ALUs (In Total)
Peak Clock	~2.85 GHz	~3.0 GHz	TBD
FP32 Compute	~105 TFLOPs	~92 TFLOPs	TBD
Memory Type	GDDR6X	GDDR6	GDDR6?
Memory Capacity	24 GB	32 GB	TBD
Memory Bus	384-bit	256-bit	TBD
Memory Speeds	~21 Gbps	~18 Gbps	TBD
Cache Subsystems	96 MB L2 Cache	512 MB (Infinity Cache)	TBD
TBP	~600W	~500W	TBD
Launch	Q4 2022	Q4 2022	2023

About the author: A Software Engineer by training and a PC enthusiast by passion, Hassan Mujtaba serves as Wccftech's Senior Editor for hardware section. With years of experience in the industry, he specializes in deep-dive technical analysis of next-generation CPU and GPU architectures, motherboards, and cooling solutions. His work involves not only breaking news on upcoming technologies but also extensive hands-on reviews and benchmarking.

Follow Wccftech on Google to get more of our news coverage in your feeds.

Read all comments on Next-Gen NVIDIA GeForce RTX 4090 With Top AD102 GPU Could Be The First Gaming Graphics Card To Break Past 100 TFLOPs

Next-Gen NVIDIA GeForce RTX 4090 With Top AD102 GPU Could Be The First Gaming Graphics Card To Break Past 100 TFLOPs

NVIDIA GeForce RTX 4090 Class Graphics Cards Might Become The First Gaming 'AD102' GPU To Break Past the 100 TFLOPs Barrier

FP32 Compute Horsepower Comparisons (Higher is Better)

Upcoming Flagship AMD, Intel, NVIDIA GPU Specs (Preliminary)

Trending Stories

Xbox Studio Leaders Reportedly Detest Game Pass, Arguing it Destroyed the Value of Their $40+ Games Now Available for Pennies

A Modder Fits Entire Grand Theft Auto PS2 Trilogy Inside a Single Game, While Rockstar Continues to Prepare GTA 6

Some Newer GeForce RTX 5060 GPUs Transition To 16-pin Connector As Vendors Deploy Cut-Down GB205 Die

Over 80% Of Samsung Foundry Workers Are Planning To Leave Amid A Yawning Pay Gap With The Memory Division

NVIDIA’s AI GPUs Face Overwhelming Data Growth Bottleneck, But Samsung’s V10 NAND Production For Next-Generation CMX Storage To Offer Relief, At The Industry’s Expense

Popular Discussions

AMD Medusa Point 10-Core “Zen 6” CPU Beats Strix Point 10-Core “Zen 5” By Nearly 35% While Operating at 5.4 GHz

AMD Ryzen 7 7700X3D 4.5 GHz “3D V-Cache” CPU Review: The Budget X3D Champ For AM5

NVIDIA GeForce RTX 50 SUPER GPUs Have Reportedly Arrived At AIBs, But Are On Hold Due To Undecided Memory Prices

AMD Ryzen 7 5800X3D Outsells Ryzen 7 7800X3D For The Same Price On Amazon Despite Being Weaker

AMD Ryzen 7 7800X3D CPU Drops To $299 A Day Ahead of 7700X3D’s Launch, Bringing 3D V-Cache Goodness To Mainstream Gamers

Next-Gen NVIDIA GeForce RTX 4090 With Top AD102 GPU Could Be The First Gaming Graphics Card To Break Past 100 TFLOPs

NVIDIA GeForce RTX 4090 Class Graphics Cards Might Become The First Gaming 'AD102' GPU To Break Past the 100 TFLOPs Barrier

Related Story CPUID Rolls Out HWMonitor v1.65.1, Removing Additional Hot Spot Temperature Reading

Upcoming Flagship AMD, Intel, NVIDIA GPU Specs (Preliminary)

Further Reading

Trending Stories

Popular Discussions