NVIDIA GeForce RTX 20 Series Review Ft. RTX 2080 Ti & RTX 2080 Founders Edition Graphics Cards – Turing Ray Traces The Gaming Industry
NVIDIA GeForce RTX 2080 Ti & GeForce RTX 208019th September, 2018
NVIDIA Turing GPU - NVIDIA Turing GPUs - TU102, TU104, TU106
NVIDIA is for the first time not only launching the **80 and **70 cards along with the flagship **80 Ti model but they are also launching graphics cards with three different GPUs. While the GPUs are similar in design, the configurations are very different and one thing we can tell is that the configs leave a lot of room for NVIDIA to expand upon in the future if they want to.
What I mean to say is that the RTX 2080 Ti isn’t based on the full TU102 GPU, the RTX 2080 is also not based on the full TU104 GPU while the RTX 2070 is the only card that utilizes the full config of the GPU its based upon, the Turing TU106.
One more thing, these GPUs are really huge in terms of die size compared to the Pascal GPU, while using the 12nm process. The reason being the added INT32 execution units and Tensor cores which weren’t available on any previous consumer based GeForce graphics cards. Hence, the TU106 GPU which succeeds the GP106 GPU is over twice as large as its predecessor (445mm2 versus 200mm2).
NVIDIA Turing TU102 GPU
The TU102 is made up of 6 graphics processing clusters with 6 SM units on each cluster. That makes up 72 SM units for a total of 4608 cores in an 18.6 billion transistor package measuring 754mm2.
NVIDIA TU102 GPU
|Flagship GPU Design||Titan X||Titan XP||Titan V||Quadro RTX 8000|
|Process Node||28nm||16nm FF||12nm FFN||12nm FFN|
|Transistors||8 Billion||12 Billion||21.1 Billion||18.6 Billion|
|L2 Cache||3 MB||3 MB||4.5 MB||6 MB|
|CUDA Cores / SM||128||128||64||64|
|CUDA Cores / GPU||3072||3840||5120||4608|
|Tensor Cores / SM||N/A||N/A||8||8|
|Tensor Cores / GPU||N/A||N/A||640||576|
|GPU Base Clock||1000 MHz||1405 MHz||1200 MHz||1455 MHz|
|GPU Boost Clock||1089 MHz||1582 MHz||1455 MHz||1770 MHz|
|Peak FP32 TFLOPs||6.6||12.1||13.8||16.3|
|Peak FP16 TFLOPs||N/A||N/A||27.6||32.6|
|Peak FP16 TOPs||N/A||N/A||110||130.5|
|Peak INT8 TOPs||N/A||N/A||N/A||261.0|
|Peak INT4 TOPs||N/A||N/A||N/A||522.0|
|VRAM||12 GB GDDR5||12 GB GDDR5X||12 GB HBM2||48 GB GDDR6|
|Memory Clock||7 Gbps||11.4 Gbps||1.7 Gbps||14 Gbps|
|Memory Bandwidth||336.6 GB/s||547.6 GB/s||652.8 GB/s||672 GB/s|
|Register File Size / SM||256 KB||256 KB||256 KB||256 KB|
|Register File / GPU||6 MB||7 MB||14 MB||18 MB|
|Texture Fill Rate||209.1 GT/s||380 GT/s||465.6 GT/s||510 GT/s|
NVIDIA Turing TU104 GPU
The TU104 is made up of 6 graphics processing clusters with 4 SM units on each cluster. That makes up 48 SM units for a total of 3072 cores in a 13.6 billion transistor package measuring 545mm2.
NVIDIA Turing TU106 GPU
The TU106 is made up of 3 Graphics processing clusters with 6 SM units on each cluster. That makes up 36 SM units for a total of 2304 Cores in a 10.6 billion transistor package measuring 445mm2.
Here’s another thing, the GP106 was used in the GTX 1060 which is more of a mainstream graphics card. However, while the RTX 2070 rocks a TU106 GPU which may make it look like a mainstream GPU with a much higher price tag, it does have overall better specifications compared to the GP104 based GTX 1070 with higher Cores, better memory, and more features. It also has around twice as many cores as the GTX 1060 so calling it a mainstream graphics card won’t be a wise choice.