⋮    ⋮  

NVIDIA’s Flagship Turing TU102 GPU For GeForce RTX 2080 Ti Detailed – 50% Faster Per Core Performance, 288 TMUs/96 ROPs on Full Die and New Overclocking Features


More details of the NVIDIA Turing TU102 GPU that powers the flagship GeForce RTX 2080 Ti and Quadro RTX 8000/6000 series graphics cards have leaked out over at Videocardz. In an exclusive article, the website discusses information that comes straight from the NVIDIA editor's day, an event which is heavily NDA'd and discusses the architectural and performance aspects of the new GeForce graphics cards coming to the market.

NVIDIA Turing TU102 GPU Powering RTX 2080 Ti Graphics Card Offers A Generational Leap Over Pascal - Full Block Diagram Unveiled

The first details that are talked about are the Turing TU102 GPU block diagram. We have already seen the bare chip and it's massive in terms of die size, the biggest ever **102 GPU that NVIDIA's ever produced. Measuring at 754mm2, the chip comes packed with 18.6 Billion transistors and a totally new architecture design featuring different cores that are operating in tandem to deliver the world's best graphics chips for gamers to date.

NVIDIA GeForce RTX 3090 Ti Graphics Card Specs, Performance, Price & Availability – Everything You Need To Know

The Turing TU102 GPU has 72 Streaming Multiprocessors (SM) featuring 64 CUDA cores each. The full die features 4608 CUDA cores while the GeForce RTX 2080 Ti features 4352 cores. The chip has 576 Tensor cores, 72 RT cores, 36 Geometry Units, 288 Texture Units (TMUs) and 96 ROPs (Raster Operation Units). In addition to the core specs, the chip has 384-bit memory interface supporting a 7 GHz GDDR6 (14 GHz Effective) DRAM design and 2 NVLINK channels. The chip features 6 MB of L2 cache too.

Now since we showed that the GeForce RTX 2080 Ti is based on a cut down TU102 core, the specs are slightly different. We get 68 SMs with 4352 Cores, 544 Tensor cores, 68 RT cores, 34 Geometry Units, 288 TMUs and 96 ROPs. The actual clock speeds are maintained at 1350 MHz base and 1545 MHz boost (1635 MHz OC). The chip features 11 GB of GDDR6 (next-gen) memory featured across a 352-bit bus and clocked at 14 GB/s. This leads to a total bandwidth of 616 GB/s.

NVIDIA GeForce RTX/GTX "Turing" Family:

Graphics Card NameNVIDIA GeForce GTX 1650NVIDIA GeForce GTX 1650 D6NVIDIA GeForce GTX 1650NVIDIA GeForce GTX 1660NVIDIA GeForce GTX 1660 SUPERNVIDIA GeForce GTX 1660 TiNVIDIA GeForce RTX 2060NVIDIA GeForce RTX 2070NVIDIA GeForce RTX 2080NVIDIA GeForce RTX 2080 Ti
GPU ArchitectureTuring GPU (TU117)Turing GPU (TU117)Turing GPU (TU116)Turing GPU (TU116)Turing GPU (TU116)Turing GPU (TU116)Turing GPU (TU106)Turing GPU (TU106)Turing GPU (TU104)Turing GPU (TU102)
Process12nm FNN12nm FNN12nm FNN12nm FNN12nm FNN12nm FNN12nm FNN12nm FNN12nm FNN12nm FNN
Die Size200mm2200mm2284mm2284mm2284mm2284mm2445mm2445mm2545mm2754mm2
Transistors4.7 Billion4.7 Billion6.6 Billion6.6 Billion6.6 Billion6.6 Billion10.6 Billion10.6 Billion13.6 Billion18.6 Billion
CUDA Cores896 Cores896 Cores1280 Cores1408 Cores1408 Cores1536 Cores1920 Cores2304 Cores2944 Cores4352 Cores
GigaRaysN/AN/AN/AN/AN/AN/A5 Giga Rays/s6 Giga Rays/s8 Giga Rays/s10 Giga Rays/s
Cache1.5 MB L2 Cache1.5 MB L2 Cache1.5 MB L2 Cache1.5 MB L2 Cache1.5 MB L2 Cache1.5 MB L2 Cache4 MB L2 Cache4 MB L2 Cache4 MB L2 Cache6 MB L2 Cache
Base Clock1485 MHz1410 MHz1530 MHz1530 MHz1530 MHz1500 MHz1365 MHz1410 MHz1515 MHz1350 MHz
Boost Clock1665 MHz1590 MHz1725 MHz1785 MHz1785 MHz1770 MHz1680 MHz1620 MHz
1710 MHz OC
1710 MHz
1800 MHz OC
1545 MHz
1635 MHz OC
Compute3.0 TFLOPs3.0 TFLOPs4.4 TFLOPs5.0 TFLOPs5.0 TFLOPs5.5 TFLOPs6.5 TFLOPs7.5 TFLOPs10.1 TFLOPs13.4 TFLOPs
MemoryUp To 4 GB GDDR5Up To 4 GB GDDR6Up To 4 GB GDDR6Up To 6 GB GDDR5Up To 6 GB GDDR6Up To 6 GB GDDR6Up To 6 GB GDDR6Up To 8 GB GDDR6Up To 8 GB GDDR6Up To 11 GB GDDR6
Memory Speed8.00 Gbps12.00 Gbps12.00 Gbps8.00 Gbps14.00 Gbps12.00 Gbps14.00 Gbps14.00 Gbps14.00 Gbps14.00 Gbps
Memory Interface128-bit128-bit128-bit192-bit192-bit192-bit192-bit256-bit256-bit352-bit
Memory Bandwidth128 GB/s192 GB/s192 GB/s192 GB/s336 GB/s288 GB/s336 GB/s448 GB/s448 GB/s616 GB/s
Power ConnectorsN/AN/A6 Pin8 Pin8 Pin8 Pin8 Pin8 Pin8+8 Pin8+8 Pin
TDP75W75W100W120W125W120W160W185W (Founders)
175W (Reference)
225W (Founders)
215W (Reference)
260W (Founders)
250W (Reference)
Starting Price$149 US$149 US$159 US$219 US$229 US$279 US$349 US$499 US$699 US$999 US
Price (Founders Edition)$149 US$149 US$159 US$219 US$229 US$279 US$349 US$599 US$799 US$1,199 US
LaunchApril 2019April 2020November 2019March 2019October 2019February 2019January 2019October 2018September 2018September 2018

NVIDIA Turing GPU Packs 50% Better Performance Per Core Than Pascal GPUs

In terms of shading performance which is the direct result of the enhanced core design and GPU architecture revamp, the Turing GPU offers an average uplift of 50% better performance per core compared to Pascal GPUs. In VR games, the shading performance would be a good 2x ahead than what Pascal achieved while many modern gaming titles show a ~50% lead over Pascal with Turing's enhanced core design.

NVIDIA GeForce RTX 3050 Benchmarks Show Poor Crypto Mining But Faster Than RX 6500 XT Graphics Performance

It should be pointed that these are just per core performance gains at the same clock speeds without adding the benefits of other technologies that Turing comes with. That would further increase the performance in a wide variety of gaming applications as we have already seen the gaming performance of a GeForce RTX 2080 to be 50% faster than the GTX 1080 on average and twice as fast with the new DLSS technology.

NVIDIA's New Overclocking Feature Let's The OC Utility Detect The Best Possible Clock Speeds and Voltages For You

Lastly, there's a new overclocking feature being talked about which the new Turing GeForce RTX graphics cards will be able to make use of. To be known as the Scanner (Final Name is still a work-in-progress), the feature will let the OC Utility detect the best clock speeds and voltages for you without the need to do anything. Just run a test through the overclocking utility and you are all set.

The feature is said to be implemented in many Overclocking utilities which will be updated by AIBs such as EVGA, MSI, ASUS, etc but would also be shipping inside new GeForce Experience software which sounds really interesting. This means we may be looking at an updated GeForce experience and software stack with the new cards launch.

NVIDIA GeForce RTX 2080/RTX 2080 Ti Performance Review Embargo Ends 14th September

One more thing to add, it is stated that the performance reviews of the GeForce RTX 2080 Ti and RTX 2080 have an embargo until 14th September so that is when we will be looking at the final reviewers. Also, NVIDIA hasn't shipped the press with working drivers so for the time being, we will only have these official performance figures with us. That gives a 6-day margin for users to get the GeForce RTX products based on their impressions of the reviews.

The GeForce RTX 20 Series Market Availability – Preorder and Shipping Today, On Shelves 20th September

The NVIDIA GeForce RTX 20 series launches today in reference variants first. This time, NVIDIA has already given the green light to their manufacturers to announce custom cards soon after the reference launch which are now available to pre-order on the official GeForce webpageOr you can head over to this article and check out all the glorious non-reference models which you will be able to get very soon.

Check out the other cards in the links below:

Which NVIDIA GeForce RTX 20 Series graphics card are you buying?