cover
Hardware

NVIDIA GeForce RTX 20 Series Review Ft. RTX 2080 Ti & RTX 2080 Founders Edition Graphics Cards – Turing Ray Traces The Gaming Industry

Hassan Mujtaba & Keith May

NVIDIA Turing GPU - Turing GDDR6 Memory Subsystem Deep Dive

Turing is a very powerful core of its generation and there's nothing like it. High-performance GPUs of this caliber require to be fed by lots of bandwidth. NVIDIA's Volta GPUs are fed by the fastest memory standard in the industry aka HBM2 but while they make sense from an HPC standpoint, they are nowhere near the price to be featured on consumer level products. And GDDR5 has already exceeded its maximum potential with G5X. So this is where GDDR6 enters the industry.

Related Story NVIDIA Moves Gaming Segment Under “Edge Computing”, Posts 29% Revenue Growth From Blackwell Workstations But Gaming GPUs Slow Down Due To “Elevated” Memory Prices

While GDDR6 follows an evolutionary path over GDDR5 and GDDR5X memory, there are still some significant changes in the underlying architecture to boost memory bandwidth while saving power. This makes the VRAM a viable option for next-generation consumer graphics cards such as NVIDIA's upcoming line of the GeForce products. Furthermore, unlike GDDR5X which was only supported and produced by Micron, GDDR6 has the backing of all three players which includes Samsung, SK Hynix, and Micron. NVIDIA is continuing their partnership with Micron and featuring their memory on the GeForce RTX cards but if they were ever to run in a shortage, there won't be an issue as NVIDIA can select from other manufacturers too.

For those who like to know what difference is between GDDR5 and GDDR6, we know from the official specifications published by JEDEC, that both memory standards are not a whole lot different from each other but they aren’t the same thing either. The GDDR6 solution is built upon the DNA of GDDR5X and has been updated to deliver twice the data rate and denser die capacities.

micron-gddr6-memory_3-custom
micron-gddr6-memory_4-custom
micron-gddr6-memory_5-custom
micron-gddr6-memory_6-custom
micron-gddr6-memory_7-custom
micron-gddr6-memory_8-custom
micron-gddr6-memory_9-custom
micron-gddr6-memory_10-custom
micron-gddr6-memory_11-custom
micron-gddr6-memory_12
micron-gddr6-memory_13
micron-gddr6-memory_14
micron-gddr6-memory_15

While the new memory technology would be very similar to GDDR5X, there are a few differences of which the major ones include:

  1. The introduction of an FBGA180 ball package with increased pitch
  2. A dual channel architecture

There are a lot of design changes that went in developing GDDR6 to achieve the faster transfer speeds, higher bandwidth and in a package that consumers just around the same power or even lower. Samsung states that GDDR6 has 35% lower power input than GDDR5 DRAM.

Coming to the specifications in detail, the Samsung 16 Gb GDDR6 memory die will be built on the 10nm process node which Samsung is calling as the most advanced memory node to date. It will double the density of their GDDR5 solution which was composed of a 20nm 8 Gb die. According to Samsung, their solution will be operating at up to 18 Gbps against a previous standard speed of 16 Gbps and that is a big deal here. Each die will be able to deliver a data transfer rate of 72 Gbps and hold a capacity of 2 GB VRAM. The solution will be able to do all of this with 35% lower power input at just 1.35V compared to 1.55V

This means that a solution based on a 384-bit interface and surrounded by 12 DRAM dies could feature up to 24 GB of VRAM while a 256-bit solution can house up to 16 GB of VRAM. That’s twice the VRAM capacity as current generation cards. While VRAM is one thing, the maximum bandwidth output on a 384-bit card can reach a blistering fast 672 GB/s while the 256-bit solution can reach a stunning 448 GB/s transfer rate on existing 14 Gbps dies which are in full production.

GPU Memory Technology Updates

Graphics Card NameMemory TechnologyMemory SpeedMemory BusMemory BandwidthRelease
AMD Radeon R9 Fury XHBM11.0 Gbps4096-bit512 GB/s2015
NVIDIA GTX 1080GDDR5X10.0 Gbps256-bit320 GB/s2016
NVIDIA Tesla P100HBM21.4 Gbps4096-bit720 GB/s2016
NVIDIA Titan XpGDDR5X11.4 Gbps384-bit547 GB/s2017
AMD RX Vega 64HBM21.9 Gbps2048-bit483 GB/s2017
NVIDIA Titan VHBM21.7 Gbps3072-bit652 GB/s2017
NVIDIA Tesla V100HBM21.7 Gbps4096-bit901 GB/s2017
NVIDIA RTX 2080 TiGDDR614.0 Gbps384-bit672 GB/s2018
AMD Instinct MI100HBM22.4 Gbps4096-bit1229 GB/s2020
NVIDIA A100 80 GBHBM2e3.2 Gbps5120-bit2039 GB/s2020
NVIDIA RTX 3090GDDR6X19.5 Gbps384-bit936.2 GB/s2020
AMD Instinct MI200HBM2e3.2 Gbps8192-bit3200 GB/s2021
NVIDIA RTX 3090 TiGDDR6X21.0 Gbps384-bit1008 GB/s2022
NVIDIA H100 80 GBHBM3/E2.6 Gbps5120-bit1681 GB/s2022

NVIDIA Turing GPUs With Better Memory Compression – Effective Memory Bandwidth Increased Up To 50% Over Pascal GPUs, Over 1.5 TB/s

One of the key improvements of Pascal over Maxwell was the faster memory compression algorithms which delivered very high bandwidth by using various compression and caching techniques.

With Turing, we are looking at the third generation of memory compression architecture which is said to effectively deliver up to 50% boost in effective bandwidth when compared to Pascal GPUs. We know that the Pascal GeForce GTX 1080 Ti memory bandwidth was boosted to 1.2 TB/s over the raw 484.4 GB/s bandwidth when using these algorithms and with Turing, NVIDIA is saying that we should expect 50% more effective bandwidth with Memory Compression 3.0.

Since Turing GPU already have higher raw bandwidth compared to Pascal GPUs (RTX 2080 Ti with 616 GB/s), we can expect the effective bandwidth using the new algorithm to reach past 1.5 TB/s which is very good considering it would help games deliver even better performance on higher resolutions which the graphics cards are aiming at.

You can find additional information about our hardware review process and ethics policy here.

Hassan Mujtaba Photo

About the author: A Software Engineer by training and a PC enthusiast by passion, Hassan Mujtaba serves as Wccftech's Senior Editor for hardware section. With years of experience in the industry, he specializes in deep-dive technical analysis of next-generation CPU and GPU architectures, motherboards, and cooling solutions. His work involves not only breaking news on upcoming technologies but also extensive hands-on reviews and benchmarking.

Follow Wccftech on Google to get more of our news coverage in your feeds.

Button