NVIDIA GeForce GTX 1650 4 GB GDDR5 Graphics Card Review Ft. MSI Gaming X and Gigabyte Gaming OC – Is Entry Level ($150 US) Turing Worth It?
MSI GeForce GTX 1650 Gaming X / Gigabyte GeForce GTX 1650 Gaming OCApril 2019
NVIDIA GeForce GTX 1650 and Turing TU117 GPU
The NVIDIA GeForce GTX 1650 is based on the same Turing architecture that is found on the RTX cards. The main differences between the Turing based RTX and GTX lineup is that only RTX 2060 and above feature hardware support for tensor and ray tracing operations while the GTX series has been stripped off from these specific features.
Based on the 12th Generation Turing GPU architecture, the TU117 GPU found on the GeForce GTX 1650 features the same shader innovations that were introduced on Turing but to balance it out in terms of power, cost and performance, a few adjustments had to be made. This is done through the exclusion of RT cores and Tensor cores on the GeForce GTX cards with Turing architecture. It is pointed out that the Turing architecture on GeForce GTX still delivers improved performance & better efficiency compared to its predecessor while supporting concurrent floating point and integer Ops.
So let's talk about the balanced architecture design of the Turing TU117 and how it still manages to improve upon its Pascal based predecessors. The first thing to mention is the three big changes in the Turing SM. The revamped structure of the Turing TU117 SM enables processing of FP32 & INT operations concurrently through the use of dedicated cores within the SM. The list of features that Turing TU117 GPU adds over Pascal GP106 include:
- Concurrent FP and INT operations
- Variable Rate Shading
- Unified Cache Architecture
- Dedicated FP16 Cores
- Volta NVENC Support
The Turing SM can also perform FP16 operations at double the rate of FP32. The Turing TU117 GPU is rated at 6 TOPs (FP+INT), 6 TFLOPs FP16 and an improved bandwidth due to the higher cache count compared to the Pascal-based GP107.
If we look at some modern gaming titles, then we can see that developers are widely mixing floating point operations with integer instructions. For every 100 instructions in Shadow of the Tomb Raider, for example, 62 are floating point and 38 integers, on average. In previous GPUs, the floating point math datapath in the SM would sit idly whenever one of these non-FP-math runs. Turing adds a second parallel Integer execution unit never to ever CUDA core that executes these instructions in parallel with floating point math.
Now coming to the raw specifications of the GeForce GTX 1650 graphics card. The TU117 GPU is fabricated on the TSMC's 12nm FFN (FinFET NVIDIA) process node. It features 2 GPCs, 8 TPCs, and 14 Turing SMs. Each SM contains 64 cores which equal to a total of 896 CUDA Cores. There's also 56 Texture Units and 32 Raster Operation Units on the card. The base clock is maintained at 1485 MHz while the boost clock is maintained at 1665 MHz.
The GeForce GTX 1650 features 4 GB of GDDR5 VRAM running along a 128-bit bus interface. The memory system would be clocked at 8.0 Gbps delivering an effective bandwidth of 128 GB/s. The card is rated at 3.0 TFLOPs FP32 and 6 TFLOPs FP16 compute power. Officially, the card doesn't require any external power connector but manufacturers offering factory overclocked cards have added them nonetheless. Display outputs include a single DisplayPort, a single DVI-D, and an HDMI connector.
NVIDIA GeForce RTX/GTX "Turing" Family:
|Graphics Card Name||NVIDIA GeForce GTX 1650||NVIDIA GeForce GTX 1660||NVIDIA GeForce GTX 1660 SUPER||NVIDIA GeForce GTX 1660 Ti||NVIDIA GeForce RTX 2060||NVIDIA GeForce RTX 2070||NVIDIA GeForce RTX 2080||NVIDIA GeForce RTX 2080 Ti|
|GPU Architecture||Turing GPU (TU117)||Turing GPU (TU116)||Turing GPU (TU116)||Turing GPU (TU116)||Turing GPU (TU106)||Turing GPU (TU106)||Turing GPU (TU104)||Turing GPU (TU102)|
|Process||12nm FNN||12nm FNN||12nm FNN||12nm FNN||12nm FNN||12nm FNN||12nm FNN||12nm FNN|
|Transistors||4.7 Billion||6.6 Billion||6.6 Billion||6.6 Billion||10.6 Billion||10.6 Billion||13.6 Billion||18.6 Billion|
|CUDA Cores||896 Cores||1408 Cores||1408 Cores||1536 Cores||1920 Cores||2304 Cores||2944 Cores||4352 Cores|
|GigaRays||N/A||N/A||N/A||N/A||5 Giga Rays/s||6 Giga Rays/s||8 Giga Rays/s||10 Giga Rays/s|
|Cache||1.5 MB L2 Cache||1.5 MB L2 Cache||1.5 MB L2 Cache||1.5 MB L2 Cache||4 MB L2 Cache||4 MB L2 Cache||4 MB L2 Cache||6 MB L2 Cache|
|Base Clock||1485 MHz||1530 MHz||1530 MHz||1500 MHz||1365 MHz||1410 MHz||1515 MHz||1350 MHz|
|Boost Clock||1665 MHz||1785 MHz||1785 MHz||1770 MHz||1680 MHz||1620 MHz|
1710 MHz OC
1800 MHz OC
1635 MHz OC
|Compute||3.0 TFLOPs||5.0 TFLOPs||5.0 TFLOPs||5.5 TFLOPs||6.5 TFLOPs||7.5 TFLOPs||10.1 TFLOPs||13.4 TFLOPs|
|Memory||Up To 4 GB GDDR5||Up To 6 GB GDDR5||Up To 6 GB GDDR6||Up To 6 GB GDDR6||Up To 6 GB GDDR6||Up To 8 GB GDDR6||Up To 8 GB GDDR6||Up To 11 GB GDDR6|
|Memory Speed||8.00 Gbps||8.00 Gbps||14.00 Gbps||12.00 Gbps||14.00 Gbps||14.00 Gbps||14.00 Gbps||14.00 Gbps|
|Memory Bandwidth||128 GB/s||192 GB/s||336 GB/s||288 GB/s||336 GB/s||448 GB/s||448 GB/s||616 GB/s|
|Power Connectors||N/A||8 Pin||8 Pin||8 Pin||8 Pin||8 Pin||8+8 Pin||8+8 Pin|
|Starting Price||$149 US||$219 US||$229 US||$279 US||$349 US||$499 US||$699 US||$999 US|
|Price (Founders Edition)||$149 US||$219 US||$229 US||$279 US||$349 US||$599 US||$799 US||$1,199 US|
|Launch||April 2019||March 2019||October 2019||February 2019||January 2019||October 2018||September 2018||September 2018|