Nvidia Drive PX 2 Uses Integrated and Discrete Pascal GPU Cores – 24 DL TOPS, 8 TFLOPs and Up To 4GB GDDR5 [Updated]

Nvidia's GTC 2016 event is going on as we speak and the company has revealed quite a few details about its upcoming Drive PX 2 board which will be the flagship SKU for the automobile and autonomous industry. The board will feature both integrated and discrete Pascal GPUs on board for a peak performance of 24 DL TOPS or 8 TFLOPS.


Nvidia demos Drive PX 2 at GTC 2016 - features a CPU/GPU complex with integrated and dedicated Pascal cores

Lets start with the basic specifications. Nvidia is claiming peak performance of 8 TFLOPs (FP32) and up to 24 DL TOPS. While I am sure the first metric is familiar, the second is one of Nvidia's own design and refers to "Deep Learning - Tera-Operations Per Second". This means that the Drive PX 2 can do 24,000,000,000,000 deep learning operations per second. This is a pretty huge number and considering that the automobile industry will be leaning more and more heavily on DNNs for auto-pilot and autonomous driving applications - this is an important number.

Moving on, the CPU complex of the Drive PX 2 board is as follows: There are 2 Denver2 cores present plus 4x Cortex A57 cores. The architecture used is ARM v8 64 bit. The CPU Complex will have upto 8 GB of LPDDR4 memory (UMA) with up to 50 GB/s bandwidth.  Nvidia's 5th generation architecture, aka Pascal, features custom acceleration for deep learning and the discrete GPUs present will have up to 80 GB/s of bandwidth.

Each CPU complex will have access to its own integrated Pascal Cores (as well as a dedicated Pascal GPU over PCIe) and will be connected by a 1 Gb Ethernet connection. The discrete Pascal cores will have up to 4GB of GDDR5 memory (each) and will feature approximately 80 GB/s of bandwidth - which tells us that we are probably looking at low end (or custom) cores around the GP106 spectrum. They use a 128-bit interface that connects to four GDDR5 memory chips clocked at 1.25 GHz. The clock can go as high as 1.5 GHz for a total of 96 GB/s.

The Pascal cores used on the Drive PX 2 has a specialized instruction set that is designed to accelerate DNN performance on the go . The interface itself (of the PX 2 board) supports an IO of 70 Gigabits per second. Interestingly, Nvidia has put a lot of thought on redundancy and mission critical system safety. An ASIL-D safety micro controller is also present on the board itself. Not only that, but the hardware is AutoSAR compliant, and designed from the ground up to allow devs to take full advantage of the resources on the board.

NVIDIA Drive PX Generation Comparison:

Product NameNVIDIA Drive PXNVIDIA Drive PX 2NVIDIA Drive XavierNVIDIA Drive PegasusNVIDIA Drive AGX Orin
SOC NameTegra X1ParkerXavierXavierOrin
Process Technology20nm SOC16nm FinFET12nm FinFET12nm FinFETTBA
SOC Transistors2 Billion (Tegra X1)N/A7 Billion (Xavier)7 Billion (Xavier)17 Billion (Orin)
GPU ArchitectureMaxwell (256 Core)Pascal (256 Core)Volta (512 Core)Volta (512 Core)Ampere?
CPU16 Core ARM CPU12 Core ARM CPU8 Core ARM CPU16 Core ARM CPU12 Core ARM CPU
CPU Architecture8x Cortex A57
8x Cortex A53
4x Denver
8x Cortex A57
Carmel ARM64 8 Core CPU (8 MB L2 + 4 MB L3)Carmel ARM64 8 Core CPU (8 MB L2 + 4 MB L3)ARM Herclues Cores
Compute DLTOPsN/A20 DLTOPs30 TOPs320 TOPs200 TOPs
Total Chips2 x Tegra X12 x Tegra X2
2 x Pascal MXM GPUs
1 x Xavier2 x Volta
2 x Turing
1 x Ampere
System MemoryLPDDR48 GB LPDDR4 (50+ GB/s)16 GB 256-bit LPDDR4LPDDR4 + GDDR6N/A
Graphics MemoryN/A4 GB GDDR5 (80+ GB/s)137 GB/s1 TB/s200 GB/s

Nvidia’s last financials indicated a very strong growth of their automotive department. The reason is of course that their Tegra chips are increasingly in demand as the ultimate choice to power digital cockpit systems for various automobile vendors. Nvidia has also been aspiring to break into the ADAS business with its new Drive PX chip. Infact, Elon Musk was present at CES this year when the CEO of Nvidia demonstrated the capabilities of the Drive PX board. This caused many to speculate that the board was present inside the Tesla models. This is however, not true, and you would be forgiven for thinking that the latest Tesla vehicles contain the module.

Nvidia's previous generation of Tegra GPUs only powered the infotainment cluster aboard cars like the Tesla. It is apparent however, that Nvidia aims to change this with its successive designs - each concentrated on providing more and more power for DNN-based autonomous capabilities. In fact, the Drive PX2 board has already shipped to Tier 1 customers for an estimated price tag of $15000. That might sound like a steep price for a computer board, but considering how much a LIDAR based DNN system with actual dGPUs cost - this number amounts to mere pennies.

WccfTech Tv
Filter videos by