Intel Unveils Rialto Bridge: Next-Gen AI Successor To Ponte Vecchio Xe-HPC GPU With Up To 160 Xe Cores, Over 20,000 ALUs, OAM 2.0, Sampling in 2023

Submit

Intel has officially unveiled its next-generation successor to its flagship Xe GPU, the Ponte Vecchio, known as Rialto Bridge. The new graphics chip is designed for the next generation of AI & HPC data center segment, aiming at AMD's CDNA & NVIDIA's CUDA processors.

Intel Rialto Bridge GPU Unveiled: Successor To Ponte Vecchio With 25% More Cores, Increased Flops, Targeting AMD & NVIDIA Data Center GPUs

The Intel Rialto Bridge GPU can be seen as an upgraded version of Ponte Vecchio with more cores, more flops, more bandwidth, and more GT/s. Intel hasn't disclosed a lot of details but claims Rialto Bridge will feature up to 160 Xe cores. We don't know yet if these are the same cores as the current Ponte Vecchio GPUs or based on a brand new architecture but it looks like the latter might be true.

Intel’s Arc A380 Is The First Graphics Card To Support DisplayPort 2.0 Standard But No Monitor Out There To Make Use of It

Today we are announcing our successor to this powerhouse data center GPU, code-named Rialto Bridge. By evolving the Ponte Vecchio architecture and combining enhanced tiles with next process node technology, Rialto Bridge will offer significantly increased density, performance, & efficiency, while providing software consistency.

Intel's Rialto Bridge is named after the bridge of the same name which is the oldest of the four bridges spanning the Grand Canal in Venice, Italy. So was the case with Ponte Vecchio & it looks like the next gen that comes after Rialto will also be named after an iconic bridge. According to Intel, its Rialto Bridge GPU will power the next generation of AI & HPC Data Center solutions while aiming at the AMD CDNA & NVIDIA CUDA accelerators.

In terms of specifications, we only know that the Rialto Bridge GPU will house up to 160 Xe cores in its brand new OAM v2 form factor. But besides the unveiling of the specs, Intel also gives us a first look at the chip itself and there are some things we can dissect. The biggest change to the GPU is in its GPU die layout. While Ponte Vecchio has 16 Xe-HPC dies, each with 8 Xe cores for a total of 128 cores or 16,384 ALUs, the Rialto Bridge GPU comes with 8 16 Xe-HPC dies. So that should be 20 Xe cores per die for a total of 160 Xe cores on the 8 dies. That rounds up to 20,480 ALUs which is a 25 percent increase over its predecessor.

The rest of the Rialto Bridge GPU structure is pretty much the same as the Ponte Vecchio GPU with two Xe Link Tiles, eight HBM Tiles (HBM3) with four HBM stacks tied to each compute tile (4 Xe HPC dies), There are also the passive die stiffeners located around the Compute Tiles while the Xe Link & HBM3 Tiles are connected to the Compute Tile using an EMIB Tile. The Foveros chip interconnect is used by the Compute Tile to communicate with the rest of the Xe Dies. We don't know the actual variation of each tile yet but it should be based on the new Foveros Omni (3rd Gen) design. Also, it looks like the Rambo Cache tile is missing but it is highly possible that given the die size increase of each Compute tile, the cache is now featured on the Compute tile itself rather than have it separate on its own tile.

Intel Arc Pro A50 & Pro A40 Workstation Graphics Cards Spotted: Full ACM-G11 Alchemist GPU With 1024 Cores at 2450 MHz, 6 GB GDDR6 Memory

As for performance, Intel hasn't revealed any clear numbers and only stated that we should expect more FLOPs, GT/s, and increased bandwidth. The increased bandwidth should be coming from the upgraded HBM3 memory dies. The Ponte Vecchio GPUs are already equipped with up to 128 GB of VRAM capacities so that should be what we also see on the Rialto Bridge GPUs but Intel could stack it up even higher. The GPUs will be drop-in compatible with the existing Ponte Vecchio platforms & will feature a slightly higher TDP of 800W, versus 600W on Ponte Vecchio chips. The performance aims roughly 30-35% increase.

Following is the full Intel Rialto Bridge die configuration that we can dissect at the moment:

  • 8 Xe HPC (internal/external)
  • 2 Xe Base (internal)
  • 11 EMIB (internal)
  • 2 Xe Link (external)
  • 8 HBM (external)

Intel hasn't given any release time or details regarding the process node for the Rialto Bridge GPU but it is likely that we will hear more about it in mid-2023 when it will be sampled to first customers and a launch that aims either late 2023 or 1H of 2024.

Next-Gen Data Center GPU Accelerators

GPU NameAMD Instinct MI250XNVIDIA Hopper GH100Intel Ponte VecchioIntel Rialto Bridge
Packaging DesignMCM (Infinity Fabric)MonolithicMCM (EMIB + Foveros)MCM (EMIB + Foveros)
GPU ArchitectureAldebaran (CDNA 2)Hopper GH100Xe-HPCXe-HPC
GPU Process Node6nm4N7nm (Intel 4)5nm (Intel 3)?
GPU Cores14,08016,89616,384 ALUs
(128 Xe Cores)
20,480 ALUs
(160 Xe Cores)
GPU Clock Speed1700 MHz~1780 MHzTBATBA
L2 / L3 Cache2 x 8 MB50 MB2 x 204 MBTBA
FP16 Compute383 TOPs2000 TFLOPsTBATBA
FP32 Compute95.7 TFLOPs1000 TFLOPs~45 TFLOPs (A0 Silicon)TBA
FP64 Compute47.9 TFLOPs60 TFLOPsTBATBA
Memory Capacity128 GB HBM2E80 GB HBM3128 GB HBM2e128 GB HBM3?
Memory Clock3.2 Gbps3.2 GbpsTBATBA
Memory Bus8192-bit5120-bit8192-bit8192-bit
Memory Bandwidth3.2 TB/s3.0 TB/s~3 TB/s~3 TB/s
Form FactorOAMOAMOAMOAM v2
CoolingPassive Cooling
Liquid Cooling
Passive Cooling
Liquid Cooling
Passive Cooling
Liquid Cooling
Passive Cooling
Liquid Cooling
TDP560W700W600W800W
LaunchQ4 20212H 20222022?2024?
Submit