AMD CDNA Architecture Based Arcturus GPU ‘Radeon Instinct’ Test Board Spotted – 120 CUs With 7680 Cores, 1200 MHz HBM2 Clock, 878 MHz GPU Clock

AMD's Radeon Instinct Arcturus GPU, which will feature the CDNA architecture and is aimed at the server market, has been spotted by Rogame. Featured inside the next-generation Radeon Instinct graphics cards, the CDNA architecture will leverage its compute-optimized GPU design to deliver the highest-performance compute capabilities for data centers.

AMD's CDNA Architecture Based Arcturus GPU Test Board Leaks Out - Next-Gen Radeon Instinct With 120 CUs For A Total of 7680 Cores

The AMD Arcturus GPU first leaked all the way back in 2018, before AMD had introduced any 7nm GPU. The Radeon VII and Navi lineup launched in 2019 and featured 7nm GPUs, with Navi being aimed at the mass consumer market.

It was later revealed that AMD's next-generation HPC & AI GPUs would be designed separately from the consumer-end chips. This meant that the Arcturus GPU would be kept exclusive to the datacenter market. AMD just recently confirmed in its Radeon CDNA architecture roadmap that all CDNA based GPUs would be exclusively designed for the HPC & data center markets while Radeon RDNA GPUs will power the consumer segment.

Coming to the specifications, it was previously revealed that AMD's Arcturus GPU would feature a larger cache and twice as many CUs as Vega. That, along with a list of data center specific features such as XDLOPs, Rapid Packed Math, a new Vector ALU and BFloat16, is to be expected in the Radeon Instinct cards that feature the new CDNA architecture. The previous Radeon Instinct MI100 prototype board, 'D34303', featured the Arcturus-XL die with a rated TDP of 200W and 32 GB of HBM2 VRAM clocked at around 1000-1200 MHz.

The information for this part is based on a prototype, so the final specifications are likely to differ, but here are the key points:

  • Based on Arcturus XL GPU
  • Test Board has a TDP of 200W
  • Up To 32 GB HBM2 Memory
  • HBM2 Memory Clocks Reported Between 1000-1200 MHz

Rogame has once again spotted a test board based on the Arcturus CDNA GPU. From the looks of it, this variant offers 120 CUs for a total of 7680 stream processors and a GPU clock speed of 878 MHz (750 MHz SOC clock). It also features an unspecified amount of HBM2 memory clocked at 1200 MHz, so if we are looking at a 4096-bit bus, we should get around 1.2 TB/s of bandwidth, which is what Aquabolt is able to offer. It is very likely, however, that both NVIDIA and AMD will end up using the faster HBM2E 'Flashbolt' standard, which goes into production this year and will be capable of delivering up to 1.8 TB/s of bandwidth.
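The bandwidth estimate can be checked with a quick calculation. The sketch below is only illustrative: the function name and parameters are made up for this example, the 4096-bit bus is the assumption stated in the text rather than a confirmed specification, and it relies on HBM2 being a double-data-rate interface (two transfers per memory clock).

```python
def hbm_bandwidth_gbs(memory_clock_mhz: float,
                      bus_width_bits: int,
                      transfers_per_clock: int = 2) -> float:
    """Peak memory bandwidth in GB/s (hypothetical helper for illustration).

    HBM2 transfers data on both clock edges, so a 1200 MHz clock
    corresponds to 2.4 Gbps per pin.
    """
    per_pin_gbps = memory_clock_mhz * transfers_per_clock / 1000.0
    # Multiply by the number of pins (bus width) and convert bits to bytes.
    return per_pin_gbps * bus_width_bits / 8.0


# Test board: 1200 MHz HBM2 on an assumed 4096-bit bus.
print(f"{hbm_bandwidth_gbs(1200, 4096):.1f} GB/s")
```

At 1200 MHz this works out to roughly 1229 GB/s, or about 1.2 TB/s. The same formula at 1000 MHz gives 1024 GB/s, in line with the 1 TB/s figure quoted for existing 4096-bit HBM2 parts.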

Talking about clock speeds, the test board's 878 MHz is rather low, as we have seen variants going up to 1334 MHz in the past. At 878 MHz, the chip would deliver around 13.5 TFLOPs of FP32 compute, which is lower than the Radeon Instinct MI60 and also lower than the 21 TFLOPs that we got on the previous prototype sample. It is likely that the first iteration of CDNA GPUs will end up somewhere around 25 TFLOPs of FP32.
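For reference, the FP32 figure follows from the usual peak-throughput estimate: stream processors times two floating-point operations per clock (one fused multiply-add) times the clock speed. The snippet below is a rough sketch under that assumption; the function names are illustrative, and the 64 stream processors per CU is inferred from the leaked 120 CU / 7680 core figures.

```python
STREAM_PROCESSORS_PER_CU = 64  # inferred from 120 CUs -> 7680 cores
FLOPS_PER_CLOCK = 2            # assumes one fused multiply-add per core per clock


def stream_processors(compute_units: int) -> int:
    """Total stream processors for a given CU count."""
    return compute_units * STREAM_PROCESSORS_PER_CU


def peak_fp32_tflops(compute_units: int, gpu_clock_mhz: float) -> float:
    """Theoretical peak FP32 throughput in TFLOPs (illustrative estimate)."""
    flops = stream_processors(compute_units) * FLOPS_PER_CLOCK * gpu_clock_mhz * 1e6
    return flops / 1e12


# The leaked test board (878 MHz) and the fastest clock seen so far (1334 MHz).
for clock_mhz in (878, 1334):
    print(f"{clock_mhz} MHz: {peak_fp32_tflops(120, clock_mhz):.2f} TFLOPs")
```

With 120 CUs this gives about 13.49 TFLOPs at 878 MHz and about 20.49 TFLOPs at 1334 MHz. These are theoretical peaks, not measured results.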

Based on leaks of the Ampere GPUs, which are also expected to be announced later this year, it looks like NVIDIA might hold the upper hand in terms of compute performance, as its next-generation 7nm Tesla GPU lineup is speculated to hit almost 36 TFLOPs of FP32 and 18 TFLOPs of FP64 compute.

AMD Radeon Instinct Accelerators 2020

| Accelerator Name | AMD Instinct MI300 | AMD Instinct MI250X | AMD Instinct MI250 | AMD Instinct MI210 | AMD Instinct MI100 | AMD Radeon Instinct MI60 | AMD Radeon Instinct MI50 | AMD Radeon Instinct MI25 | AMD Radeon Instinct MI8 | AMD Radeon Instinct MI6 |
|---|---|---|---|---|---|---|---|---|---|---|
| CPU Architecture | Zen 4 (Exascale APU) | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A |
| GPU Architecture | TBA (CDNA 3) | Aldebaran (CDNA 2) | Aldebaran (CDNA 2) | Aldebaran (CDNA 2) | Arcturus (CDNA 1) | Vega 20 | Vega 20 | Vega 10 | Fiji XT | Polaris 10 |
| GPU Process Node | 5nm+6nm | 6nm | 6nm | 6nm | 7nm FinFET | 7nm FinFET | 7nm FinFET | 14nm FinFET | 28nm | 14nm FinFET |
| GPU Chiplets | 4 (MCM / 3D Stacked), 1 (Per Die) | 2 (MCM), 1 (Per Die) | 2 (MCM), 1 (Per Die) | 2 (MCM), 1 (Per Die) | 1 (Monolithic) | 1 (Monolithic) | 1 (Monolithic) | 1 (Monolithic) | 1 (Monolithic) | 1 (Monolithic) |
| GPU Cores | 28,160? | 14,080 | 13,312 | 6656 | 7680 | 4096 | 3840 | 4096 | 4096 | 2304 |
| GPU Clock Speed | TBA | 1700 MHz | 1700 MHz | 1700 MHz | 1500 MHz | 1800 MHz | 1725 MHz | 1500 MHz | 1000 MHz | 1237 MHz |
| FP16 Compute | TBA | 383 TOPs | 362 TOPs | 181 TOPs | 185 TFLOPs | 29.5 TFLOPs | 26.5 TFLOPs | 24.6 TFLOPs | 8.2 TFLOPs | 5.7 TFLOPs |
| FP32 Compute | TBA | 95.7 TFLOPs | 90.5 TFLOPs | 45.3 TFLOPs | 23.1 TFLOPs | 14.7 TFLOPs | 13.3 TFLOPs | 12.3 TFLOPs | 8.2 TFLOPs | 5.7 TFLOPs |
| FP64 Compute | TBA | 47.9 TFLOPs | 45.3 TFLOPs | 22.6 TFLOPs | 11.5 TFLOPs | 7.4 TFLOPs | 6.6 TFLOPs | 768 GFLOPs | 512 GFLOPs | 384 GFLOPs |
| VRAM | 192 GB HBM3? | 128 GB HBM2e | 128 GB HBM2e | 64 GB HBM2e | 32 GB HBM2 | 32 GB HBM2 | 16 GB HBM2 | 16 GB HBM2 | 4 GB HBM1 | 16 GB GDDR5 |
| Memory Clock | TBA | 3.2 Gbps | 3.2 Gbps | 3.2 Gbps | 1200 MHz | 1000 MHz | 1000 MHz | 945 MHz | 500 MHz | 1750 MHz |
| Memory Bus | 8192-bit | 8192-bit | 8192-bit | 4096-bit | 4096-bit | 4096-bit | 4096-bit | 2048-bit | 4096-bit | 256-bit |
| Memory Bandwidth | TBA | 3.2 TB/s | 3.2 TB/s | 1.6 TB/s | 1.23 TB/s | 1 TB/s | 1 TB/s | 484 GB/s | 512 GB/s | 224 GB/s |
| Form Factor | OAM | OAM | OAM | Dual Slot Card | Dual Slot, Full Length | Dual Slot, Full Length | Dual Slot, Full Length | Dual Slot, Full Length | Dual Slot, Half Length | Single Slot, Full Length |
| Cooling | Passive Cooling | Passive Cooling | Passive Cooling | Passive Cooling | Passive Cooling | Passive Cooling | Passive Cooling | Passive Cooling | Passive Cooling | Passive Cooling |

So far, AMD has revealed that the main focus of CDNA will be performance, efficiency, features, and scalability in the data center market. Until now, AMD's GCN architecture has served this segment, but with CDNA, AMD will be creating GPUs specifically optimized for high-performance compute, machine learning, and HPC. The 1st Gen CDNA GPUs will feature the 2nd Gen Infinity Architecture and will use ROCm (Radeon Open Compute Platform) to power the data center with key optimizations and enhanced scalability. The 2nd Gen Infinity Architecture will allow for 4-8 way GPU connectivity in a single node, allowing the new Radeon Instinct boards to run in harmony.

AMD has proved that it can offer more FLOPs at a competitive price, so maybe that is what Arcturus will be targeting. There's no word on when Arcturus will land, but AMD has hinted at a Radeon Instinct product later this year which will feature its 1st Gen CDNA architecture.
